E501: Combining diacritics counted against line length #856

reynoldsnlp · 2019-03-22T22:09:51Z

It looks like unicode combining diacritics are counted as full characters, which causes E501 to fire, even when it is visually less than 79 characters:

text = ['ся́ду', 'твёрдый', 'ю́бкой', 'ба́юшки', 'опя́ть', 'съе́сть', 'жи́ть', 'éли']
a = 'а́́́́́́́́́́́́́́́́́́́'

It seems to me that combining characters should be excluded from the count.

The text was updated successfully, but these errors were encountered:

sigmavirus24 · 2019-03-23T14:14:58Z

Which version of python is pycodestyle running on and which version of pycodestyle are you using?

reynoldsnlp · 2019-03-27T17:20:30Z

Python 3.6.7
pycodestyle==2.3.1

asottile · 2019-03-27T17:26:10Z

shouldn't be too hard to implement using unicodedata:

>>> s = "a = 'а́́́́́́́́́́''а́́́́́́́́́́''а́́́́́́́́́́''а́́́́́́́́́́'"
>>> len(s)
65
>>> sum(not unicodedata.combining(c) for c in s)
16

I also alphabetized the import statements by moving `bisect` to the top.

reynoldsnlp added a commit to reynoldsnlp/pycodestyle that referenced this issue May 30, 2019

Fix PyCQA#856

ed954b4

I also alphabetized the import statements by moving `bisect` to the top.

asottile changed the title ~~E501 Combining diacritics counted against line length~~ E501: Combining diacritics counted against line length Jun 14, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

E501: Combining diacritics counted against line length #856

E501: Combining diacritics counted against line length #856

reynoldsnlp commented Mar 22, 2019

sigmavirus24 commented Mar 23, 2019

reynoldsnlp commented Mar 27, 2019

asottile commented Mar 27, 2019

E501: Combining diacritics counted against line length #856

E501: Combining diacritics counted against line length #856

Comments

reynoldsnlp commented Mar 22, 2019

sigmavirus24 commented Mar 23, 2019

reynoldsnlp commented Mar 27, 2019

asottile commented Mar 27, 2019