On Tue, Nov 14, 2017 at 10:36 PM, Nikolai Aleksandrovich Pavlov
wrote:
> @tonymec @sergeevabc did not only use that notation for Cyrillic:
Ah, all right; I read too fast (as BTW I often do), sorry.
> . Also it is implementation detail. And grep with glibc’s ERE or BRE which
> do follow LC_COLLAT
* Marvin Renich [171114 14:57]:
> It suggests using [[:lower:][:upper:]] to do something close to what you
> want (it will also find non-Russian letters). The help does not mention
> any character class that includes exactly Russian letters, so the best
> you are going to be able to do is [А-яЁё]
* sergeevabc [171114 08:15]:
> @10110111, stumbled upon your comment accidentally and decided to test on my
> end.
> ```
> $ set LC_ALL=ru_RU.utf8
>
> $ grep --version
> grep (GNU grep) 3.0
>
> $ echo Ёжик под зелёной ёлкой. | grep --color "[а-яА-Я ]"
> Ёжик под зелёной ёлкой.
> ^ ^
This (not catching Ёё with [А-Яа-я]) is expected when $LC_COLLATE is a
locale with no knowledge of Cyrillic alphabetization, for instance C:
Ё U+0401 CYRILLIC CAPITAL LETTER IO
А U+0410 CYRILLIC CAPITAL LETTER A
Я U+042F CYRILLIC CAPITAL LETTER YA
а U+0430 CYRILLIC SMALL LETTER A
я U+044F CYRILLIC
2017-06-07 16:47 GMT+03:00 Ruslan Kabatsayev :
> Not correct, not semantically. What Vim does is deliberately ignoring
> LC_COLLATE while grep acts according to this category which puts “ё”
> between “е” and “ж” like in the Russian alphabet.
>
> I'm not really sure that LC_COLLATE or even any LC_*
2017-06-07 14:17 GMT+03:00 Christian Brabandt :
> Hm, ё is 'ё' U+0451 Dec:1105 CYRILLIC SMALL LETTER IO (io) ё, while я
> is
> 'я' U+044F Dec:1103 CYRILLIC SMALL LETTER YA (ja) я.
> Also Ё is 'Ё' U+0401 Dec:1025 CYRILLIC CAPITAL LETTER IO (IO) Ё, while
> А is
> 'А' U+0410 Dec:1040 CYRILLIC CAPITAL