Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly
On Tue, Jul 07, 2009 at 10:58:56PM +0400, eshkin...@gmail.com wrote:
> Bastian Blank wa...@debian.org writes:
> > You have to describe _why_ it is wrong. Different languages may have different collation rules.
> Yes, I will try :) Yes, different languages have different collation, but the ru_RU.UTF8 locale is a Unicode locale, and as far as I can understand from this table: http://unicode.org/charts/collation/chart_Cyrillic.html

Where is the reference to the Russian language?

> Why after U0453? This is a mistake, IMHO.

Please present evidence in the form of official documents for the Russian language. According to Wikipedia[1], this letter is not used in Russian anyway.

Bastian

[1]: http://en.wikipedia.org/wiki/List_of_Cyrillic_letters
--
Warp 7 -- It's a law we can live with.

--
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly
2009-07-08
Сергей Бурладян
Bastian Blank wa...@debian.org writes:
> On Tue, Jul 07, 2009 at 10:58:56PM +0400, eshkin...@gmail.com wrote:
> > Bastian Blank wa...@debian.org writes:
> > > You have to describe _why_ it is wrong. Different languages may have different collation rules.
> > Yes, I will try :) Yes, different languages have different collation, but the ru_RU.UTF8 locale is a Unicode locale, and as far as I can understand from this table: http://unicode.org/charts/collation/chart_Cyrillic.html
> Where is the reference to the Russian language?

I don't know; do you really need it? Please see below.

> > Why after U0453? This is a mistake, IMHO.
> Please present evidence in the form of official documents for the Russian language. According to Wikipedia[1], this letter is not used in Russian anyway.

Yes, this is what I was trying to say :) The Russian locale defines this non-Russian (Ukrainian) letter (why?), and defines it at what is IMHO a very strange position: after U0434, because U0453 is after U0434 in Unicode ( http://unicode.org/charts/collation/chart_Cyrillic.html ). I assume that someone tried to implement correct Ukrainian collation in the Russian locale, but did it incorrectly...

This is not a problem for Russian text, because Russian does not use this letter. If the definition of the U0491 letter could be removed from the Russian locale, or placed at the correct position as in the Ukrainian locale (before U0434 at least), then Ukrainian collation rules would work properly with the Russian locale too. It would be useful, IMHO.

BTW, as you can see in [1], U0491 (ґ) is located _before_ U0434 (д).

[1]: http://en.wikipedia.org/wiki/List_of_Cyrillic_letters

--
Best regards, Сергей Бурладян
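For readers following the code points named in this message, a small sketch that maps each letter to its code point may help. It is only an illustration, not part of the original thread: the helper name codepoint is hypothetical, and it assumes iconv and od are available and that the script is saved as UTF-8.

```shell
#!/bin/sh
# Sketch: print the Unicode code point of the letters discussed in the thread.
# Assumptions: `iconv` and `od` are installed; this file is UTF-8 encoded.
# `codepoint` is a hypothetical helper name used only for this illustration.

# codepoint CHAR: print the code point of a single UTF-8 character as U+XXXX.
codepoint() {
    # Convert to big-endian UTF-32 (4 bytes, no BOM) and dump the bytes as hex.
    hex=$(printf '%s' "$1" | iconv -f UTF-8 -t UTF-32BE | od -An -tx1 | tr -d ' \n')
    printf 'U+%04X\n' "0x$hex"
}

# The letters named in the messages: U0433, U0434, U0453, U0490, U0491.
for ch in г д ѓ Ґ ґ; do
    printf '%s %s\n' "$ch" "$(codepoint "$ch")"
done
```

Note that this shows raw code point order only; collation order is defined by the locale's LC_COLLATE rules and need not follow it.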
Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly
Package: locales
Version: 2.9-12
Severity: normal

The ru_RU.UTF8 locale collates UKR-GHE (U0491 and U0490) incorrectly. Here is an example:

wrong:
s...@seb:~$ (export LANG=ru_RU.UTF-8; echo абвгґдеєжзиіїйклмнопрстуфхцчшщьюя | sed -e 's/\(.\)/\1\n/g' | sort | head)
а
б
в
г
д
ґ
е
є
ж

correct:
s...@seb:~$ (export LANG=uk_UA.UTF-8; echo абвгґдеєжзиіїйклмнопрстуфхцчшщьюя | sed -e 's/\(.\)/\1\n/g' | sort | head)
а
б
в
г
ґ
д
е
є
ж

correct:
s...@seb:~$ (export LANG=en_US.UTF-8; echo абвгґдеєжзиіїйклмнопрстуфхцчшщьюя | sed -e 's/\(.\)/\1\n/g' | sort | head)
а
б
в
г
ґ
д
е
є
ж

-- System Information:
Debian Release: squeeze/sid
  APT prefers testing
  APT policy: (800, 'testing'), (800, 'stable'), (70, 'unstable'), (65, 'experimental')
Architecture: i386 (i686)

Kernel: Linux 2.6.26-2-686 (SMP w/1 CPU core)
Locale: LANG=ru_RU.UTF-8, LC_CTYPE=ru_RU.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages locales depends on:
ii  debconf [debconf-2.0]  1.5.26  Debian configuration management sy
ii  libc6 [glibc-2.9-1]    2.9-4   GNU C Library: Shared libraries

locales recommends no packages.

locales suggests no packages.

-- debconf information:
* locales/default_environment_locale: ru_RU.UTF-8
* locales/locales_to_be_generated: en_GB ISO-8859-1, en_GB.ISO-8859-15 ISO-8859-15, en_GB.UTF-8 UTF-8, en_US ISO-8859-1, en_US.ISO-8859-15 ISO-8859-15, en_US.UTF-8 UTF-8, ru_RU ISO-8859-5, ru_RU.CP1251 CP1251, ru_RU.KOI8-R KOI8-R, ru_RU.UTF-8 UTF-8, ru_UA.UTF-8 UTF-8, uk_UA.UTF-8 UTF-8
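The comparison in this report can be wrapped in a small script so that the three locales are checked in one run. This is a sketch rather than the reporter's own method: the function names has_locale and sorted_head are hypothetical, it assumes GNU sed (for \n in the replacement, as in the report's commands), and it skips any locale that has not been generated on the system. Results will depend on the installed glibc version.

```shell
#!/bin/sh
# Sketch: sort the same Cyrillic letters under several locales and print the
# first nine letters of each result on one line, for easy comparison.
# Assumptions: GNU sed; `has_locale` and `sorted_head` are hypothetical
# helper names; locales missing from `locale -a` are skipped.

letters='абвгґдеєжзиіїйклмнопрстуфхцчшщьюя'

# has_locale NAME: succeed if `locale -a` lists NAME. Case and hyphens are
# ignored, since ru_RU.UTF-8 is usually listed as ru_RU.utf8.
has_locale() {
    want=$(printf '%s' "$1" | tr 'A-Z' 'a-z' | sed 's/-//g')
    locale -a 2>/dev/null | tr 'A-Z' 'a-z' | sed 's/-//g' | grep -qxF "$want"
}

# sorted_head LOCALE STRING: split STRING into one character per line, sort
# it under LOCALE, drop the empty line, and join the first nine results.
sorted_head() {
    printf '%s\n' "$2" \
        | LC_ALL=$1 sed -e 's/\(.\)/\1\n/g' \
        | LC_ALL=$1 sort \
        | grep -v '^$' \
        | head -n 9 \
        | paste -s -d ' ' -
}

for loc in ru_RU.UTF-8 uk_UA.UTF-8 en_US.UTF-8; do
    if has_locale "$loc"; then
        printf '%s: %s\n' "$loc" "$(sorted_head "$loc" "$letters")"
    else
        printf '%s: not generated on this system, skipped\n' "$loc"
    fi
done
```

LC_ALL is used instead of LANG here so that any LC_* variables already set in the environment cannot override the locale under test.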
Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly
On Tue, Jul 07, 2009 at 06:01:15PM +0400, Sergey Burladyan wrote:
> The ru_RU.UTF8 locale collates UKR-GHE (U0491 and U0490) incorrectly. Here is an example:

You have to describe _why_ it is wrong. Different languages may have different collation rules.

Bastian
--
War isn't a good life, but it's life.
		-- Kirk, "A Private Little War", stardate 4211.8
Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly
2009-07-07
Сергей Бурладян
Bastian Blank wa...@debian.org writes:
> On Tue, Jul 07, 2009 at 06:01:15PM +0400, Sergey Burladyan wrote:
> > The ru_RU.UTF8 locale collates UKR-GHE (U0491 and U0490) incorrectly. Here is an example:
> You have to describe _why_ it is wrong. Different languages may have different collation rules.

Yes, I will try :) Yes, different languages have different collation, but the ru_RU.UTF8 locale is a Unicode locale, and as far as I can understand from this table:
http://unicode.org/charts/collation/chart_Cyrillic.html
U0491 must come after U0433 and before U0434, but /usr/share/i18n/locales/ru_RU defines it like this:

reorder-after U0453

Why after U0453? This is a mistake, IMHO.

--
Best regards, Сергей Бурладян
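To see exactly which entries a locale source moves with reorder-after, the relevant blocks can be printed with their line numbers. The following is a minimal sketch, assuming the locale sources are installed at the path quoted in the message and that each block is closed by a reorder-end line, as in glibc locale definition files. The helper name show_reorder is hypothetical.

```shell
#!/bin/sh
# Sketch: list every reorder-after block in a glibc locale source file.
# Assumptions: the file follows the glibc locale source format, where a
# `reorder-after` line opens a block and `reorder-end` closes it.
# `show_reorder` is a hypothetical helper name used only for this example.

# show_reorder FILE: print each line from a `reorder-after` line through the
# following `reorder-end` line, prefixed with its line number in FILE.
show_reorder() {
    awk '/reorder-after/ { show = 1 }
         show            { print FNR ": " $0 }
         /reorder-end/   { show = 0 }' "$1"
}

# Path taken from the message above; it may differ on other systems.
src=/usr/share/i18n/locales/ru_RU
if [ -r "$src" ]; then
    show_reorder "$src"
else
    echo "$src not readable (locale sources may not be installed)" >&2
fi
```

Running the same function on the uk_UA source in the same directory would allow the two placements of U0491 to be compared directly.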