Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly

2009-07-08 Thread Bastian Blank
On Tue, Jul 07, 2009 at 10:58:56PM +0400, eshkin...@gmail.com wrote:
> Bastian Blank wa...@debian.org writes:
>> You have to describe _why_ it is wrong. Different languages may have
>> different collation rules.
> Yes, i try :) Yes, different languages have different collation, but
> ru_RU.UTF8 locale is unicode locale and as i can understand this table:
> http://unicode.org/charts/collation/chart_Cyrillic.html

Where is the reference to the russian language?

> why after U0453 ? This is mistake, IMHO.

Please present evidence in form of official documents for the russian
language. According to wikipedia[1], this letter is not used in russian
anyway.

Bastian

[1]: http://en.wikipedia.org/wiki/List_of_Cyrillic_letters

-- 
Warp 7 -- It's a law we can live with.



-- 
To UNSUBSCRIBE, email to debian-bugs-dist-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org



Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly

2009-07-08 Thread Сергей Бурладян
Bastian Blank wa...@debian.org writes:
> On Tue, Jul 07, 2009 at 10:58:56PM +0400, eshkin...@gmail.com wrote:
>> Bastian Blank wa...@debian.org writes:
>>> You have to describe _why_ it is wrong. Different languages may have
>>> different collation rules.
>> Yes, i try :) Yes, different languages have different collation, but
>> ru_RU.UTF8 locale is unicode locale and as i can understand this table:
>> http://unicode.org/charts/collation/chart_Cyrillic.html
>
> Where is the reference to the russian language?

I don't know; do you really need it? Please see below.

>> why after U0453 ? This is mistake, IMHO.
>
> Please present evidence in form of official documents for the russian
> language. According to wikipedia[1], this letter is not used in russian
> anyway.

Yes, this is what I was trying to say :) The Russian locale defines this
non-Russian (Ukrainian) letter (why?), and defines it at an IMHO very
strange position: after U0434, because U0453 is after U0434
in Unicode ( http://unicode.org/charts/collation/chart_Cyrillic.html ).

I assume that someone tried to implement correct Ukrainian collation
in the Russian locale, but did it incorrectly... This is not a problem for
Russian text, because Russian does not use this letter.

If the definition of this U0491 letter can be removed from the Russian locale,
or placed at the correct position, as in the Ukrainian locale (before U0434 at
least), then Ukrainian collation rules will work properly with the Russian
locale too.

It would be useful IMHO.

BTW, as you can see in [1], U0491 (ґ) is located _before_ U0434 (д).

> [1]: http://en.wikipedia.org/wiki/List_of_Cyrillic_letters

-- 
Best regards, Sergey Burladyan






Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly

2009-07-07 Thread Sergey Burladyan
Package: locales
Version: 2.9-12
Severity: normal


The ru_RU.UTF8 locale collates UKR-GHE (U0491 and U0490) incorrectly; here is
an example:

wrong:
s...@seb:~$ (export LANG=ru_RU.UTF-8; echo абвгґдеєжзиіїйклмнопрстуфхцчшщьюя | sed -e 's/\(.\)/\1\n/g' | sort | head)

а
б
в
г
д
ґ
е
є
ж

correct:
s...@seb:~$ (export LANG=uk_UA.UTF-8; echo абвгґдеєжзиіїйклмнопрстуфхцчшщьюя | sed -e 's/\(.\)/\1\n/g' | sort | head)

а
б
в
г
ґ
д
е
є
ж

correct:
s...@seb:~$ (export LANG=en_US.UTF-8; echo абвгґдеєжзиіїйклмнопрстуфхцчшщьюя | sed -e 's/\(.\)/\1\n/g' | sort | head)

а
б
в
г
ґ
д
е
є
ж
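
The comparison above can be narrowed to the three letters in question. This is only a minimal sketch, assuming a POSIX shell with `sort` and `tr`, and assuming the named locales are installed (if one is missing, `sort` may silently fall back to byte order). `sorted_in` is a hypothetical helper name, not part of any package.

```shell
#!/bin/sh
# sorted_in LOCALE LETTER...
# Hypothetical helper: sort the given letters under LOCALE and
# print the result on a single line.
sorted_in() {
    loc=$1
    shift
    printf '%s\n' "$@" | LC_ALL="$loc" sort | tr -d '\n'
    echo
}

# Baseline: the C locale sorts by byte value, which for UTF-8 text is
# the same as code point order (г U+0433, д U+0434, ґ U+0491).
sorted_in C г ґ д

# The two locales from the report; the output depends on the locale
# data installed on the machine.
sorted_in ru_RU.UTF-8 г ґ д
sorted_in uk_UA.UTF-8 г ґ д
```

The C baseline is useful for telling apart a locale that really defines a position for ґ from one that is not installed at all.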


-- System Information:
Debian Release: squeeze/sid
  APT prefers testing
  APT policy: (800, 'testing'), (800, 'stable'), (70, 'unstable'), (65, 'experimental')
Architecture: i386 (i686)

Kernel: Linux 2.6.26-2-686 (SMP w/1 CPU core)
Locale: LANG=ru_RU.UTF-8, LC_CTYPE=ru_RU.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages locales depends on:
ii  debconf [debconf-2.0] 1.5.26 Debian configuration management sy
ii  libc6 [glibc-2.9-1]   2.9-4  GNU C Library: Shared libraries

locales recommends no packages.

locales suggests no packages.

-- debconf information:
* locales/default_environment_locale: ru_RU.UTF-8
* locales/locales_to_be_generated: en_GB ISO-8859-1, en_GB.ISO-8859-15 
ISO-8859-15, en_GB.UTF-8 UTF-8, en_US ISO-8859-1, en_US.ISO-8859-15 
ISO-8859-15, en_US.UTF-8 UTF-8, ru_RU ISO-8859-5, ru_RU.CP1251 CP1251, 
ru_RU.KOI8-R KOI8-R, ru_RU.UTF-8 UTF-8, ru_UA.UTF-8 UTF-8, uk_UA.UTF-8 UTF-8






Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly

2009-07-07 Thread Bastian Blank
On Tue, Jul 07, 2009 at 06:01:15PM +0400, Sergey Burladyan wrote:
> ru_RU.UTF8 locale collate UKR-GHE (U0491 and U0490) incorrectly, here is
> example:

You have to describe _why_ it is wrong. Different languages may have
different collation rules.

Bastian

-- 
War isn't a good life, but it's life.
-- Kirk, A Private Little War, stardate 4211.8






Bug#536085: locales: ru_RU.UTF8 collate UKR-GHE incorrectly

2009-07-07 Thread Сергей Бурладян
Bastian Blank wa...@debian.org writes:

> On Tue, Jul 07, 2009 at 06:01:15PM +0400, Sergey Burladyan wrote:
>> ru_RU.UTF8 locale collate UKR-GHE (U0491 and U0490) incorrectly, here is
>> example:
>
> You have to describe _why_ it is wrong. Different languages may have
> different collation rules.

Yes, I'll try :) Yes, different languages have different collation, but
ru_RU.UTF8 is a Unicode locale, and as far as I understand this table:
http://unicode.org/charts/collation/chart_Cyrillic.html

U0491 must be after U0433 and before U0434

but /usr/share/i18n/locales/ru_RU defines it like this:
reorder-after U0453

Why after U0453? This is a mistake, IMHO.
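
To see that rule in context, the relevant lines can be pulled out of the locale source. This is only a minimal sketch, assuming a `grep` that supports `-A` (GNU and BSD grep do); `show_reorder` is a hypothetical helper name, and the path is the one quoted above.

```shell
#!/bin/sh
# show_reorder FILE [CONTEXT]
# Hypothetical helper: print every "reorder-after" directive in a locale
# source file, with line numbers and CONTEXT following lines (default 3).
show_reorder() {
    grep -n -A "${2:-3}" 'reorder-after' "$1"
}

# Path taken from the message above; do nothing if the file is absent.
src=/usr/share/i18n/locales/ru_RU
if [ -r "$src" ]; then
    show_reorder "$src" || echo "no reorder-after directive found in $src"
fi
```

Running the same helper on the Ukrainian locale source would allow a side-by-side look at where each file places the letter.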

-- 
Best regards, Sergey Burladyan


