Re: Unicode::Collate question

Rich Thu, 04 Dec 2003 09:35:58 -0800

Sadahiro Tomoyuki wrote:

> 
>> So I guess I need a Ligua:XX::Sort module for each language I operate
>> on,
>> in my original posting I was misled to believe that Unicode::Collate
>> would
>> be the tool to use.
>> 
>> Thanks to all for the useful links provided in this thread.
> 
> As far as I found, CPAN provides at least five modules
> for collation localized for a specific natural language:
> [package name, language name, encoding]
> 
> No::Sort, Norwegian, ISO-8859-1
>     http://search.cpan.org/~gaas/Norge-1.07/
> 
> Cz::Sort, Czech, ISO-8859-2
>     http://search.cpan.org/~janpaz/Cstools-3.42/
> 
> Lingua::Klingon::Collate, Klingon, ASCII/EBCDIC (Perl native)
>     http://search.cpan.org/~pne/Lingua-Klingon-Collate-1.01/
> 
> Lingua::JA::Sort::JIS, Japanese, UTF-8
>     http://search.cpan.org/~sadahiro/Lingua-JA-Sort-JIS-0.04/
> 
> ShiftJIS::Collate, Japanese, Shift-JIS
>     http://search.cpan.org/~sadahiro/ShiftJIS-Collate-1.02/
> 
> Regards,
> SADAHIRO Tomoyuki


Has anyone had a look at the OpenI18N/ICU locale data?

The locales there are all UTF-8 and have java rule based collation data, so
they *might* be useful for creating a more comprehensive (and accurate) set
of sort modules? The downside is this data is pretty rough ATM but does
seem to be improving slowly.

I guess p6 is going to use ICU as the basis for I18N - sure hope the APIs
are easier though :)

Cheers
-- 
Rich
[EMAIL PROTECTED]

Re: Unicode::Collate question

Reply via email to