On Sun, 22 Aug 2004, Aitor Santamarķa Merino wrote:

> Yes, but as I mentioned, the problem is that, how do we get collating
> tables, etc for other countries than US and Germany? We could rely on
> user's efforts to create those tables, but this can be quite laborious

Probably best to have a look at the locale tables in Linux. If one can
make an automatic conversion from there it would go a long way (probably
not 100% MS compatible but is that desirable for collation algorithms? --
they've probably changed themselves too).

On Linux we have in
/usr/share/i18n/locales
a file called iso14651_1 which has the Unicode collating table (you can
search the internet for iso 14651). That on its own would go a long way
(good enough for English, German, Dutch, Italian -- Spanish has some
exceptions with ch and ll, Lithuanian has y between i and k). Then some
individual country/language combinations do a few modifications on that.

e.g. a fragment of bg_BG:

LC_COLLATE

% We have made the following changes to the basic collation scheme in
% the file iso14651_t1:
%   1. The Cyrillic script is first in the order.
%   2. The non-Bulgarian Cyrillic letters are sorted according to
%      their transliteration with Bulgarian Cyrillic letters.

copy "iso14651_t1"
reorder-after <9>
<CYR-A>
<CYR-BE>
<CYR-VE>
<CYR-GHE>
[...]

It's still a lot of work but it's not that the information itself isn't
there.

Bart


-------------------------------------------------------
SF.Net email is sponsored by Shop4tech.com-Lowest price on Blank Media
100pk Sonic DVD-R 4x for only $29 -100pk Sonic DVD+R for only $33
Save 50% off Retail on Ink & Toner - Free Shipping and Free Gift.
http://www.shop4tech.com/z/Inkjet_Cartridges/9_108_r285
_______________________________________________
Freedos-kernel mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/freedos-kernel

Reply via email to