Werner LEMBERG <[EMAIL PROTECTED]> wrote:

> I'm searching a large word list for Thai which is freely available,
> i.e., either under a license similar to GPL (resp. compatible to the
> GPL) or in the public domain.
>
> Do you know whether such a file is available?

The ICU package includes a sorted Thai word list in a UTF-8 file called
th18057.txt.  Since you may not wish to download the whole package and I
don't know if the Thai file is available separately, I have uploaded it
(for a limited time only) to:

    http://home.adelphia.net/~dewell/th18057.txt    (334,028 bytes)

If you can process SCSU, and would appreciate a 59% reduction in file
size, try:

    http://home.adelphia.net/~dewell/th18057-scsu.txt    (135,731 bytes)

A word of warning: there is a U+FFFD (which probably means something was
corrupted) roughly 90% of the way through the file.  I don't know if
that's only in my copy or in the one distributed with ICU as well.

-Doug Ewell
 Fullerton, California



Reply via email to