On Monday 07 March 2005 20:39, Laurent Godard wrote:
> Hi
>
> > can somebody tell me which encodings I can have in the thesaurus. I need
> > cp1251, but I can gest the right name. I tried CP-1251, CP 1251,
> > miccp1251, microsoft-cp 1251 but without any result.
>
> Comming from the myThes standalone package (see ligucomponenet project
> site), the suported encodings are
> ISO8859-1, ISO8859-2, ISO8859-3, ISO8859-4, ISO8859-5, ISO8859-6,
> ISO8859-7, ISO8859-8, ISO8859-9, ISO8859-10, KOI8-R, CP-1251,
> ISO8859-14, ISCII-DEVANAGARI
>
> I only tested ISO8859-1 when migrating teh french thesaurus
> (if you need a migration tool from version 1 thesaurus, you'll find one
> here : http://www.indesko.com/sites/en/downloads/openoffice.org_thesa/view
I use this tool but it does not read CP-1251 correctly. So, converted the file 
to utf8 to use this tool. After that I converted the result file back in 
CP-1251 and then I use perl script to get the dat file. The problem is that 
in OOo 2.0 thesaurus didn't work. I tried to convert files in KOI8-R encoding 
and put this "KOI8-R" on top and then it works. But when files are in cp1251 
encoding and it have "CP-1251" on top, thesaurus doesn't wok. Also I 
understand that case doesn't matter. So, i do not know what encoding name to 
put on top of my files with encoding cp1251.

-- 
Hristo Simeonov Hristov
Leader of OpenOffice.org Bulgarian

Attachment: pgpnn515m2wqU.pgp
Description: PGP signature

Reply via email to