On Monday 07 March 2005 20:39, Laurent Godard wrote: > Hi > > > can somebody tell me which encodings I can have in the thesaurus. I need > > cp1251, but I can gest the right name. I tried CP-1251, CP 1251, > > miccp1251, microsoft-cp 1251 but without any result. > > Comming from the myThes standalone package (see ligucomponenet project > site), the suported encodings are > ISO8859-1, ISO8859-2, ISO8859-3, ISO8859-4, ISO8859-5, ISO8859-6, > ISO8859-7, ISO8859-8, ISO8859-9, ISO8859-10, KOI8-R, CP-1251, > ISO8859-14, ISCII-DEVANAGARI > > I only tested ISO8859-1 when migrating teh french thesaurus > (if you need a migration tool from version 1 thesaurus, you'll find one > here : http://www.indesko.com/sites/en/downloads/openoffice.org_thesa/view I use this tool but it does not read CP-1251 correctly. So, converted the file to utf8 to use this tool. After that I converted the result file back in CP-1251 and then I use perl script to get the dat file. The problem is that in OOo 2.0 thesaurus didn't work. I tried to convert files in KOI8-R encoding and put this "KOI8-R" on top and then it works. But when files are in cp1251 encoding and it have "CP-1251" on top, thesaurus doesn't wok. Also I understand that case doesn't matter. So, i do not know what encoding name to put on top of my files with encoding cp1251.
-- Hristo Simeonov Hristov Leader of OpenOffice.org Bulgarian
pgpnn515m2wqU.pgp
Description: PGP signature
