Thank you. That did it. Kind regards, Rudolf
Hussein Shafie <hussein at xmlmind.com> Sent by: xmleditor-support-bounces at xmlmind.com 13-10-2008 10:15 Please respond to "xmleditor-support at xmlmind.com" <xmleditor-support at xmlmind.com> To Rudolf de Grijs <rdegrijs at epo.org> cc "xmleditor-support at xmlmind.com" <xmleditor-support at xmlmind.com> Subject Re: [XXE] Uanble to create dictionary Dutch language Rudolf de Grijs wrote: > > Could someone explain what I'm doing wrong? > Your encoding is almost certainly correct and you correctly specified it by using command line option "-cs UTF-8". However you didn't specify what characters are allowed in a word. You need to create a hints file (Use UTF-8 here too. Use command line option "-hints hints_file" after "-cs UTF-8") containing something like this: %chars ?????????? (What's above is for *French*). This is really needed because by default, only the ASCII uppercase and lowercase letters, digits, hyphen and dot are declared as acceptable ``word characters''. More information http://www.xmlmind.com/_dictbuilder/doc/hints_file.html > I try to create a new dictionary for the Dutch language. When I try to > create this dictionary with an UTF-8 encoded word list, all words with > diacretic characters are rejected. I'm pretty sure that the encoding is > correct. > > I'm using the following command: > > /dictbuilder -cs UTF-8 basiswoorden290507-utf8.txt -o dutch.cdi/ > > Here follows a partial list of the result that appears on the sonsole > > Cannot add word 'wijkco?rdinator' > Cannot add word 'wildwaterkano?n' > Cannot add word 'woningco?peraties' > Cannot add word 'wooncarri?re' > Cannot add word 'wrijvingsco?ffici?nt' > Cannot add word 'zalfoli?n' > Cannot add word 'zangcarri?re' > Cannot add word 'zeehondencr?che' > Cannot add word 'zelfgecre?erde' > Cannot add word 'zenderco?rdinator' > Cannot add word 'zenuwpati?nt' > Cannot add word 'zenuwpati?nte' > Cannot add word 'ziekenfondspati?nt' > Cannot add word 'ziekenhuispati?nt' > Cannot add word 'zonnebrandcr?me' > Cannot add word 'zonnecr?me' > Cannot add word 'zorgco?rdinator' > Cannot add word 'zo?geografie' > Cannot add word 'zo?logie' > Cannot add word 'zo?logisch' > Cannot add word 'zo?loog' > Cannot add word 'zo?morf' > Cannot add word 'zo?morfe' > Cannot add word 'zo?plankton' > Cannot add word 'zuivelco?peratie' > Cannot add word 'zwemcarri?re' > Cannot add word 'z?ta' > Cannot add word '?landseilanden' > Cannot add word '?' > Cannot add word '?' > Cannot add word '?' > Cannot add word '?' > Cannot add word '?' > Cannot add word '?' > Cannot add word '?ch?ance' > Cannot add word '?l?gance' > Cannot add word '?minence' > Cannot add word '??n' > Cannot add word '?re' > Cannot add word '?berhaupt' > Cannot add word '?bermensch' > > Kind regards, > > Rudolf de Grijs > > > ------------------------------------------------------------------------ > > > -- > XMLmind XML Editor Support List > xmleditor-support at xmlmind.com > http://www.xmlmind.com/mailman/listinfo/xmleditor-support -- XMLmind XML Editor Support List xmleditor-support at xmlmind.com http://www.xmlmind.com/mailman/listinfo/xmleditor-support -------------- next part -------------- An HTML attachment was scrubbed... URL: http://www.xmlmind.com/pipermail/xmleditor-support/attachments/20081013/3b81c1e7/attachment.htm

