Thank you. That did it.

Kind regards,
Rudolf




Hussein Shafie <hussein at xmlmind.com> 
Sent by: xmleditor-support-bounces at xmlmind.com
13-10-2008 10:15
Please respond to
"xmleditor-support at xmlmind.com" <xmleditor-support at xmlmind.com>


To
Rudolf de Grijs <rdegrijs at epo.org>
cc
"xmleditor-support at xmlmind.com" <xmleditor-support at xmlmind.com>
Subject
Re: [XXE] Uanble to create dictionary Dutch language






Rudolf de Grijs wrote:
> 
> Could someone explain what I'm doing wrong?
> 

Your encoding is almost certainly correct and you correctly specified it 
by using command line option "-cs UTF-8". However you didn't specify 
what characters are allowed in a word.

You need to create a hints file (Use UTF-8 here too. Use command line 
option "-hints hints_file" after "-cs UTF-8") containing something like 
this:

%chars ??????????

(What's above is for *French*).

This is really needed because by default, only the ASCII uppercase and 
lowercase letters, digits, hyphen and dot are declared as acceptable 
``word characters''.

More information http://www.xmlmind.com/_dictbuilder/doc/hints_file.html




> I try to create a new dictionary for the Dutch language. When I try to 
> create this dictionary with an UTF-8 encoded word list, all words with 
> diacretic characters are rejected. I'm pretty sure that the encoding is 
> correct.
> 
> I'm using the following command:
> 
> /dictbuilder -cs UTF-8 basiswoorden290507-utf8.txt -o dutch.cdi/
> 
> Here follows a partial list of the result that appears on the sonsole
> 
> Cannot add word 'wijkco?rdinator'
> Cannot add word 'wildwaterkano?n'
> Cannot add word 'woningco?peraties'
> Cannot add word 'wooncarri?re'
> Cannot add word 'wrijvingsco?ffici?nt'
> Cannot add word 'zalfoli?n'
> Cannot add word 'zangcarri?re'
> Cannot add word 'zeehondencr?che'
> Cannot add word 'zelfgecre?erde'
> Cannot add word 'zenderco?rdinator'
> Cannot add word 'zenuwpati?nt'
> Cannot add word 'zenuwpati?nte'
> Cannot add word 'ziekenfondspati?nt'
> Cannot add word 'ziekenhuispati?nt'
> Cannot add word 'zonnebrandcr?me'
> Cannot add word 'zonnecr?me'
> Cannot add word 'zorgco?rdinator'
> Cannot add word 'zo?geografie'
> Cannot add word 'zo?logie'
> Cannot add word 'zo?logisch'
> Cannot add word 'zo?loog'
> Cannot add word 'zo?morf'
> Cannot add word 'zo?morfe'
> Cannot add word 'zo?plankton'
> Cannot add word 'zuivelco?peratie'
> Cannot add word 'zwemcarri?re'
> Cannot add word 'z?ta'
> Cannot add word '?landseilanden'
> Cannot add word '?'
> Cannot add word '?'
> Cannot add word '?'
> Cannot add word '?'
> Cannot add word '?'
> Cannot add word '?'
> Cannot add word '?ch?ance'
> Cannot add word '?l?gance'
> Cannot add word '?minence'
> Cannot add word '??n'
> Cannot add word '?re'
> Cannot add word '?berhaupt'
> Cannot add word '?bermensch'
> 
> Kind regards,
> 
> Rudolf de Grijs
> 
> 
> ------------------------------------------------------------------------
> 
> 
> --
> XMLmind XML Editor Support List
> xmleditor-support at xmlmind.com
> http://www.xmlmind.com/mailman/listinfo/xmleditor-support


 
--
XMLmind XML Editor Support List
xmleditor-support at xmlmind.com
http://www.xmlmind.com/mailman/listinfo/xmleditor-support
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
http://www.xmlmind.com/pipermail/xmleditor-support/attachments/20081013/3b81c1e7/attachment.htm
 

Reply via email to