Hi,

2009/2/2 Sunday Bolaji <[email protected]>

> Hi,
>     I have tried your suggestion on temporary solution to unicode
> normilisation
> and it worked but one thing is not clear to me, are we going to have
> separate dictionary for all the with different encoding or are we
> putting in our dictionary file.
> Another thing i observed with
> hunspell is that if the number characters of correct word in the
> dictionary file is more than the characters of word wrongly type,
> hunspell will suggest diffreent word of the same length as wrong word.
> Examples are given below :
> (1)
> "jókòó" is the correct word in the dictionary, but it will not suggest
> it if i type "joke" despite specified in the REP table to replace " o"
> with " òó ". it will only suggest " jókòó" if the wrong type word is "
> jokoo "


> (2) " ọ̀rọ̀ " is the correct word in the dictionary, but it will not
> suggest it ,if " ọrọ " is type despite specified in the REP table to replace
> " ọ " with " ọ̀ ". And this is due to that  " ọ " is a precomposed single
> character and " ọ̀ "
>  and is combination of " ọ " and tone mark. The REP table is shown for
> similar characters. Please is there anything i can to solve this problem.


REP and MAP suggestions are not combined with similarity algorithms, unlike
the PHONE and ph: phonetic suggestions.

Check the following suggestion parameters:

-- affix file ---
PHONE 4
PHONE ó o
PHONE ò o
PHONE ọ̀ o
PHONE ọ o

Hunspell will convert "jókòó" to "jokoo" before comparing with the input
word "joke".
You can use PHONE for normalization, too. Unfortunately, there was a
potential problem with PHONE and diacritics under Windows, so it better to
use ph: fields (separated by tabulators) for OpenOffice.org 3.0. Also ph:
can work better for bigger word differences, too.

--- dic file ----
jókòó  ph:joko

 ọ̀rọ̀ ph:oro


Regards,
László





>
>
>
>
> REP  94
>
> REP  a  à
>
> REP  à  á
>
> REP  a  á
>
> REP  á  à
>
> REP  a  àà
>
> REP  à  àà
>
> REP  a  àá
>
> REP  à  àá
>
> REP  á  àá
>
> REP  a  áà
>
> REP  à  áà
>
> REP  á  áà
>
> REP  a  aa
>
> REP  a  aá
>
> REP  ai  àì
>
> REP  ai  a
>
> REP  ài
> à
>
> REP  ái  á
>
> REP  e  è
>
> REP  è  é
>
> REP  e  é
>
> REP  é  è
>
> REP  e  ẹ̀
>
> REP  e  ẹ́
>
> REP  ẹ  ẹ̀
>
> REP  ẹ̀  ẹ́
>
> REP  ẹ  ẹ́
>
> REP  ẹ́  ẹ̀
>
> REP  e  ẹ
>
> REP  è  ẹ̀
>
> REP  é  ẹ́
>
> REP  e  èè
>
> REP  è  èè
>
> REP  e  éè
>
> REP  e  éé
>
> REP  é  éé
>
> REP  e  èé
>
> REP  e  eé
>
> REP  e  ee
>
> REP  ẹ́  ẹ́ẹ̀
>
> REP  e  ẹ́ẹ̀
>
> REP  ẹ  ẹ́ẹ̀
>
> REP  e  ẹ̀ẹ̀
>
> REP  ẹ  ẹ̀ẹ̀
>
> REP  ẹ  ẹ̀ẹ́
>
> REP  e  ẹ̀ẹ́
>
> REP  e  ẹẹ
>
> REP  ẹ  ẹẹ
>
> REP  i  ì
>
> REP  ì  í
>
> REP  i  í
>
> REP  í  ì
>
> REP  i  íì
>
> REP  i  in
>
> REP  n  ǹ
>
> REP  n  ń
>
> REP  o  ọ̀
>
> REP  o  ọ́
>
> REP  o  ò
>
> REP  ò  ó
>
> REP  o  ó
>
> REP  ó  ò
>
> REP  ọ  ọ̀
>
> REP  ọ̀  ọ́
>
> REP  ọ  ọ́
>
> REP  ọ́
> ọ̀
>
> REP  o  ọ
>
> REP  ò
> ọ̀
>
> REP  ó  ọ́
>
> REP  o  òò
>
> REP  ò  òò
>
> REP  o  oo
>
> REP  o  oó
>
> REP  o  òó
>
> REP  o  ọ̀ọ̀
>
> REP  ọ  ọ̀ọ̀
>
> REP  ọ̀  ọ̀ọ̀
>
> REP  ọ̀
> ọ̀ọ́
>
> REP  ọ  ọ̀ọ́
>
> REP  o  ọ̀ọ́
>
> REP  ọ́
> ọ̀ọ́
>
> REP  s  ṣ
>
> REP  ṣ  s
>
> REP  u  ù
>
> REP  u  ú
>
> REP  u  ùú
>
> REP  ù  ùú
>
> REP  ú  ùú
>
> REP  ù  ùù
>
> REP  u  ùù
>
> REP  h y
>
> REP  E Ẹ
>
> REP  S Ṣ
>
> REP  O Ọ
>
> Best regards,Jeje
>
>
>
>
>
>
>
>
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
>
>

Reply via email to