Javier Sola wrote:
> The only good solution that I see is to use dictionary-based line
> breaking, and also a spellchecker, but this takes some work with ICU and
> with OpenOffice, as well as very good word lists.
>
> For dictionary-based breaking, Tsheng must be reclassified as non-boundary.

We thought about this for a long time in the past in the
case of Thai, and we could not find any solution.

Could you please give a concrete example of what you mean?
You probably mean line breaking = word breaking, right?
But even that does not make it clear to me what you mean...

A very good word list is a requirement for quality spell
checking in ANY language, exactly like a very good
affix file.

How does ICU come in here?

I think that when word breaks are the same as syllable
breaks, there is NO solution at all. Unfortunately.

He cannot change the original text, and cannot modify
either the syllable breaks or the word breaks.

A machine cannot find out, using just a syllable list, which
combinations of syllables are valid and which are not.
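
A minimal sketch of this point, assuming toy romanized syllables and
a hypothetical word list (SYLLABLES, WORDS and both function names
are made up for illustration and are not real data or an existing
API). A check against a syllable list accepts any sequence of known
syllables, while a word list can reject a sequence, and can still
leave more than one possible grouping:

```python
# Toy data, assumed for illustration only (not a real language).
SYLLABLES = {"ka", "la", "ma"}
WORDS = {"ka", "kala", "lama", "ma"}  # hypothetical word list


def valid_by_syllable_list(syllables):
    """Accept the text if every syllable is a known syllable."""
    return all(s in SYLLABLES for s in syllables)


def segment_by_word_list(syllables, words=WORDS, max_len=3):
    """Return every way to group the syllables into listed words."""
    if not syllables:
        return [[]]  # one way to segment nothing: the empty grouping
    results = []
    for n in range(1, min(max_len, len(syllables)) + 1):
        candidate = "".join(syllables[:n])
        if candidate in words:
            for rest in segment_by_word_list(syllables[n:], words, max_len):
                results.append([candidate] + rest)
    return results


# The syllable list accepts both sequences, so it cannot tell them apart.
print(valid_by_syllable_list(["ka", "la", "ma"]))
print(valid_by_syllable_list(["la", "ka"]))

# The word list rejects the second one, but the first stays ambiguous.
print(segment_by_word_list(["ka", "la", "ma"]))
print(segment_by_word_list(["la", "ka"]))
```

With this toy data the word list finds two groupings for the first
sequence and none for the second, so even a good word list does not
by itself decide which grouping the writer meant.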

-eleonora

