Javier Sola wrote:
> The only good solution that I see is to use dictionary-based line
> breaking, and also a spellchecker, but this takes some work with ICU and
> with OpenOffice, as well as very good word lists.
>
> For dictionary-based breaking, Tsheng must be reclassified as non-boundary.

We thought about this for a long time in the past in the case of Thai, and we could not find any solution.

Could you please give a concrete example of what you mean? You probably mean line breaking = word breaking, right? But even that does not make clear to me what you mean...

A very good word list is a requirement for quality spell checking in ANY language, exactly like a very good affix file. How does ICU come in here?

I think that when word breaks are the same as syllable breaks, there is NO solution at all, unfortunately. One cannot change the original text, and cannot modify either the syllable breaks or the word breaks. Using just a syllable list, a machine cannot find out which combinations of syllables are valid and which are not.

-eleonora
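
To make the word-list point concrete, here is a minimal sketch of dictionary-based breaking using greedy longest match. This is only an illustration under stated assumptions: the text is assumed to be already split into syllables, the syllables and the word list are made-up placeholders (not real language data), and `segment` is a hypothetical helper, not ICU or OpenOffice code.

```python
from typing import List, Set, Tuple

Word = Tuple[str, ...]


def segment(syllables: List[str], words: Set[Word], max_len: int = 4) -> List[Word]:
    """Group syllables into words by greedy longest match against a word list.

    A syllable that starts no known word is emitted on its own.
    """
    result: List[Word] = []
    i = 0
    while i < len(syllables):
        # Try the longest candidate first, falling back to a single syllable.
        for n in range(min(max_len, len(syllables) - i), 0, -1):
            candidate = tuple(syllables[i:i + n])
            if n == 1 or candidate in words:
                result.append(candidate)
                i += n
                break
    return result


# Hypothetical placeholder data, chosen only for illustration.
WORDS: Set[Word] = {("ka", "ra"), ("ka", "ra", "mo"), ("ti",)}
TEXT = ["ka", "ra", "mo", "ti", "ka", "ra"]

with_word_list = segment(TEXT, WORDS)
without_word_list = segment(TEXT, set())
print(with_word_list)
print(without_word_list)
```

With the word list, the syllables are grouped into known words. With an empty word list, every syllable stands alone, because a list of syllables by itself says nothing about which combinations are valid.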