Re: [lingu-dev] Hyphen should be forwarded to spellchecker

Ruud Baars Fri, 05 Jun 2009 04:09:10 -0700

Thomas Lange - Sun Germany - ham02 - Hamburg schreef:

Hi all,


Olivier R. wrote:

Per Eriksson a écrit :

I would like to know the status for 64400. What is the current status?

I would like to know also what solution will be applied.

It would useful to prepare dictionaries accordingly.


It is planned that hyphens should become part of the word as recognized
by the breakiterator. The ecxact types of hyphens and languages where to
apply this (probably all western languages at least, maybe all
languages) is still up for discussion. From my point of view I currently
see no reason why the same behavior should not be applied to all languages.

It would be best to have a design that has a language dependendword-iteration.Which chars can and cannot be part of a word is rather depending on thelanguage, plus the programmed behaviour of the components below that.

For Dutch, the -, the ' (under certain circumstances) are word chars.

Some general construction like SRX does for sentences, could be used foapplicar isolationg the words in a sentence too.This construct would preferably be applicable to all programs usingspellchecking. Grammar has partly the same issues for findign the tokens.

As to when the change will happen:
It should be fixed for OOo 3.2 but details are not yet known.

That would be perfect! We could prepare the word list to recognizeincorrectly used dashes ...

Can we already assume that if a word with hyphen is not recognized, eachparts of the word will be checked separately?

That needs to be discussed with László since that will probably needed
to be taken care of by hunspell.
However as mentioned in the issue, a problem will be that the current
spell check API can not mark parts of the text to be checked as wrong.
Also even if it could be done in the spell check API the applications
are not yet prepared for it.
See my comment from /Wed Sep 3 11:31:15 +0000 2008 in the issue,
especially in c).

Of course in order to see if that approach works at all we need to have
the hyphen to be part of the word as soon as possible. Only then we can
see in detail what additional problems will arise and how to handle them.
/
The long run optimal solution would be that this problem is handled by a
grammar checker that is also a spell checker and thus can make use of
the grammar checking API that allows for proper handling of this problem
(mark only part of the text and provide suggestions only for that part).

My idea! Let's integrate 'automated proofreading' !


Regards,
Thomas



---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lingucomponent.openoffice.org
For additional commands, e-mail: dev-h...@lingucomponent.openoffice.org

Re: [lingu-dev] Hyphen should be forwarded to spellchecker

Reply via email to