On 6/24/11 7:32 PM, Hannes Korte wrote:
How about a button like "Incomplete sentence" in the entity UI. When the
user hits it, he/she gets the context and can select the complete
sentence. I guess this will get really complicated to merge all the
different user annotations then. But at least we don't need an
additional annotation task for the sentence labeling.
For sentence information we could have an annotation which marks
end-of-sentence
characters. If the a user now find an invalid sentence he can insert
such an annotation
and the generated sentence annotations can be corrected.
It does not really matter where this information comes from, I first
thought it might
be nice to have a dedicated ui for this. But it could also be part of an
ui to label entities,
as suggested by Hannes, but the created annotation would still be the same.
The entity annotation itself could also be treated as a confirmation
that something
is not an end-of-sentence character. Lets take Yahoo! for example, if it
is labeled
as organization and one token we should annotate the exclamation mark as an
end-of-sentence character which is not a sentence end.
Jörn