What happens if all the entity tokens are at the beginning of every line?
I find that openlp then thinks that any string near the beginning of a line
is an entity,
regardless of the content or word context



On Mon, Oct 14, 2013 at 12:48 PM, Thomas Zastrow <[email protected]>wrote:

> Thanks. That explains a lot ... :-)
>
> Does it play a role it it is one or two blanks?
>
>
>
> Am 14.10.2013 21:44, schrieb William Colen:
> > Yes, it does. Include a blank between any element, including punctuations
> > and annotations. The corpus must be tokenized.
> >
> >
> > 2013/10/14 Thomas Zastrow <[email protected]>
> >
> >> Hello,
> >>
> >> I have a question: when creating training material, does it make a
> >> difference if there are " " (blanks) around the NE? In other words, is
> >> it the same to have:
> >>
> >> <START:loc>Hamburg<END>
> >>
> >> or:
> >>
> >> <START:loc> Hamburg <END>
> >>
> >> The example in the documentation shows up with the " " ... ?
> >>
> >> Best,
> >>
> >> Tom
> >>
> >> P.S.: ca. 1300 sentences for a free German NE model are done :-)
> >>
> >
>
>


-- 
This e-mail message, and any attachments, is intended only for the use of
the individual or entity identified in the alias address of this message
and may contain information that is confidential, privileged and subject to
legal restrictions and penalties regarding its unauthorized disclosure and
use. Any unauthorized review, copying, disclosure, use or distribution is
strictly prohibited. If you have received this e-mail message in error,
please notify the sender immediately by reply e-mail and delete this
message, and any attachments, from your system. Thank you.

Reply via email to