Coref training format

Jörn Kottmann Wed, 28 Mar 2012 05:16:24 -0700

Hi all,

After all my issues with WordFreak, I now believe the easiest way to
train the coref component is to define a training format and just
write the missing training code.


I suggest the following training format:

- Documents are separated by an empty line, as is done for the name finder

- One parse per line (as is done in the parser format)

- Additional #id tag on noun phrases in the parse, so that the coref component
  knows which noun phrases are coreferent and which are not.
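
To make the three points above concrete, here is a rough sketch of what such a file could look like and how it might be read. The exact notation is an assumption: the `NP#1` suffix syntax, the rule that ids are scoped to one document, and the helper names (`split_documents`, `mentions`, `coref_chains`) are all hypothetical and only meant for illustration, not as the actual OpenNLP format or API.

```python
import re
from collections import defaultdict

# Hypothetical sample data. Writing the id as a "#id" suffix on the NP label
# is an assumption; the proposal does not fix the exact notation.
SAMPLE = """\
(TOP (S (NP#1 (NNP John)) (VP (VBD bought) (NP#2 (DT a) (NN car)))))
(TOP (S (NP#1 (PRP He)) (VP (VBZ likes) (NP#2 (PRP it)))))

(TOP (S (NP#1 (NNP Mary)) (VP (VBD slept))))
"""

# A token is an opening bracket, a closing bracket, or a label/word.
TOKEN = re.compile(r"\(|\)|[^\s()]+")


def split_documents(text):
    """Split the input on empty lines; each document is a list of parse lines."""
    docs, current = [], []
    for line in text.splitlines():
        if line.strip():
            current.append(line.strip())
        elif current:
            docs.append(current)
            current = []
    if current:
        docs.append(current)
    return docs


def mentions(parse_line):
    """Return (id, covered text) for every constituent whose label has '#id'."""
    tokens = TOKEN.findall(parse_line)
    stack = []   # open constituents as (label, words covered so far)
    found = []
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        if tok == "(":
            # The token right after "(" is the constituent label.
            stack.append((tokens[i + 1], []))
            i += 2
        elif tok == ")":
            label, words = stack.pop()
            if "#" in label:
                found.append((label.split("#", 1)[1], " ".join(words)))
            if stack:
                # Pass the covered words up to the parent constituent.
                stack[-1][1].extend(words)
            i += 1
        else:
            stack[-1][1].append(tok)
            i += 1
    return found


def coref_chains(doc):
    """Group the mentions of one document by id (ids assumed document-local)."""
    chains = defaultdict(list)
    for line in doc:
        for mention_id, text in mentions(line):
            chains[mention_id].append(text)
    return dict(chains)


if __name__ == "__main__":
    for doc in split_documents(SAMPLE):
        print(coref_chains(doc))
```

Noun phrases that share an id within a document would be treated as coreferent; an NP without an id would simply be ignored by this reader.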

Any comments or suggestions?

Jörn
