Hi all,
after all my issues with wordfreak I now believe the easiest way to
train the coref component is by defining a training format and just
write the missing training code.
I suggest the following training format:
- Documents are separated by an empty line like it is done for the name
finder
- One parse per line (like it is done in the parser format)
- Additional #id tag on noun phrases in the parse so that the coref
component
knows which noun phrases are coreferent and which not.
Any comments or suggestions?
Jörn