NE Training + Dictionary?

Thomas Zastrow Thu, 10 Oct 2013 02:59:42 -0700

Hello,

There seems to be no free German NE model available, so I started to think about creating one, using only free resources such as Wikipedia.


I still have some questions:

Somewhere in the documentation, I read about a dictionary-driven NE recognizer in OpenNLP, but I didn't find any further information about it. Anyway, would it be possible to combine the statistical approach with dictionaries? For example, having a list of country names would be useful.
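
To illustrate what I mean by a dictionary lookup, here is a minimal sketch in plain Python. It is not the OpenNLP API; the names `COUNTRIES` and `find_dictionary_spans` are hypothetical, and the tiny country list is only an assumption standing in for a list extracted from Wikipedia. The spans it returns could then be compared or merged with the output of the statistical name finder.

```python
# Hypothetical gazetteer: each entry is a tuple of tokens.
# A real list could be extracted from a free resource such as Wikipedia.
COUNTRIES = {("Deutschland",), ("Frankreich",), ("Vereinigten", "Staaten")}


def find_dictionary_spans(tokens, entries, label):
    """Return (start, end, label) spans with an exclusive end index.

    At each token position the longest matching dictionary entry wins,
    and matching continues after the end of that entry.
    """
    max_len = max(len(entry) for entry in entries)
    spans = []
    i = 0
    while i < len(tokens):
        match_end = None
        # Try the longest candidate first, then shorter ones.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            if tuple(tokens[i:i + n]) in entries:
                match_end = i + n
                break
        if match_end is None:
            i += 1
        else:
            spans.append((i, match_end, label))
            i = match_end
    return spans


tokens = "Er reiste von Frankreich in die Vereinigten Staaten .".split()
spans = find_dictionary_spans(tokens, COUNTRIES, "location")
print(spans)
```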

As far as I understand, the name finder is at the moment only stable for one property, like person names. I would like to have the traditional division into persons, locations, organizations and misc. When manually creating the training data, would it be OK to add all four kinds to the text right away and then perhaps later create four models for the different properties?
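
To make the idea concrete, here is a minimal sketch of how one annotated file could later be split into per-type training data. It assumes the `<START:type> ... <END>` markup of the name finder training format; the helper `keep_only` and the example sentence are hypothetical, and whether four separate models actually work better than one combined model is exactly what I am asking.

```python
import re

# Matches one annotated entity, e.g. "<START:person> Angela Merkel <END>".
TAG = re.compile(r"<START:(\w+)> (.+?) <END>")


def keep_only(line, wanted):
    """Keep annotations of type `wanted`; strip the markup of all other types."""
    def replace(match):
        if match.group(1) == wanted:
            return match.group(0)   # keep the annotation untouched
        return match.group(2)       # keep only the plain tokens
    return TAG.sub(replace, line)


line = ("<START:person> Angela Merkel <END> besuchte "
        "<START:organization> Siemens <END> in "
        "<START:location> München <END> .")

person_line = keep_only(line, "person")
location_line = keep_only(line, "location")
print(person_line)
print(location_line)
```

Running `keep_only` once per type over the whole file would give four training files from a single manual annotation pass.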

The name finder takes sentences and tokens as input. Would it be OK to also have POS tags assigned to the training data? That would make it much easier to annotate the data manually when, e.g., NEs are already marked by the POS tagger.

That's it for the moment. I'm quite sure I will come back later with more questions :-)


Best,

Tom

--
Dr. Thomas Zastrow
Riemerfeldring 7a

85748 Garching
Tel.: 0162 422 8029
www.thomas-zastrow.de
