Hi there,

I added a model for NameFinder (de) based on Tiger treebank 2.2 and attached it to the issue.

For details see https://issues.apache.org/jira/browse/OPENNLP-1223.

I first extracted the 6,271 sentences that mention names and trained on that filtered data. Or is it better to use the complete training data (including the sentences without names)?
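
For illustration only, the filtering step could look roughly like the sketch below. It is not the script that was actually used. It assumes the Tiger sentences have already been converted to OpenNLP's name finder training format, with one sentence per line and names marked as <START:type> ... <END>. The function name and the sample sentences are made up for the example.

```python
import re

# Matches an opening name tag such as <START:person> or a bare <START>.
START_TAG = re.compile(r"<START(:[^>\s]+)?>")


def keep_sentences_with_names(lines):
    """Return only the sentences that contain at least one annotated name."""
    return [line for line in lines if START_TAG.search(line)]


# Hypothetical sample data, one tokenized sentence per line.
sample = [
    "<START:person> Angela Merkel <END> besucht <START:location> Wien <END> .",
    "Das Wetter ist heute schön .",
]

filtered = keep_sentences_with_names(sample)
print(len(filtered), "of", len(sample), "sentences kept")
```

Training on the complete data would simply mean skipping this filter and passing all sentences to the trainer.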

Best regards,

Johannes


On 28.09.2018 at 09:28, Joern Kottmann wrote:
Hello,

at Apache we can only distribute artifacts that can be licensed under
the AL 2.0.

I am not sure what the situation with the Tiger corpus is, but it
might have a clause in its license which would restrict this.

Anyway, +1 to releasing a model trained on the Tiger corpus, and to adding
support to train on it.

Jörn
On Wed, Sep 26, 2018 at 4:06 PM J. Fiala <[email protected]> wrote:
Hi there,

I saw there is no Name Finder model for German.

Would you be interested in having one based on Tiger, or is someone else
already working on that?

I could not find an issue for adding NameFinder models for other
languages. Should I create a new one?

Thanks & Best regards,
Johannes


