Add support for converting the Leipzig corpus into doccat training data
------------------------------------------------------------------------
Key: OPENNLP-79
URL: https://issues.apache.org/jira/browse/OPENNLP-79
Project: OpenNLP
Issue Type: Improvement
Components: Doccat
Reporter: Jörn Kottmann
Assignee: Jörn Kottmann
Fix For: tools-1.5.1-incubating
Add a converter that turns the Leipzig corpus into training data for the
document categorizer, so that the corpus can be used to train a language
detection model.
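
The conversion described above can be sketched roughly as follows. This is only
an illustration, not the converter proposed in this issue. It assumes that a
Leipzig sentences file holds one "number<TAB>sentence" entry per line and that
doccat training data is one sample per line, with the category first and the
document text after it. The function name leipzig_to_doccat and the
sentences_per_sample parameter are hypothetical.

```python
from typing import Iterable, Iterator


def leipzig_to_doccat(
    lines: Iterable[str],
    language: str,
    sentences_per_sample: int = 5,
) -> Iterator[str]:
    """Yield doccat-style training lines: '<language> <text>'.

    Assumption: each input line looks like '<number>\\t<sentence>'.
    Lines that do not match that shape are skipped.
    """
    batch = []
    for line in lines:
        parts = line.rstrip("\n").split("\t", 1)
        if len(parts) != 2 or not parts[1].strip():
            continue  # not a 'number<TAB>sentence' line
        # Collapse internal whitespace so each sample stays on one line.
        batch.append(" ".join(parts[1].split()))
        if len(batch) == sentences_per_sample:
            yield f"{language} {' '.join(batch)}"
            batch = []
    if batch:
        # Emit the remaining sentences as a final, shorter sample.
        yield f"{language} {' '.join(batch)}"
```

Grouping several sentences into one sample gives the categorizer more text per
document. The group size here is an arbitrary choice for the sketch, and the
language code is passed in by the caller rather than read from the corpus.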