[
https://issues.apache.org/jira/browse/OPENNLP-15?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12972055#action_12972055
]
Jörn Kottmann commented on OPENNLP-15:
--------------------------------------
The handling of the file encoding must be improved. Currently the UTF-8
encoding is hardcoded for both generated input files, german and english.
The input files are generated by conll03 scripts. The input encoding might be
defined by the
conll03 scripts (must be checked). In the case that is true the encoding could
be hardcoded,
otherwise it should be configurable.
If the encoding is hardcoded, the cmd line interface must be improved to not
accept an encoding parameter
which is not used.
> Add support for the CoNLL 03 data format
> ----------------------------------------
>
> Key: OPENNLP-15
> URL: https://issues.apache.org/jira/browse/OPENNLP-15
> Project: OpenNLP
> Issue Type: Improvement
> Components: Name Finder
> Affects Versions: tools-1.5.0-sourceforge
> Reporter: Jörn Kottmann
> Fix For: tools-1.5.1-incubating
>
>
> Adding support to convert CoNLL 03 Reurters Support to NameFinder.
> Work on this issue began over at sourceforge:
> http://sourceforge.net/tracker/?func=detail&aid=3081785&group_id=3368&atid=353368
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.