[
https://issues.apache.org/jira/browse/OPENNLP-53?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Martin Wiesner updated OPENNLP-53:
----------------------------------
Fix Version/s: 2.6.0
> Parser should have simple interface to process a tokenized input sentence
> -------------------------------------------------------------------------
>
> Key: OPENNLP-53
> URL: https://issues.apache.org/jira/browse/OPENNLP-53
> Project: OpenNLP
> Issue Type: Improvement
> Components: Parser
> Reporter: Jörn Kottmann
> Priority: Major
> Fix For: 2.6.0
>
>
> The parser expects a tokenized sentence as input, but currently it must be
> converted to a string where each
> token is separated by a white space.
> This interface turned out to be inconvenient if the input if the input
> sentence is
> provided as a list of strings or a string with a token span list. In both case
> a new string must be created. In this new string the offsets of the
> individual tokens
> must be remember in order to retrieve the parse tree out of the Parse objects.
> Create a more convenient way of interacting with an already tokenized
> sentence which
> is not in a whitespace separated format.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)