[
https://issues.apache.org/jira/browse/CONNECTORS-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15117992#comment-15117992
]
Karl Wright commented on CONNECTORS-1270:
-----------------------------------------
The actual models are not scary-big, about 16 MB in total:
{code}
01/26/2016 03:46 PM 5,110,658 en-ner-location.bin
01/26/2016 03:47 PM 5,297,172 en-ner-organization.bin
01/26/2016 03:46 PM 5,207,953 en-ner-person.bin
01/26/2016 03:47 PM 98,533 en-sent.bin
01/26/2016 03:46 PM 439,890 en-token.bin
{code}
So, I think either including them as a resource or downloading on the fly would
work.
Unfortunately, while I can find numerous models free for the downloading at
opennlp.sourceforge.net/models-1.5, it's not clear what their license is. It's
clear that OpenNLP moved at some point from SourceForge to Apache, but it is
not clear whether the available models came along. Since we can't ship files
of unknown license as a resource, downloading on the fly is the only real
option.
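To make "downloading on the fly" concrete, here is a minimal sketch of a download-and-cache helper. The class name ModelFetcher, its method, and the cache directory layout are all hypothetical; nothing here is taken from the contributed connector or from the OpenNLP API. It only shows one way the fetch could be done so that each model is pulled once and reused afterwards.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URI;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

/** Hypothetical helper; not part of ManifoldCF or OpenNLP. */
public final class ModelFetcher {
  private ModelFetcher() {}

  /**
   * Returns the local path of the named model, downloading it from baseUri
   * into cacheDir first if it is not already cached.
   * baseUri must end with '/' so that the model name resolves beneath it.
   */
  public static Path fetchIfMissing(URI baseUri, Path cacheDir, String modelName)
      throws IOException {
    Files.createDirectories(cacheDir);
    Path target = cacheDir.resolve(modelName);
    if (Files.isRegularFile(target)) {
      return target; // already cached, no download needed
    }
    // Download to a temporary file first so that a failed transfer never
    // leaves a truncated model under the final name.
    Path tmp = Files.createTempFile(cacheDir, modelName, ".part");
    try (InputStream in = baseUri.resolve(modelName).toURL().openStream()) {
      Files.copy(in, tmp, StandardCopyOption.REPLACE_EXISTING);
      Files.move(tmp, target, StandardCopyOption.REPLACE_EXISTING);
    } finally {
      Files.deleteIfExists(tmp); // no-op when the move succeeded
    }
    return target;
  }
}
```

A caller might pass something like URI.create("http://opennlp.sourceforge.net/models-1.5/") as baseUri and "en-sent.bin" as modelName. The host and path come from the comment above, while the scheme and trailing slash are assumptions. The licensing question still applies to whoever runs the download; the sketch only avoids redistributing the files ourselves.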
> Import OpenNLP connector into trunk
> -----------------------------------
>
> Key: CONNECTORS-1270
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1270
> Project: ManifoldCF
> Issue Type: Task
> Reporter: Karl Wright
> Assignee: Rafa Haro
> Fix For: ManifoldCF 2.4
>
>
> An OpenNLP connector has been contributed on github. Need to import it into
> MCF, first to a branch, then to trunk.