Hi all
Changed the download location for the OpenNLP server from sourceforge
to the IKS dev server with revision 1170500 [1]
In addition the mod date of all the files was updated to the
14.Sep.2011 ~10:00. This should cause invalid models downloaded from
the sourceforge server to be overwritten by the correct versions with
the next maven build.
In case someone downloads illegal versions of the OpenNLP models from
the old location later than today he will need to manually delete such
files (because maven will think the corrupted versions are newer). In
this case one needs to delete all *.bin files in the following two
folders
{stanbol-root}/data/opennlp/lang/en/src/main/resources/org/apache/stanbol/data
{stanbol-root}/data/opennlp/ner/en/src/main/resources/org/apache/stanbol/data
best
Rupert Westenthaler
[1] http://svn.apache.org/viewvc?view=revision&revision=1170500
On Tue, Sep 13, 2011 at 6:28 PM, Rupert Westenthaler <[email protected]> wrote:
> Hi all,
>
> I will copy all the required files to the IKS server and also change
> the paths in the build files in the coming days.
>
> At completion I will write a short notice to this thread
>
> best
> Rupert
>
> On Tue, Sep 13, 2011 at 4:35 PM, Fabian Christ
> <[email protected]> wrote:
>> Hi,
>>
>> yes +1 for hosting on IKS server.
>>
>> - Fabian
>>
>> Am Montag, 12. September 2011 schrieb Olivier Grisel :
>>
>>> 2011/9/12 Stefane Fermigier <[email protected] <javascript:;>>:
>>> >
>>> > On Sep 12, 2011, at 10:48 AM, Rupert Westenthaler wrote:
>>> >
>>> >> On Mon, Sep 12, 2011 at 10:34 AM, Reto Bachmann-Gmür
>>> >> <[email protected]<javascript:;>>
>>> wrote:
>>> >>> Add them to svn? The files aren't that big. A similar issue but maybe
>>> >>> a bit harder to solve is the downloaded dbpedia data, I don't think
>>> >>> the released version should depend on third party servers for
>>> >>> compiling.
>>> >>>
>>> >> The reason why the OpenNLP models are not yet hosted @apache.org is
>>> >> because of licenses issues.
>>> >
>>> > Which are ? What's the license on the NLP models ? If they are on
>>> SourceForge, they should me open source.
>>>
>>> Those models are statistically derived from copyrighted material that
>>> is available for NLP researchers under a restrictive license "for
>>> research purpose only". Hence the license of such derived work is
>>> somewhat "gray". Better have models trained on explicitly annotated
>>> corpus freely redistributable for a any purpose. That's why I started
>>> the pignlproc project to build models from Wikipedia and contacted the
>>> OpenNLP developers to collaborate on this. They started an effort in
>>> that direction but nobody has enough time to finish building & testing
>>> models with good enough quality so far.
>>>
>>> > If they are under a license incompatible with apache.org, OK, but
>>> nothing prevents the IKS project from hosting open source stuff, right ?
>>>
>>> +1 for mirroring the models on a IKS server.
>>>
>>> --
>>> Olivier
>>> http://twitter.com/ogrisel - http://github.com/ogrisel
>>>
>>
>>
>> --
>> Fabian
>> http://twitter.com/fctwitt
>>
>
>
>
> --
> | Rupert Westenthaler [email protected]
> | Bodenlehenstraße 11 ++43-699-11108907
> | A-5500 Bischofshofen
>
--
| Rupert Westenthaler [email protected]
| Bodenlehenstraße 11 ++43-699-11108907
| A-5500 Bischofshofen