Re: Stemming, Stoplists and Language Models?

Tommaso Teofili Tue, 04 Mar 2014 11:26:05 -0800

I've attached the adapted source code to be donated into the Jira issue [1]


Regards,
Tommaso

[1] : https://issues.apache.org/jira/browse/OPENNLP-657


2014-02-28 10:11 GMT+01:00 Tommaso Teofili <[email protected]>:

>
>
>
> 2014-02-27 22:55 GMT+01:00 Tommaso Teofili <[email protected]>:
>
>
>>
>>
>> 2014-02-27 12:16 GMT+01:00 Jörn Kottmann <[email protected]>:
>>
>> On 02/23/2014 06:35 PM, Tommaso Teofili wrote:
>>>
>>>> I have implemented a very simple set of nlp tools at [1], with
>>>> implementations for ngrams [2] and language modeling [3] tasks too.
>>>> I'd be happy to donate it to Apache OpenNLP if the community is
>>>> interested.
>>>>
>>>
>>> Yes, that sounds very interesting. We already have ngram support, maybe
>>> we can merge your implementation
>>> with the current one in case there are any missing features.
>>>
>>
>> sure
>>
>>
>>>
>>> It would be nice if you could create an issue to contribute the code.
>>>
>>
>> yes, I'll do that
>>
>
> done, here it is: https://issues.apache.org/jira/browse/OPENNLP-657
>
> Regards,
> Tommaso
>
>
>>
>>
>>>
>>> Do you think we should directly include in opennlp-tools or first ship
>>> it as an addon or make it part of the sandbox?
>>
>>
>> maybe I'd put it in the sandbox to start, where to move things after that
>> would also depend a bit on where the different features best fit: ngram /
>> language modeling would fit well in opennlp-tools and maybe CFGs too, maybe
>> gradient descent / regression in opennlp-ml, not sure about naive bayes and
>> anomaly detection but I guess we can decide that also later on.
>>
>> Thanks,
>> Tommaso
>>
>>
>>>
>>>
>>> Jörn
>>>
>>
>>
>

Re: Stemming, Stoplists and Language Models?

Reply via email to