I've attached the adapted source code to be donated into the Jira issue [1]
Regards, Tommaso [1] : https://issues.apache.org/jira/browse/OPENNLP-657 2014-02-28 10:11 GMT+01:00 Tommaso Teofili <[email protected]>: > > > > 2014-02-27 22:55 GMT+01:00 Tommaso Teofili <[email protected]>: > > >> >> >> 2014-02-27 12:16 GMT+01:00 Jörn Kottmann <[email protected]>: >> >> On 02/23/2014 06:35 PM, Tommaso Teofili wrote: >>> >>>> I have implemented a very simple set of nlp tools at [1], with >>>> implementations for ngrams [2] and language modeling [3] tasks too. >>>> I'd be happy to donate it to Apache OpenNLP if the community is >>>> interested. >>>> >>> >>> Yes, that sounds very interesting. We already have ngram support, maybe >>> we can merge your implementation >>> with the current one in case there are any missing features. >>> >> >> sure >> >> >>> >>> It would be nice if you could create an issue to contribute the code. >>> >> >> yes, I'll do that >> > > done, here it is: https://issues.apache.org/jira/browse/OPENNLP-657 > > Regards, > Tommaso > > >> >> >>> >>> Do you think we should directly include in opennlp-tools or first ship >>> it as an addon or make it part of the sandbox? >> >> >> maybe I'd put it in the sandbox to start, where to move things after that >> would also depend a bit on where the different features best fit: ngram / >> language modeling would fit well in opennlp-tools and maybe CFGs too, maybe >> gradient descent / regression in opennlp-ml, not sure about naive bayes and >> anomaly detection but I guess we can decide that also later on. >> >> Thanks, >> Tommaso >> >> >>> >>> >>> Jörn >>> >> >> >
