On 4/27/11 7:56 PM, Chris Collins wrote:
I think that is a great idea. I didn't really want to blast the mailing list
as I am not a contributor as of today. I have been using ONLP for a couple of
years now, and when it came time to train sentence and POS models in languages
not currently supported, I was surprised to see no guidelines, suggestions, or
best practices. Further, I see that with 1.5, support for reading training sets
became more flexible, but I have no idea what the public-facing plans are for
supporting new languages or what the methodology is going to be. I am not
looking for an answer to these questions from you, but I certainly would have
appreciated a better ecosystem on the ONLP website. If there were such a thing,
I would certainly share what our findings were (albeit perhaps not the
best ones :-} )
We have finally started to work on the documentation, and the 1.5.1 release
will come with a DocBook containing documentation, including how to train
OpenNLP on certain data sets.
It would be really nice if you could share your experience with us: on which
languages and which data sets did you train?
Jörn