2011/12/4 Erel Segal <[email protected]>:
>
> OK, well, how can I find the original corpus that was used to create this
> model? I would like to use it as a basis for training a new model.

Well, that's the main issue: unfortunately, it's not available for
redistribution. There is an effort to build one from open data (e.g.
Wikipedia, Wikinews...), but the project is a bit stalled:

https://cwiki.apache.org/OPENNLP/opennlp-annotations.html

If you have some spare time, please feel free to join.

-- 
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel
