Yep. Tom, Drew and I have a lot of it working already (sentence detection, NER, others). I think POS tagging will be useful too.
On a related note, one of the things we talked about is if there is any interest in patches that can make it easier to use Lucene's token stream (especially the new AttributeSource stuff) instead of re-inventing the wheel here. Plus, especially with Lucene trunk, things all work off of bytes instead of chars, so they are a lot faster. Another advantage is there are a ton of Lucene impls out there and it might make adoption of OpenNLP easier b/c analysis might be able to be shared. -Grant On Jan 31, 2011, at 1:13 PM, Jörn Kottmann wrote: > Hi all, > > the Lucene team created an issue to integrate OpenNLP: > https://issues.apache.org/jira/browse/LUCENE-2899 > > Nice to see this effort. > > Jörn
