+1 as well. I think it should be in core rather than utils due to dependency issues.

On Sat, Jan 16, 2010 at 7:16 AM, Olivier Grisel <olivier.gri...@ensta.org> wrote:

> 2010/1/16 Grant Ingersoll <gsing...@apache.org>:
> > I think we should start a new module, that will be the seed for a
> > subproject, called NLP and that contains the stuff for NLP.
> >
> > Either that or put them in the utils module, which is where I envision
> > all of things that are "helpful" for ML go, but aren't required.
>
> +1 for an explicit "org.apache.mahout.nlp module". Tools to turn
> wikipedia dumps into term freq vectors could also move there instead
> of "examples".
>
> --
> Olivier
> http://twitter.com/ogrisel - http://code.oliviergrisel.name

--
Ted Dunning, CTO DeepDyve