--features in trainlogistic what is this for?

2012-04-27 Thread Yang
when I run mahout trainlogistic , there is an optional param --features from the book Mahout in action, it says: --features The size of the internal feature vector to use in building the model. A larger value here can be helpful, especially with text-like input data so is this something like

Re: --features in trainlogistic what is this for?

2012-04-27 Thread Ted Dunning
Putting a smaller value here will degrade prediction quality because more and more features will collide in the hashed feature space. Increasing this beyond a certain point, however, will not significantly increase prediction quality and it will increase memory usage. On Fri, Apr 27, 2012 at