Hi Justin, I am glad to know that trees are working well for you.
The trees will support minimum samples per node in a future release. Thanks for the feedback. -Manish On Fri, Jun 13, 2014 at 8:55 PM, Justin Yip <yipjus...@gmail.com> wrote: > Hello, > > I have been playing around with mllib's decision tree library. It is > working great, thanks. > > I have a question regarding overfitting. It appears to me that the current > implementation doesn't allows user to specify the minimum number of samples > per node. This results in some nodes only contain very few samples, which > potentially leads to overfitting. > > I would like to know if there is workaround or any way to prevent > overfitting? Or will decision tree supports min-samples-per-node in future > releases? > > Thanks. > > Justin > > >