Re: Help Troubleshooting Naive Bayes

2014-10-02 Thread Mike Bernico
ure dimension, number of instances, number
>> of classes, and number of partitions. Do you mind sharing those
>> numbers? -Xiangrui
>>
>> On Wed, Oct 1, 2014 at 6:31 PM, Mike Bernico
>> wrote:
>> > Hi Everyone,
>> >
>> > I'm working on t

Help Troubleshooting Naive Bayes

2014-10-01 Thread Mike Bernico
Hi Everyone, I'm working on training MLlib's Naive Bayes to classify TF-IDF vectorized docs using Spark 1.1.0. This works fine on a smaller data set, but when I increase the number of vectorized documents, training hangs. The only messages I'm seeing are below. I'm pr