Could you provide more information? For example: Spark version, dataset size (number of instances and number of features), cluster size, and any error messages from both the driver and the executors.

-Xiangrui
On Mon, Nov 10, 2014 at 11:28 AM, tsj <tsj...@gmail.com> wrote:
> Hello all,
>
> I have some text data that I am running different algorithms on.
> I had no problems with LibSVM and Naive Bayes on the same data,
> but when I run Decision Tree, the execution hangs in the middle
> of DecisionTree.trainClassifier(). The only difference from the example
> given on the site is that I am using 6 categories instead of 2, and the
> input is text that is transformed to labeled points using TF-IDF. It
> halts shortly after this log output:
>
> spark.SparkContext: Job finished: collect at DecisionTree.scala:1347, took
> 1.019579676 s
>
> Any ideas as to what could be causing this?
>
> --
> View this message in context:
> http://apache-spark-user-list.1001560.n3.nabble.com/MLLib-Decision-Tress-algorithm-hangs-others-fine-tp18515.html
> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
> For additional commands, e-mail: user-h...@spark.apache.org