Could you provide more information? For example, spark version,
dataset size (number of instances/number of features), cluster size,
error messages from both the drive and the executor. -Xiangrui
On Mon, Nov 10, 2014 at 11:28 AM, tsj tsj...@gmail.com wrote:
Hello all,
I have some text data that I am running different algorithms on.
I had no problems with LibSVM and Naive Bayes on the same data,
but when I run Decision Tree, the execution hangs in the middle
of DecisionTree.trainClassifier(). The only difference from the example
given on the site is that I am using 6 categories instead of 2, and the
input is text that is transformed to labeled points using TF-IDF. It
halts shortly after this log output:
spark.SparkContext: Job finished: collect at DecisionTree.scala:1347, took
1.019579676 s
Any ideas as to what could be causing this?
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/MLLib-Decision-Tress-algorithm-hangs-others-fine-tp18515.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.
-
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org
-
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org