Re: MLlib Decision Tree algorithm hangs, others fine

2014-11-11 Thread Xiangrui Meng
Could you provide more information? For example: the Spark version,
dataset size (number of instances and number of features), cluster size,
and any error messages from both the driver and the executors. -Xiangrui

On Mon, Nov 10, 2014 at 11:28 AM, tsj tsj...@gmail.com wrote:
 Hello all,

 I have some text data that I am running different algorithms on.
 I had no problems with LibSVM and Naive Bayes on the same data,
 but when I run Decision Tree, the execution hangs in the middle
 of DecisionTree.trainClassifier(). The only difference from the example
 given on the site is that I am using 6 categories instead of 2, and the
 input is text that is transformed to labeled points using TF-IDF. It
 halts shortly after this log output:

 spark.SparkContext: Job finished: collect at DecisionTree.scala:1347, took 1.019579676 s

 Any ideas as to what could be causing this?



 --
 View this message in context: 
 http://apache-spark-user-list.1001560.n3.nabble.com/MLLib-Decision-Tress-algorithm-hangs-others-fine-tp18515.html
 Sent from the Apache Spark User List mailing list archive at Nabble.com.

 -
 To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
 For additional commands, e-mail: user-h...@spark.apache.org





MLlib Decision Tree algorithm hangs, others fine

2014-11-10 Thread tsj
Hello all,

I have some text data that I am running different algorithms on. 
I had no problems with LibSVM and Naive Bayes on the same data, 
but when I run Decision Tree, the execution hangs in the middle 
of DecisionTree.trainClassifier(). The only difference from the example 
given on the site is that I am using 6 categories instead of 2, and the 
input is text that is transformed to labeled points using TF-IDF. It
halts shortly after this log output:

spark.SparkContext: Job finished: collect at DecisionTree.scala:1347, took 1.019579676 s

Any ideas as to what could be causing this?
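
Since the input here is TF-IDF text and the request in this thread is for the number of features, one thing that can be checked before anything else is how wide the feature vectors are. The sketch below is only a back-of-the-envelope estimate in plain Python, not a diagnosis: it assumes a tree trainer that keeps one dense counter per (feature, bin, class) for each node it is splitting. Whether the MLlib version in use allocates statistics this way is an assumption, as are the function name, the feature counts (2**20 and 2**12), the 32 bins, and the 8-byte counters. Only the 6 classes come from the post.

```python
def split_stats_bytes(num_features, max_bins, num_classes,
                      nodes_per_pass=1, bytes_per_count=8):
    """Rough size in bytes of dense split statistics.

    Assumes one counter per (feature, bin, class) for every node
    processed in a single pass. This is a simplifying assumption,
    not a description of any particular library's internals.
    """
    return (num_features * max_bins * num_classes
            * nodes_per_pass * bytes_per_count)


# Hypothetical wide input: 2**20 hashed TF-IDF features, 32 bins, 6 classes.
wide = split_stats_bytes(num_features=2**20, max_bins=32, num_classes=6)

# Hypothetical narrower input: the same text hashed into 2**12 features.
narrow = split_stats_bytes(num_features=2**12, max_bins=32, num_classes=6)

print("wide:   %.2f GiB per node" % (wide / 2**30))
print("narrow: %.2f MiB per node" % (narrow / 2**20))
print("ratio:  %dx" % (wide // narrow))
```

Under these assumptions the estimate scales linearly with the number of features, so reporting the vector size alongside the Spark version and cluster size would help narrow down whether feature width is a factor.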


