[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2018-03-03 Thread facaiy
Github user facaiy commented on the issue: https://github.com/apache/spark/pull/17503 Colsed since its duplicate PR #20632 has been merged. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2018-01-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17503 Can one of the admins verify this patch? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2018-01-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17503 Can one of the admins verify this patch? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-12-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17503 Can one of the admins verify this patch? --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-09-26 Thread facaiy
Github user facaiy commented on the issue: https://github.com/apache/spark/pull/17503 HI, @WeichenXu123. As said by @srowen , the benefit of this would be for speed at predict time or for model storage. Hence I'm not sure whether benchmark is really need for the PR. ---

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-09-26 Thread WeichenXu123
Github user WeichenXu123 commented on the issue: https://github.com/apache/spark/pull/17503 Can you do some benchmark to show how much improvement this change will bring ? --- - To unsubscribe, e-mail:

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-08-26 Thread facaiy
Github user facaiy commented on the issue: https://github.com/apache/spark/pull/17503 Hi, @yanboliang . Do you have time to take a look at first? Thanks very much. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-07-04 Thread facaiy
Github user facaiy commented on the issue: https://github.com/apache/spark/pull/17503 @jkbradley May you have time reviewing the pr? I believe that it will be a little improvement for predict. Thanks. --- If your project is set up for it, you can reply to this email and have your

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-06-02 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/17503 (gentle ping @jkbradley) --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-04-27 Thread sethah
Github user sethah commented on the issue: https://github.com/apache/spark/pull/17503 I think the benefit of this would be for speed at predict time or for model storage. @srowen the nodes don't have to be equal to be merged, they just have to output the same prediction. Since this a

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-04-27 Thread facaiy
Github user facaiy commented on the issue: https://github.com/apache/spark/pull/17503 I have the same question with you. I guess that Impurity info is useful to debug and analysis tree model. However, as tree is grown from root to leaf when training, hence it seems needless to merge

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-04-26 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/17503 I was saying that I thought the nodes had more info than just the majority-class prediction. If they did, then they're much more rarely combinable, because they vary in more than just their

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-04-25 Thread facaiy
Github user facaiy commented on the issue: https://github.com/apache/spark/pull/17503 @srowen I am not sure whether I understand your question clearly. RandomForest uses LearningNode to construct tree model when training, and convert them to Leaf or InternalNode at last. Hence, all

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-04-24 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17503 **[Test build #3675 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3675/testReport)** for PR 17503 at commit

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-04-24 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17503 **[Test build #3675 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3675/testReport)** for PR 17503 at commit

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-04-24 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/17503 It looks reasonable though I don't feel qualified to review it. I thought the nodes had more than just the majority class - like the empirical distribution at the node? That would make them not

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-04-24 Thread facaiy
Github user facaiy commented on the issue: https://github.com/apache/spark/pull/17503 @srowen Hi, could you review the PR? The PR is simple, though many code for unit test are added. Thanks. --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-04-05 Thread facaiy
Github user facaiy commented on the issue: https://github.com/apache/spark/pull/17503 @jkbradley @hhbyyh Could you review the PR? thanks. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #17503: [SPARK-3159][MLlib] Check for reducible DecisionTree

2017-04-01 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17503 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this