Github user facaiy commented on the issue:
https://github.com/apache/spark/pull/17503
Colsed since its duplicate PR #20632 has been merged.
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/17503
Can one of the admins verify this patch?
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/17503
Can one of the admins verify this patch?
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/17503
Can one of the admins verify this patch?
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional
Github user facaiy commented on the issue:
https://github.com/apache/spark/pull/17503
HI, @WeichenXu123.
As said by @srowen , the benefit of this would be for speed at predict time
or for model storage. Hence I'm not sure whether benchmark is really need for
the PR.
---
Github user WeichenXu123 commented on the issue:
https://github.com/apache/spark/pull/17503
Can you do some benchmark to show how much improvement this change will
bring ?
---
-
To unsubscribe, e-mail:
Github user facaiy commented on the issue:
https://github.com/apache/spark/pull/17503
Hi, @yanboliang . Do you have time to take a look at first? Thanks very
much.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your
Github user facaiy commented on the issue:
https://github.com/apache/spark/pull/17503
@jkbradley May you have time reviewing the pr? I believe that it will be a
little improvement for predict. Thanks.
---
If your project is set up for it, you can reply to this email and have your
Github user HyukjinKwon commented on the issue:
https://github.com/apache/spark/pull/17503
(gentle ping @jkbradley)
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and
Github user sethah commented on the issue:
https://github.com/apache/spark/pull/17503
I think the benefit of this would be for speed at predict time or for model
storage. @srowen the nodes don't have to be equal to be merged, they just have
to output the same prediction. Since this a
Github user facaiy commented on the issue:
https://github.com/apache/spark/pull/17503
I have the same question with you. I guess that Impurity info is useful to
debug and analysis tree model. However, as tree is grown from root to leaf when
training, hence it seems needless to merge
Github user srowen commented on the issue:
https://github.com/apache/spark/pull/17503
I was saying that I thought the nodes had more info than just the
majority-class prediction. If they did, then they're much more rarely
combinable, because they vary in more than just their
Github user facaiy commented on the issue:
https://github.com/apache/spark/pull/17503
@srowen I am not sure whether I understand your question clearly.
RandomForest uses LearningNode to construct tree model when training, and
convert them to Leaf or InternalNode at last. Hence, all
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/17503
**[Test build #3675 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3675/testReport)**
for PR 17503 at commit
Github user SparkQA commented on the issue:
https://github.com/apache/spark/pull/17503
**[Test build #3675 has
started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3675/testReport)**
for PR 17503 at commit
Github user srowen commented on the issue:
https://github.com/apache/spark/pull/17503
It looks reasonable though I don't feel qualified to review it. I thought
the nodes had more than just the majority class - like the empirical
distribution at the node? That would make them not
Github user facaiy commented on the issue:
https://github.com/apache/spark/pull/17503
@srowen Hi, could you review the PR? The PR is simple, though many code for
unit test are added. Thanks.
---
If your project is set up for it, you can reply to this email and have your
reply appear
Github user facaiy commented on the issue:
https://github.com/apache/spark/pull/17503
@jkbradley @hhbyyh Could you review the PR? thanks.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have
Github user AmplabJenkins commented on the issue:
https://github.com/apache/spark/pull/17503
Can one of the admins verify this patch?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this
19 matches
Mail list logo