[ https://issues.apache.org/jira/browse/SPARK-10785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14990930#comment-14990930 ]
Joseph K. Bradley commented on SPARK-10785: ------------------------------------------- Yes, we should sample still. Extensions to multiple input columns is a different issue, and we can do that later if needed. Other than that, this should be analogous to the tree work. > Scale QuantileDiscretizer using distributed binning > --------------------------------------------------- > > Key: SPARK-10785 > URL: https://issues.apache.org/jira/browse/SPARK-10785 > Project: Spark > Issue Type: Improvement > Components: ML > Reporter: Joseph K. Bradley > > [SPARK-10064] improves binning in decision trees by distributing the > computation. QuantileDiscretizer should do the same. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org