[ 
https://issues.apache.org/jira/browse/SPARK-17090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15426033#comment-15426033
 ] 

Qian Huang commented on SPARK-17090:
------------------------------------

Hi, seth, are you working on it? If not, I am glad to try. I will generate 
couple of datasets with different sizes, features and use them to run 
linear/logistic regression with various value of aggregation depth to get a 
expert formula.

> Make tree aggregation level in linear/logistic regression configurable
> ----------------------------------------------------------------------
>
>                 Key: SPARK-17090
>                 URL: https://issues.apache.org/jira/browse/SPARK-17090
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>            Reporter: Seth Hendrickson
>            Priority: Minor
>
> Linear/logistic regression use treeAggregate with default aggregation depth 
> for collecting coefficient gradient updates to the driver. For high 
> dimensional problems, this can case OOM error on the driver. We should make 
> it configurable, perhaps via an expert param, so that users can avoid this 
> problem if their data has many features.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to