[jira] [Commented] (SPARK-3530) Pipeline and Parameters

Sandy Ryza (JIRA) Wed, 17 Sep 2014 18:04:51 -0700

    [ 
https://issues.apache.org/jira/browse/SPARK-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14138319#comment-14138319
 ]


Sandy Ryza commented on SPARK-3530:
-----------------------------------

bq. Isn't the "fit multiple models at once" part a bit of an early optimization 
?

I personally think this is a useful feature in nearly all situations and that 
parameter search is one of the most important problems for a machine learning 
framework to address.  Avoiding premature optimization usually refers to 
getting bang for buck in terms of time spent.  However, if this something we 
think might even be eventually useful, it's worth making API decisions that 
will accommodate it.

> Pipeline and Parameters
> -----------------------
>
>                 Key: SPARK-3530
>                 URL: https://issues.apache.org/jira/browse/SPARK-3530
>             Project: Spark
>          Issue Type: Sub-task
>          Components: ML, MLlib
>            Reporter: Xiangrui Meng
>            Assignee: Xiangrui Meng
>            Priority: Critical
>
> This part of the design doc is for pipelines and parameters. I put the design 
> doc at
> https://docs.google.com/document/d/1rVwXRjWKfIb-7PI6b86ipytwbUH7irSNLF1_6dLmh8o/edit?usp=sharing
> I will copy the proposed interfaces to this JIRA later. Some sample code can 
> be viewed at: https://github.com/mengxr/spark-ml/
> Please help review the design and post your comments here. Thanks!



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

[jira] [Commented] (SPARK-3530) Pipeline and Parameters

Reply via email to