[ https://issues.apache.org/jira/browse/SPARK-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14138319#comment-14138319 ]
Sandy Ryza commented on SPARK-3530: ----------------------------------- bq. Isn't the "fit multiple models at once" part a bit of an early optimization ? I personally think this is a useful feature in nearly all situations and that parameter search is one of the most important problems for a machine learning framework to address. Avoiding premature optimization usually refers to getting bang for buck in terms of time spent. However, if this something we think might even be eventually useful, it's worth making API decisions that will accommodate it. > Pipeline and Parameters > ----------------------- > > Key: SPARK-3530 > URL: https://issues.apache.org/jira/browse/SPARK-3530 > Project: Spark > Issue Type: Sub-task > Components: ML, MLlib > Reporter: Xiangrui Meng > Assignee: Xiangrui Meng > Priority: Critical > > This part of the design doc is for pipelines and parameters. I put the design > doc at > https://docs.google.com/document/d/1rVwXRjWKfIb-7PI6b86ipytwbUH7irSNLF1_6dLmh8o/edit?usp=sharing > I will copy the proposed interfaces to this JIRA later. Some sample code can > be viewed at: https://github.com/mengxr/spark-ml/ > Please help review the design and post your comments here. Thanks! -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org