[ 
https://issues.apache.org/jira/browse/SPARK-14033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15207118#comment-15207118
 ] 

Joseph K. Bradley commented on SPARK-14033:
-------------------------------------------

Academically speaking, I agree with you that there is a distinction between an 
Estimator and a Transformer.

Practically speaking, though, in my experience that distinction is not 
significant for most users.  A new user who wants to use Logistic Regression 
will look for LogisticRegression, and new users have reported being confused 
by finding separate Estimator and Model classes.  An expert who wants to use 
it will presumably have enough background knowledge to understand the 
semantics of the merged concepts.
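
For readers who have not run into the split, here is a minimal sketch of the 
current two-class pattern, in which an Estimator's fit() returns a separate 
Model that does the transforming.  It is plain Python, not Spark code: 
MeanCenterer and MeanCentererModel are hypothetical names invented for 
illustration, and the "learning" is just computing a mean.

```python
# Toy illustration only: these classes are hypothetical, not Spark's API.
from typing import List


class MeanCentererModel:
    """Transformer: holds the fitted state and applies it to data."""

    def __init__(self, mean: float) -> None:
        self.mean = mean

    def transform(self, values: List[float]) -> List[float]:
        return [v - self.mean for v in values]


class MeanCenterer:
    """Estimator: holds only configuration; fit() produces a separate Model."""

    def fit(self, values: List[float]) -> MeanCentererModel:
        if not values:
            raise ValueError("cannot fit on empty data")
        return MeanCentererModel(sum(values) / len(values))


estimator = MeanCenterer()
model = estimator.fit([1.0, 2.0, 3.0])  # learned state lives on the Model
centered = model.transform([2.0, 4.0])  # the Estimator itself cannot transform
```

A user who goes looking for "the mean centerer" finds two classes and has to 
learn which one holds the learned state, which is the kind of confusion 
described above.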

This should also help users coming from other popular ML libraries like 
scikit-learn, which uses these merged semantics.
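
As a rough picture of what merged semantics look like, the sketch below uses 
a single class that is both fitted and used for transforming, following the 
scikit-learn conventions of fit() returning self and learned attributes 
ending in an underscore.  It is plain Python with no scikit-learn or Spark 
dependency; MergedMeanCenterer is a hypothetical name, and this is not the 
design proposed in the linked doc.

```python
# Toy illustration only: a hypothetical merged estimator/model class.
from typing import List, Optional


class MergedMeanCenterer:
    """One class plays both roles, in the style of fit/transform."""

    def __init__(self) -> None:
        self.mean_: Optional[float] = None  # learned state, unset until fit()

    def fit(self, values: List[float]) -> "MergedMeanCenterer":
        if not values:
            raise ValueError("cannot fit on empty data")
        self.mean_ = sum(values) / len(values)
        return self  # returning self allows call chaining

    def transform(self, values: List[float]) -> List[float]:
        if self.mean_ is None:
            raise RuntimeError("call fit() before transform()")
        return [v - self.mean_ for v in values]


centerer = MergedMeanCenterer().fit([1.0, 2.0, 3.0])
centered = centerer.transform([2.0, 4.0])
```

The trade-off is that an unfitted instance now has a transform() method that 
can only fail at run time, which is the academic distinction the separate 
classes were encoding in the type system.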

As a Scala user, I like the idea of complete immutability, but that will likely 
require much more code refactoring for users who have become used to Param 
setter methods modifying instances.
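
To make the refactoring cost concrete, here is a small sketch contrasting a 
setter that modifies the instance with an immutable variant that returns a 
new instance.  Again this is plain Python with hypothetical names 
(MutableParams, ImmutableParams, set_max_iter, with_max_iter), not Spark's 
Param API.

```python
# Toy illustration only: hypothetical parameter holders, not Spark's Param API.
from dataclasses import dataclass, replace


class MutableParams:
    """Setter mutates the instance in place and returns it for chaining."""

    def __init__(self, max_iter: int = 100) -> None:
        self.max_iter = max_iter

    def set_max_iter(self, value: int) -> "MutableParams":
        self.max_iter = value
        return self


@dataclass(frozen=True)
class ImmutableParams:
    """Frozen dataclass: 'setting' a param yields a new instance."""

    max_iter: int = 100

    def with_max_iter(self, value: int) -> "ImmutableParams":
        return replace(self, max_iter=value)


mutable = MutableParams()
mutable.set_max_iter(10)  # existing habit: result ignored, instance changed

immutable = ImmutableParams()
immutable.with_max_iter(10)  # same habit: the new instance is silently dropped
updated = immutable.with_max_iter(10)  # refactored call site keeps the result
```

Any call site that ignores the setter's return value keeps working with the 
mutable version but silently does nothing with the immutable one, so each 
such call would need to be found and rewritten.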

It would be good to know whether the proposal will disrupt users' workflows.  
I believe existing workflows should still work under the proposal, with some 
minor code modifications.

> Merging Estimator, Model, & Transformer
> ---------------------------------------
>
>                 Key: SPARK-14033
>                 URL: https://issues.apache.org/jira/browse/SPARK-14033
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>            Reporter: Joseph K. Bradley
>            Assignee: Timothy Hunter
>         Attachments: StyleMutabilityMergingEstimatorandModel.pdf
>
>
> This JIRA is for merging the spark.ml concepts of Estimator and Model.
> Goal: Have clearer semantics which match existing libraries (such as 
> scikit-learn).
> For details, please see the linked design doc.  Comment on this JIRA to give 
> feedback on the proposed design.  Once the proposal is discussed and this 
> work is confirmed as ready to proceed, this JIRA will serve as an umbrella 
> for the merge tasks.



