Xiangrui Meng created SPARK-5956:
------------------------------------

             Summary: Transformer/Estimator should be copyable.
                 Key: SPARK-5956
                 URL: https://issues.apache.org/jira/browse/SPARK-5956
             Project: Spark
          Issue Type: Sub-task
          Components: ML
    Affects Versions: 1.3.0
            Reporter: Xiangrui Meng

In a pipeline, we don't save the additional params specified in `fit()` to the transformers, because we should not modify the originals. The current solution is to store the training params in the pipeline model and apply them at `transform()`.

A better solution would be to make transformers copyable. Calling `.copy` on a transformer would produce a new transformer with a different UID but the same params. The pipeline model could then hold the copied transformers, with the additional params stored on the copies.

`copy` may not be a good name, because the result is not an exact copy.
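
The copy semantics proposed above could look roughly like the following. This is a minimal sketch in plain Python, not Spark's actual API: the class `ScalingTransformer`, its `factor` param, and the `extra` argument to `copy` are all hypothetical names chosen for illustration.

```python
import uuid


class ScalingTransformer:
    """Hypothetical transformer used only to illustrate the proposal."""

    def __init__(self, params=None):
        # Every instance gets its own unique identifier.
        self.uid = "ScalingTransformer-" + uuid.uuid4().hex
        # Defensive copy so the caller's dict is never shared with us.
        self.params = dict(params or {})

    def copy(self, extra=None):
        """Return a new transformer with a fresh UID and the same params,
        with any `extra` params layered on top. The original is untouched."""
        merged = dict(self.params)
        merged.update(extra or {})
        return ScalingTransformer(merged)

    def transform(self, values):
        # Multiply each value by the configured factor (assumed default 1.0).
        factor = self.params.get("factor", 1.0)
        return [v * factor for v in values]


original = ScalingTransformer({"factor": 2.0})
# Params supplied at fit() time are stored on the copy, not on the original.
fitted_stage = original.copy({"factor": 3.0})
```

Here `fitted_stage` is what a pipeline model would keep. It carries the extra params itself, so nothing needs to be re-applied at `transform()`, and `original` keeps its own UID and params.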