[ https://issues.apache.org/jira/browse/SPARK-6916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14496456#comment-14496456 ]
Joseph K. Bradley commented on SPARK-6916:
------------------------------------------

Yeah, I guess I don't know what users will use the other metadata for. We may want to add an option later. I'll change VectorIndexer to match StringIndexer so it does not maintain non-ML metadata. But unlike StringIndexer, it will maintain some ML metadata such as feature names and statistics for numerical columns.

> StringIndexer should preserve non-ML metadata
> ---------------------------------------------
>
>                 Key: SPARK-6916
>                 URL: https://issues.apache.org/jira/browse/SPARK-6916
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>    Affects Versions: 1.4.0
>            Reporter: Joseph K. Bradley
>            Priority: Minor
>
> StringIndexer sets the output column metadata to hold the ML-specific
> metadata created after indexing the column. It does not carry over non-ML
> metadata from the input column. I figure it should. (That's the behavior I
> used for VectorIndexer.)
> Ping [~mengxr]
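
To make the two behaviors under discussion concrete, here is a rough sketch in plain Python rather than Spark code. It models column metadata as a dict and shows the difference between replacing the metadata with the ML-specific part only and carrying non-ML keys over from the input column. The function names, the `keep_non_ml` flag, the `source` key, the `ml_attr` key name, and the alphabetical tie-break are all assumptions made for illustration, not the actual StringIndexer implementation or API.

```python
# Hypothetical model: a column's metadata is a plain dict, and the
# ML-specific part lives under one reserved key. The key name "ml_attr"
# is an assumption for this sketch.
ML_KEY = "ml_attr"


def index_labels(values):
    """Return the distinct strings ordered by descending frequency.

    The position of a label in the returned list is its index.
    Ties are broken alphabetically (an assumption, for determinism).
    """
    counts = {}
    for v in values:
        counts[v] = counts.get(v, 0) + 1
    return sorted(counts, key=lambda v: (-counts[v], v))


def output_metadata(input_meta, labels, keep_non_ml=False):
    """Build the metadata for the indexed output column.

    keep_non_ml=False: only the newly created ML metadata is kept.
    keep_non_ml=True:  non-ML keys from the input column are carried
                       over, and the ML part is replaced.
    """
    ml_meta = {ML_KEY: {"type": "nominal", "vals": list(labels)}}
    if not keep_non_ml:
        return ml_meta
    carried = {k: v for k, v in input_meta.items() if k != ML_KEY}
    carried.update(ml_meta)
    return carried


labels = index_labels(["b", "a", "b", "c"])
input_meta = {"source": "crm_export", ML_KEY: {"type": "numeric"}}

dropped = output_metadata(input_meta, labels)
kept = output_metadata(input_meta, labels, keep_non_ml=True)
```

In this sketch `dropped` contains only the `ml_attr` entry, while `kept` also retains the `source` key from the input column. An option like the hypothetical `keep_non_ml` flag is one way the "add an option later" idea could look.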