[ https://issues.apache.org/jira/browse/SPARK-12375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16223120#comment-16223120 ]
Weichen Xu commented on SPARK-12375:
------------------------------------

Coordinated with [~yuhaoyan]; I will take this over. Thanks!

> VectorIndexer: allow unknown categories
> ---------------------------------------
>
>                 Key: SPARK-12375
>                 URL: https://issues.apache.org/jira/browse/SPARK-12375
>             Project: Spark
>          Issue Type: Sub-task
>          Components: ML
>            Reporter: Joseph K. Bradley
>            Assignee: yuhao yang
>
> Add option for allowing unknown categories, probably via a parameter like
> "allowUnknownCategories."
> If true, then handle unknown categories during transform by assigning them to
> an extra category index.
> The API should resemble the API used for StringIndexer.
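The behaviour requested in the issue description can be illustrated with a minimal plain-Python sketch. It is not Spark code and not the eventual VectorIndexer API: the function names `fit_category_index` and `transform_values` are hypothetical, the flag `allow_unknown_categories` only mirrors the parameter name tentatively suggested in the issue, and assigning indices in first-seen order is an assumption made for brevity.

```python
from typing import Dict, Hashable, Iterable, List


def fit_category_index(values: Iterable[Hashable]) -> Dict[Hashable, int]:
    """Assign indices 0..k-1 to the distinct values seen while fitting.

    Assumption: indices follow first-seen order; a real indexer may order
    categories differently.
    """
    index: Dict[Hashable, int] = {}
    for value in values:
        if value not in index:
            index[value] = len(index)
    return index


def transform_values(
    values: Iterable[Hashable],
    index: Dict[Hashable, int],
    allow_unknown_categories: bool = False,
) -> List[int]:
    """Map values to category indices.

    With allow_unknown_categories=True, any value not seen during fitting
    is sent to one extra index, k, placed after the k known categories.
    Otherwise an unseen value raises ValueError.
    """
    unknown_index = len(index)  # the single extra category index
    result: List[int] = []
    for value in values:
        if value in index:
            result.append(index[value])
        elif allow_unknown_categories:
            result.append(unknown_index)
        else:
            raise ValueError(f"Unknown category: {value!r}")
    return result


# Fit on three distinct categories, then transform data containing 9.0,
# which was never seen during fitting.
index = fit_category_index([2.0, 5.0, 2.0, 7.0])
indexed = transform_values([5.0, 9.0, 2.0], index, allow_unknown_categories=True)
print(indexed)  # [1, 3, 0]
```

With the flag left at its default of False, the same call raises ValueError on 9.0, which corresponds to failing on unseen categories during transform.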