Joseph K. Bradley created SPARK-12375:
-----------------------------------------

             Summary: VectorIndexer: allow unknown categories
                 Key: SPARK-12375
                 URL: https://issues.apache.org/jira/browse/SPARK-12375
             Project: Spark
          Issue Type: Sub-task
          Components: ML
            Reporter: Joseph K. Bradley


Add option for allowing unknown categories, probably via a parameter like 
"allowUnknownCategories."
If true, then handle unknown categories during transform by assigning them to 
an extra category index.

The API should resemble the API used for StringIndexer.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to