[ https://issues.apache.org/jira/browse/SPARK-10513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14959949#comment-14959949 ]
Yanbo Liang commented on SPARK-10513:
-------------------------------------

[~josephkb] For 4: If a StringType column contains the empty string "" (not null), StringIndexer transforms it correctly, but OneHotEncoder throws an exception because "" cannot be used as a feature name. I think we should discuss whether it is legal for a categorical feature to contain "" values. If it is not, should we filter out such values, or replace "" with a user-specified value?

> Springleaf Marketing Response
> -----------------------------
>
>                 Key: SPARK-10513
>                 URL: https://issues.apache.org/jira/browse/SPARK-10513
>             Project: Spark
>          Issue Type: Sub-task
>          Components: ML
>            Reporter: Yanbo Liang
>            Assignee: Yanbo Liang
>
> Apply ML pipeline API to Springleaf Marketing Response
> (https://www.kaggle.com/c/springleaf-marketing-response)
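
To make the replacement option from the comment concrete (swapping "" for a user-specified value before encoding), here is a minimal plain-Python sketch. It does not use Spark. The helper names `replace_empty`, `index_labels`, and `one_hot_feature_names`, the placeholder `__EMPTY__`, and the column name `category` are all hypothetical. The frequency-based ordering and the empty-name check only loosely mimic the behavior described for StringIndexer and OneHotEncoder.

```python
from collections import Counter


def replace_empty(values, placeholder="__EMPTY__"):
    """Replace empty-string categories with a placeholder; None is left untouched."""
    return [placeholder if v == "" else v for v in values]


def index_labels(values):
    """Order distinct non-null labels by descending frequency, ties broken alphabetically."""
    counts = Counter(v for v in values if v is not None)
    return sorted(counts, key=lambda label: (-counts[label], label))


def one_hot_feature_names(column, labels):
    """Build one feature name per label, rejecting "" as the comment reports."""
    names = []
    for label in labels:
        if label == "":
            # Assumption: stands in for the exception described in the comment.
            raise ValueError("empty string cannot be used as a feature name")
        names.append(f"{column}_{label}")
    return names


raw = ["a", "", "b", "a", ""]          # hypothetical column values
cleaned = replace_empty(raw)            # "" becomes "__EMPTY__"
labels = index_labels(cleaned)
feature_names = one_hot_feature_names("category", labels)
```

Passing `raw` straight to `index_labels` and then to `one_hot_feature_names` raises the `ValueError`, which is the failure the comment describes. Filtering the rows instead would drop the information that the value was empty, so a placeholder may be preferable when emptiness itself is informative.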