Yanbo Liang created SPARK-11349:
-----------------------------------
Summary: Support transform string label for RFormula
Key: SPARK-11349
URL: https://issues.apache.org/jira/browse/SPARK-11349
Project: Spark
Issue Type: Sub-task
Components: ML
Reporter: Yanbo Liang
Currently RFormula can only handle a label of NumericType or BooleanType (it
casts the label to DoubleType for Linear Regression training). We should also
support a label of StringType, which is needed for Logistic Regression (glm with
family = "binomial").
For a label of StringType, we should use StringIndexer to transform it into a
0-based index.
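
As a rough illustration of the proposed label handling, here is a minimal
plain-Python sketch of what a StringIndexer-style transform does to a string
label column. It does not use the Spark API: fit_string_indexer and
transform_labels are hypothetical helper names, and the alphabetical
tie-breaking is an assumption made only to keep the sketch deterministic.
Ordering by descending frequency mirrors StringIndexer's default behavior, so
the most frequent label receives index 0.

```python
from collections import Counter


def fit_string_indexer(labels):
    """Map each distinct string label to a 0-based index (as a float),
    assigning index 0 to the most frequent label."""
    counts = Counter(labels)
    # Sort by descending frequency; ties are broken alphabetically
    # (an assumption of this sketch, chosen for determinism).
    ordered = sorted(counts, key=lambda label: (-counts[label], label))
    return {label: float(index) for index, label in enumerate(ordered)}


def transform_labels(labels, mapping):
    """Replace every string label with its numeric index."""
    return [mapping[label] for label in labels]


# Hypothetical binary label column, as used for logistic regression.
labels = ["yes", "no", "yes", "yes", "no", "yes"]
mapping = fit_string_indexer(labels)
indexed = transform_labels(labels, mapping)
```

The indices are emitted as floats because the issue describes the label column
as DoubleType, so a string label would end up in the same numeric form that
numeric labels already take after the cast.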
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)