Simon Tao created SPARK-37178: --------------------------------- Summary: Add Target Encoding to ml.feature Key: SPARK-37178 URL: https://issues.apache.org/jira/browse/SPARK-37178 Project: Spark Issue Type: New Feature Components: ML Affects Versions: 3.2.0 Reporter: Simon Tao
Target Encoding is a mechanism of converting categorical features to continues features based on the posterior probability __ calculated from values of the label (target) column. Target Encoding can help to improve accuracy of machine learning algorithms when columns with high cardinality are used as features during training phase. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org