Simon Tao created SPARK-37178:
---------------------------------

             Summary: Add Target Encoding to ml.feature
                 Key: SPARK-37178
                 URL: https://issues.apache.org/jira/browse/SPARK-37178
             Project: Spark
          Issue Type: New Feature
          Components: ML
    Affects Versions: 3.2.0
            Reporter: Simon Tao


Target Encoding is a mechanism of converting categorical features to continues 
features based on the posterior probability __ calculated from values of the 
label (target) column.

 

Target Encoding can help to improve accuracy of machine learning algorithms 
when columns with high cardinality are used as features during training phase.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to