[ https://issues.apache.org/jira/browse/SPARK-30449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17009924#comment-17009924 ]
Krishna Kumar Tiwari commented on SPARK-30449:
----------------------------------------------

I am already working on this and will share the PR soon.

> Introducing get_dummies method in pyspark
> -----------------------------------------
>
>                 Key: SPARK-30449
>                 URL: https://issues.apache.org/jira/browse/SPARK-30449
>             Project: Spark
>          Issue Type: New Feature
>          Components: PySpark
>    Affects Versions: 2.4.4
>            Reporter: Krishna Kumar Tiwari
>            Priority: Major
>
> Introduce a get_dummies method in PySpark, equivalent to the one in pandas.
> When working with a categorical variable, we often want to flatten the data
> by one-hot encoding it: generate one column per category and fill in the
> resulting indicator matrix. get_dummies is very useful in that scenario.
>
> The objective here is to introduce get_dummies to PySpark.
>

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
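
For readers unfamiliar with the behaviour being requested, here is a minimal pure-Python sketch of the one-hot expansion described in the issue. It is not the proposed PySpark implementation and does not use the Spark API. The function name `one_hot_rows` and the sample data are hypothetical, and the `<column>_<value>` column naming is assumed to mirror what pandas' get_dummies does by default.

```python
def one_hot_rows(rows, column, prefix=None):
    """One-hot encode `column` in a list of dict rows (illustrative sketch only)."""
    prefix = column if prefix is None else prefix
    # Collect the distinct categories, sorted so the output columns are deterministic.
    categories = sorted({row[column] for row in rows})
    encoded = []
    for row in rows:
        # Carry over every field except the categorical one being expanded.
        out = {key: value for key, value in row.items() if key != column}
        # Add one 0/1 indicator column per category.
        for category in categories:
            out[f"{prefix}_{category}"] = 1 if row[column] == category else 0
        encoded.append(out)
    return encoded


# Hypothetical sample data: three rows with one categorical column.
rows = [
    {"id": 1, "color": "red"},
    {"id": 2, "color": "blue"},
    {"id": 3, "color": "red"},
]
result = one_hot_rows(rows, "color")
for encoded_row in result:
    print(encoded_row)
```

Each input row keeps its other fields and gains one indicator column per distinct value of `color`. A distributed version would first have to collect the distinct categories across the whole dataset, which this single-machine sketch does with a set.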