Daniel Davies created SPARK-37788:
-------------------------------------

             Summary: ColumnOrName vs Column in PySpark Functions module
                 Key: SPARK-37788
                 URL: https://issues.apache.org/jira/browse/SPARK-37788
             Project: Spark
          Issue Type: Question
          Components: PySpark
    Affects Versions: 3.2.0
            Reporter: Daniel Davies


PySpark has mainly migrated to supporting both Column input types as well as 
string names of columns ("ColumnOrName") in it's functions module. There seem 
to be a small number of functions that need updating; either on conversions of 
input string names representing columns into the Column type, or simple 
annotation changes that indicate the function supports column string names.

Below are the functions I've seen:
 * F.overlay: Annotation only
 * F.least: Annotation only
 * F.slice: Needs a conversion
 * F.array_repeat: Needs a conversion

See here for additional context: 
[https://github.com/apache/spark/pull/35032#issuecomment-1003033776]

 

 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to