[
https://issues.apache.org/jira/browse/SPARK-37788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17466988#comment-17466988
]
Apache Spark commented on SPARK-37788:
--------------------------------------
User 'Daniel-Davies' has created a pull request for this issue:
https://github.com/apache/spark/pull/35071
> ColumnOrName vs Column in PySpark Functions module
> --------------------------------------------------
>
> Key: SPARK-37788
> URL: https://issues.apache.org/jira/browse/SPARK-37788
> Project: Spark
> Issue Type: Question
> Components: PySpark
> Affects Versions: 3.2.0
> Reporter: Daniel Davies
> Priority: Minor
>
> PySpark has mainly migrated to supporting both Column input types as well as
> string names of columns ("ColumnOrName") in it's functions module. There seem
> to be a small number of functions that need updating; either on conversions
> of input string names representing columns into the Column type, or simple
> annotation changes that indicate the function supports column string names.
> Below are the functions I've seen:
> * F.overlay: Annotation only
> * F.least: Annotation only
> * F.slice: Needs a conversion
> * F.array_repeat: Needs a conversion
> See here for additional context:
> [https://github.com/apache/spark/pull/35032#issuecomment-1003033776]
> I'm happy to make a quick PR fixing these, if there is no reason for these
> functions being handled as a special case.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]