[
https://issues.apache.org/jira/browse/SPARK-54380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18040320#comment-18040320
]
Ariane Horbach commented on SPARK-54380:
----------------------------------------
Hey [~raisa], I'm interested in working on this issue. Before I start, I have a
few questions to ensure I approach this correctly:
* {*}Use case justification{*}: Could you describe a specific scenario where
placing nulls first would be beneficial? Alternatively, is there another
justification to consider, such as alignment with SQL standard behavior (e.g.,
{{{}ORDER BY ... NULLS FIRST/LAST{}}})?
* {*}Backward compatibility{*}: Should we maintain the current default
behavior (nulls last) to avoid breaking existing code? If so, would you support
adding a {{nulls_first}} parameter with a default value of {{{}False{}}}?
I am new contributor, so I want to make sure I understand the requirements and
align with the project's design principles before implementing this feature. I
appreciate any guidance you can provide!
> Adding NULLS_FIRST flag to `array_sort`
> ---------------------------------------
>
> Key: SPARK-54380
> URL: https://issues.apache.org/jira/browse/SPARK-54380
> Project: Spark
> Issue Type: Request
> Components: PySpark
> Affects Versions: 4.0.1
> Reporter: Raisa
> Priority: Minor
>
> Could you please consider adding an option to place NULLS first to
> `pyspark.sql.functions.array_sort` as currently NULLS are placed last?
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]