[
https://issues.apache.org/jira/browse/SPARK-49882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17892209#comment-17892209
]
Dongjoon Hyun commented on SPARK-49882:
---------------------------------------
Nice! Thank you, Hyukjin.
> Handle or document `NumPy 2.1` difference in Python 3.13
> --------------------------------------------------------
>
> Key: SPARK-49882
> URL: https://issues.apache.org/jira/browse/SPARK-49882
> Project: Spark
> Issue Type: Sub-task
> Components: PySpark
> Affects Versions: 4.0.0
> Reporter: Dongjoon Hyun
> Priority: Major
>
> Although SPARK-48710 fixed to use NumPy 2.0 compatible types. Python 3.13
> requires NumPy 2.1 (SPARK-49869) and seems to reveal another instances of
> differences.
> New `NumPy 2.1` seems to have a different output style in Python 3.13. Of
> course, the values are correct.
> - https://github.com/apache/spark/actions/runs/11186188886/job/31100777649
> {code}
> **********************************************************************
> File "/__w/spark/spark/python/pyspark/core/rdd.py", line 2463, in
> __main__.RDD.sampleStdev
> Failed example:
> sc.parallelize([1, 2, 3]).sampleStdev()
> Expected:
> 1.0
> Got:
> np.float64(1.0)
> **********************************************************************
> File "/__w/spark/spark/python/pyspark/core/rdd.py", line 2436, in
> __main__.RDD.stdev
> Failed example:
> sc.parallelize([1, 2, 3]).stdev()
> Expected:
> 0.816...
> Got:
> np.float64(0.816496580927726)
> **********************************************************************
> {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]