[ https://issues.apache.org/jira/browse/SPARK-9691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Reynold Xin updated SPARK-9691: ------------------------------- Fix Version/s: 1.4.2 > PySpark SQL rand function treats seed 0 as no seed > -------------------------------------------------- > > Key: SPARK-9691 > URL: https://issues.apache.org/jira/browse/SPARK-9691 > Project: Spark > Issue Type: Bug > Components: PySpark, SQL > Affects Versions: 1.4.0, 1.4.1, 1.4.2, 1.5.0 > Reporter: Joseph K. Bradley > Assignee: Yin Huai > Fix For: 1.4.2, 1.5.0 > > > In PySpark SQL's rand() function, it tests for a seed in a way such that seed > 0 is treated as no seed, leading to non-deterministic results when a user > would expect deterministic results. > See: > [https://github.com/apache/spark/blob/98e69467d4fda2c26a951409b5b7c6f1e9345ce4/python/pyspark/sql/functions.py#L271] -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org