[ https://issues.apache.org/jira/browse/SPARK-41773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Apache Spark reassigned SPARK-41773:
------------------------------------

    Assignee: Apache Spark

> Window.partitionBy is not respected with row_number
> ----------------------------------------------------
>
>                 Key: SPARK-41773
>                 URL: https://issues.apache.org/jira/browse/SPARK-41773
>             Project: Spark
>          Issue Type: Sub-task
>          Components: Connect
>    Affects Versions: 3.4.0
>            Reporter: Hyukjin Kwon
>            Assignee: Apache Spark
>            Priority: Major
>
> {code}
> File "/.../spark/python/pyspark/sql/connect/window.py", line 292, in pyspark.sql.connect.window.Window.orderBy
> Failed example:
>     df.withColumn("row_number", row_number().over(window)).show()
> Expected:
>     +---+--------+----------+
>     | id|category|row_number|
>     +---+--------+----------+
>     |  1|       a|         1|
>     |  1|       a|         2|
>     |  1|       b|         3|
>     |  2|       a|         1|
>     |  2|       b|         2|
>     |  3|       b|         1|
>     +---+--------+----------+
> Got:
>     +---+--------+----------+
>     | id|category|row_number|
>     +---+--------+----------+
>     |  1|       b|         1|
>     |  1|       a|         2|
>     |  1|       a|         3|
>     |  2|       b|         1|
>     |  2|       a|         2|
>     |  3|       b|         1|
>     +---+--------+----------+
> {code}
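
For reference, the numbering in the "Expected" table can be reproduced without Spark. The following is a minimal pure-Python sketch, not the Spark Connect implementation. It assumes the doctest's window partitions by `id` and orders by `category` (the message does not show the window definition), and the helper name `row_number_over` and the input row order are hypothetical.

```python
from itertools import groupby
from operator import itemgetter


def row_number_over(rows, partition_key, order_key):
    """Number rows 1..n inside each partition, following the ordering key.

    Hypothetical helper that mimics row_number() over a window; it is not
    part of PySpark.
    """
    # Sort by partition first, then by the ordering column within it.
    ordered = sorted(rows, key=lambda r: (r[partition_key], r[order_key]))
    numbered = []
    # groupby only merges adjacent rows, which the sort above guarantees.
    for _, group in groupby(ordered, key=itemgetter(partition_key)):
        for n, row in enumerate(group, start=1):
            numbered.append({**row, "row_number": n})
    return numbered


# Rows taken from the "Expected" table; the input order is an assumption.
rows = [
    {"id": 1, "category": "a"},
    {"id": 1, "category": "a"},
    {"id": 2, "category": "a"},
    {"id": 1, "category": "b"},
    {"id": 2, "category": "b"},
    {"id": 3, "category": "b"},
]

result = row_number_over(rows, "id", "category")
for r in result:
    print(r["id"], r["category"], r["row_number"])
```

Under these assumptions the sketch yields the same `(id, category, row_number)` sequence as the "Expected" table, whereas the "Got" table numbers the `b` row first within `id` 1 and `id` 2.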