[ https://issues.apache.org/jira/browse/SPARK-41743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Apache Spark reassigned SPARK-41743: ------------------------------------ Assignee: (was: Apache Spark) > groupBy(...).agg(...).sort does not actually sort the output > ------------------------------------------------------------ > > Key: SPARK-41743 > URL: https://issues.apache.org/jira/browse/SPARK-41743 > Project: Spark > Issue Type: Sub-task > Components: Connect > Affects Versions: 3.4.0 > Reporter: Hyukjin Kwon > Priority: Major > > {code} > ********************************************************************** > File "/.../spark/python/pyspark/sql/connect/group.py", line 211, in > pyspark.sql.connect.group.GroupedData.agg > Failed example: > df.groupBy(df.name).agg(F.min(df.age)).sort("name").show() > Differences (ndiff with -expected +actual): > +-----+--------+ > | name|min(age)| > +-----+--------+ > + | Bob| 5| > |Alice| 2| > - | Bob| 5| > +-----+--------+ > + <BLANKLINE> > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org