[ https://issues.apache.org/jira/browse/SPARK-41833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sandeep Singh updated SPARK-41833:
----------------------------------
    Description:
{code:java}
File "/Users/s.singh/personal/spark-oss/python/pyspark/sql/connect/functions.py", line 1117, in pyspark.sql.connect.functions.array
Failed example:
    df.select(array('age', 'age').alias("arr")).collect()
Expected:
    [Row(arr=[2, 2]), Row(arr=[5, 5])]
Got:
    [Row(arr=array([2, 2])), Row(arr=array([5, 5]))]{code}

  was:
{code:java}
File "/Users/s.singh/personal/spark-oss/python/pyspark/sql/connect/dataframe.py", line 584, in pyspark.sql.connect.dataframe.DataFrame.unionByName
Failed example:
    df1.unionByName(df2).show()
Expected:
    +----+----+----+
    |col0|col1|col2|
    +----+----+----+
    |   1|   2|   3|
    |   6|   4|   5|
    +----+----+----+
Got:
    +----+----+----+
    |col0|col1|col2|
    +----+----+----+
    |   1|   2|   3|
    |   4|   5|   6|
    +----+----+----+
    <BLANKLINE>{code}


> DataFrame.collect() output parity with pyspark
> ----------------------------------------------
>
>                 Key: SPARK-41833
>                 URL: https://issues.apache.org/jira/browse/SPARK-41833
>             Project: Spark
>          Issue Type: Sub-task
>          Components: Connect
>    Affects Versions: 3.4.0
>            Reporter: Sandeep Singh
>            Priority: Major
>
> {code:java}
> File "/Users/s.singh/personal/spark-oss/python/pyspark/sql/connect/functions.py", line 1117, in pyspark.sql.connect.functions.array
> Failed example:
>     df.select(array('age', 'age').alias("arr")).collect()
> Expected:
>     [Row(arr=[2, 2]), Row(arr=[5, 5])]
> Got:
>     [Row(arr=array([2, 2])), Row(arr=array([5, 5]))]{code}
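
The doctest failure reported above comes down to a repr difference: the expected output holds plain Python lists, while the collected rows hold array-like values that print as array([...]). The sketch below illustrates that difference and one possible way to normalize such values. It is not the change made in Spark. FakeArray and to_builtin are hypothetical names introduced only for this illustration, and the sketch assumes the array-like values expose a NumPy-style tolist() method.

```python
from typing import Any


class FakeArray:
    """Hypothetical stand-in for an array-like value (mimics only tolist() and the repr)."""

    def __init__(self, items):
        self._items = list(items)

    def tolist(self):
        return list(self._items)

    def __repr__(self):
        return f"array({self._items!r})"


def to_builtin(value: Any) -> Any:
    """Recursively replace array-like values with plain Python lists."""
    # Assumption: anything with a callable tolist() is array-like.
    if callable(getattr(value, "tolist", None)):
        return to_builtin(value.tolist())
    if isinstance(value, dict):
        return {key: to_builtin(item) for key, item in value.items()}
    if isinstance(value, list):
        return [to_builtin(item) for item in value]
    return value


got = {"arr": FakeArray([2, 2])}
print(got)              # {'arr': array([2, 2])}
print(to_builtin(got))  # {'arr': [2, 2]}
```

Because doctest compares printed output as text, the two printed lines differ even though the underlying numbers are the same, which is why the example in the description fails.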