[jira] [Commented] (SPARK-41745) SparkSession.createDataFrame does not respect the column names in the row
[ https://issues.apache.org/jira/browse/SPARK-41745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17653528#comment-17653528 ] Apache Spark commented on SPARK-41745: -- User 'HyukjinKwon' has created a pull request for this issue: https://github.com/apache/spark/pull/39342 > SparkSession.createDataFrame does not respect the column names in the row > - > > Key: SPARK-41745 > URL: https://issues.apache.org/jira/browse/SPARK-41745 > Project: Spark > Issue Type: Sub-task > Components: Connect >Affects Versions: 3.4.0 >Reporter: Hyukjin Kwon >Assignee: Ruifeng Zheng >Priority: Major > Fix For: 3.4.0 > > > {code} > File "/.../spark/python/pyspark/sql/connect/group.py", line 173, in > pyspark.sql.connect.group.GroupedData.pivot > Failed example: > df1.show() > Differences (ndiff with -expected +actual): > - +--+++ > ? --- > + +--++-+ > - |course|year|earnings| > + |_1| _2| _3| > - +--+++ > ? --- > + +--++-+ > - |dotNET|2012| 1| > ? --- > + |dotNET|2012|1| > - | Java|2012| 2| > ? --- > + | Java|2012|2| > - |dotNET|2012|5000| > ? --- > + |dotNET|2012| 5000| > - |dotNET|2013| 48000| > ? --- > + |dotNET|2013|48000| > - | Java|2013| 3| > ? --- > + | Java|2013|3| > - +--+++ > ? --- > + +--++-+ > + > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-41745) SparkSession.createDataFrame does not respect the column names in the row
[ https://issues.apache.org/jira/browse/SPARK-41745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17653527#comment-17653527 ] Apache Spark commented on SPARK-41745: -- User 'HyukjinKwon' has created a pull request for this issue: https://github.com/apache/spark/pull/39342 > SparkSession.createDataFrame does not respect the column names in the row > - > > Key: SPARK-41745 > URL: https://issues.apache.org/jira/browse/SPARK-41745 > Project: Spark > Issue Type: Sub-task > Components: Connect >Affects Versions: 3.4.0 >Reporter: Hyukjin Kwon >Assignee: Ruifeng Zheng >Priority: Major > Fix For: 3.4.0 > > > {code} > File "/.../spark/python/pyspark/sql/connect/group.py", line 173, in > pyspark.sql.connect.group.GroupedData.pivot > Failed example: > df1.show() > Differences (ndiff with -expected +actual): > - +--+++ > ? --- > + +--++-+ > - |course|year|earnings| > + |_1| _2| _3| > - +--+++ > ? --- > + +--++-+ > - |dotNET|2012| 1| > ? --- > + |dotNET|2012|1| > - | Java|2012| 2| > ? --- > + | Java|2012|2| > - |dotNET|2012|5000| > ? --- > + |dotNET|2012| 5000| > - |dotNET|2013| 48000| > ? --- > + |dotNET|2013|48000| > - | Java|2013| 3| > ? --- > + | Java|2013|3| > - +--+++ > ? --- > + +--++-+ > + > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-41745) SparkSession.createDataFrame does not respect the column names in the row
[ https://issues.apache.org/jira/browse/SPARK-41745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17653334#comment-17653334 ] Apache Spark commented on SPARK-41745: -- User 'grundprinzip' has created a pull request for this issue: https://github.com/apache/spark/pull/39330 > SparkSession.createDataFrame does not respect the column names in the row > - > > Key: SPARK-41745 > URL: https://issues.apache.org/jira/browse/SPARK-41745 > Project: Spark > Issue Type: Sub-task > Components: Connect >Affects Versions: 3.4.0 >Reporter: Hyukjin Kwon >Assignee: Ruifeng Zheng >Priority: Major > Fix For: 3.4.0 > > > {code} > File "/.../spark/python/pyspark/sql/connect/group.py", line 173, in > pyspark.sql.connect.group.GroupedData.pivot > Failed example: > df1.show() > Differences (ndiff with -expected +actual): > - +--+++ > ? --- > + +--++-+ > - |course|year|earnings| > + |_1| _2| _3| > - +--+++ > ? --- > + +--++-+ > - |dotNET|2012| 1| > ? --- > + |dotNET|2012|1| > - | Java|2012| 2| > ? --- > + | Java|2012|2| > - |dotNET|2012|5000| > ? --- > + |dotNET|2012| 5000| > - |dotNET|2013| 48000| > ? --- > + |dotNET|2013|48000| > - | Java|2013| 3| > ? --- > + | Java|2013|3| > - +--+++ > ? --- > + +--++-+ > + > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-41745) SparkSession.createDataFrame does not respect the column names in the row
[ https://issues.apache.org/jira/browse/SPARK-41745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17653335#comment-17653335 ] Apache Spark commented on SPARK-41745: -- User 'grundprinzip' has created a pull request for this issue: https://github.com/apache/spark/pull/39330 > SparkSession.createDataFrame does not respect the column names in the row > - > > Key: SPARK-41745 > URL: https://issues.apache.org/jira/browse/SPARK-41745 > Project: Spark > Issue Type: Sub-task > Components: Connect >Affects Versions: 3.4.0 >Reporter: Hyukjin Kwon >Assignee: Ruifeng Zheng >Priority: Major > Fix For: 3.4.0 > > > {code} > File "/.../spark/python/pyspark/sql/connect/group.py", line 173, in > pyspark.sql.connect.group.GroupedData.pivot > Failed example: > df1.show() > Differences (ndiff with -expected +actual): > - +--+++ > ? --- > + +--++-+ > - |course|year|earnings| > + |_1| _2| _3| > - +--+++ > ? --- > + +--++-+ > - |dotNET|2012| 1| > ? --- > + |dotNET|2012|1| > - | Java|2012| 2| > ? --- > + | Java|2012|2| > - |dotNET|2012|5000| > ? --- > + |dotNET|2012| 5000| > - |dotNET|2013| 48000| > ? --- > + |dotNET|2013|48000| > - | Java|2013| 3| > ? --- > + | Java|2013|3| > - +--+++ > ? --- > + +--++-+ > + > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-41745) SparkSession.createDataFrame does not respect the column names in the row
[ https://issues.apache.org/jira/browse/SPARK-41745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17653103#comment-17653103 ] Apache Spark commented on SPARK-41745: -- User 'zhengruifeng' has created a pull request for this issue: https://github.com/apache/spark/pull/39313 > SparkSession.createDataFrame does not respect the column names in the row > - > > Key: SPARK-41745 > URL: https://issues.apache.org/jira/browse/SPARK-41745 > Project: Spark > Issue Type: Sub-task > Components: Connect >Affects Versions: 3.4.0 >Reporter: Hyukjin Kwon >Priority: Major > > {code} > File "/.../spark/python/pyspark/sql/connect/group.py", line 173, in > pyspark.sql.connect.group.GroupedData.pivot > Failed example: > df1.show() > Differences (ndiff with -expected +actual): > - +--+++ > ? --- > + +--++-+ > - |course|year|earnings| > + |_1| _2| _3| > - +--+++ > ? --- > + +--++-+ > - |dotNET|2012| 1| > ? --- > + |dotNET|2012|1| > - | Java|2012| 2| > ? --- > + | Java|2012|2| > - |dotNET|2012|5000| > ? --- > + |dotNET|2012| 5000| > - |dotNET|2013| 48000| > ? --- > + |dotNET|2013|48000| > - | Java|2013| 3| > ? --- > + | Java|2013|3| > - +--+++ > ? --- > + +--++-+ > + > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-41745) SparkSession.createDataFrame does not respect the column names in the row
[ https://issues.apache.org/jira/browse/SPARK-41745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17653043#comment-17653043 ] Ruifeng Zheng commented on SPARK-41745: --- I will take a look > SparkSession.createDataFrame does not respect the column names in the row > - > > Key: SPARK-41745 > URL: https://issues.apache.org/jira/browse/SPARK-41745 > Project: Spark > Issue Type: Sub-task > Components: Connect >Affects Versions: 3.4.0 >Reporter: Hyukjin Kwon >Priority: Major > > {code} > File "/.../spark/python/pyspark/sql/connect/group.py", line 173, in > pyspark.sql.connect.group.GroupedData.pivot > Failed example: > df1.show() > Differences (ndiff with -expected +actual): > - +--+++ > ? --- > + +--++-+ > - |course|year|earnings| > + |_1| _2| _3| > - +--+++ > ? --- > + +--++-+ > - |dotNET|2012| 1| > ? --- > + |dotNET|2012|1| > - | Java|2012| 2| > ? --- > + | Java|2012|2| > - |dotNET|2012|5000| > ? --- > + |dotNET|2012| 5000| > - |dotNET|2013| 48000| > ? --- > + |dotNET|2013|48000| > - | Java|2013| 3| > ? --- > + | Java|2013|3| > - +--+++ > ? --- > + +--++-+ > + > {code} -- This message was sent by Atlassian Jira (v8.20.10#820010) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org