Re: Performance evaluation of Trino 468, Spark 4.0.0-RC2, and Hive 4 on Tez/MR3

2025-07-02 Thread Sungwoo Park
Hello, we have published a follow-up blog post that compares the latest versions: 1) Trino 476, 2) Spark 4.0.0, 3) Hive 4 on MR3 2.1. At the end, we discuss MPP and MapReduce. https://mr3docs.datamonad.com/blog/2025-07-02-performance-evaluation-2.1 --- Sungwoo On Tue, Apr 22, 2025 at 7:08 PM Sungwoo

[PYSPARK] createDataFrame throws exception with Python 3.12.3

2025-07-02 Thread Eyck Troschke
Dear Spark Development Community, are there any known issues with SparkSession.createDataFrame in PySpark 4.0 with Python 3.12.3? In my environment, spark.range() and spark.read.json() work as expected, but spark.createDataFrame() throws an exception. Example: from pyspark.sql import Row, Spar