Re: Spark 3.3.0/3.2.2: java.io.IOException: can not read class org.apache.parquet.format.PageHeader: don't know what type: 15

2022-09-01 Thread FengYu Cao
tach the file to it? I can take a look. > > Chao > > On Thu, Sep 1, 2022 at 4:03 AM FengYu Cao wrote: > > > > I'm trying to upgrade our spark (3.2.1 now) > > > > but with spark 3.3.0 and spark 3.2.2, we had error with specific parquet > file > > &

Spark 3.3.0/3.2.2: java.io.IOException: can not read class org.apache.parquet.format.PageHeader: don't know what type: 15

2022-09-01 Thread FengYu Cao
I'm trying to upgrade our spark (3.2.1 now) but with spark 3.3.0 and spark 3.2.2, we had error with specific parquet file Is anyone else having the same problem as me? Or do I need to provide any information to the devs ? ``` org.apache.spark.SparkException: Job aborted due to stage failure: Ta

Re: [ANNOUNCE] Apache Spark 3.2.1 released

2022-01-28 Thread FengYu Cao
https://spark.apache.org/downloads.html *2. Choose a package type:* menu shows that Pre-built for Hadoop 3.3 but download link is *spark-3.2.1-bin-hadoop3.2.tgz* need an update? L. C. Hsieh 于2022年1月29日周六 14:26写道: > Thanks Huaxin for the 3.2.1 release! > > On Fri, Jan 28, 2022 at 10:14 PM Dong