[ https://issues.apache.org/jira/browse/SPARK-40158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
BingKun Pan updated SPARK-40158:
--------------------------------
    Description:
# Remove useless configuration:
hadoopConf.set(ParquetWriteSupport.SPARK_ROW_SCHEMA, readDataSchemaAsJson)
# extract common code:
{quote}ParquetFileFormat.buildReaderWithPartitionValues
([https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFileFormat.scala#L202-L228])
parquet/ParquetScan.createReaderFactory
([https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/v2/parquet/ParquetScan.scala#L66-L93])
{quote}

> Remove useless configuration & extract common code for parquet read
> -------------------------------------------------------------------
>
>                 Key: SPARK-40158
>                 URL: https://issues.apache.org/jira/browse/SPARK-40158
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 3.4.0
>            Reporter: BingKun Pan
>            Priority: Minor
>             Fix For: 3.4.0