Liang-Chi Hsieh created SPARK-6037:
--------------------------------------
Summary: Avoiding duplicate Parquet schema merging
Key: SPARK-6037
URL: https://issues.apache.org/jira/browse/SPARK-6037
Project: Spark
Issue Type: Improvement
Components: SQL
Reporter: Liang-Chi Hsieh
Priority: Minor
FilteringParquetRowInputFormat manually merges Parquet schemas before computing
splits. However, it is duplicate because the schemas are already merged in
ParquetRelation2. We don't need to re-merge them at InputFormat.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]