[ https://issues.apache.org/jira/browse/SPARK-6037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Cheng Lian updated SPARK-6037: ------------------------------ Assignee: Liang-Chi Hsieh > Avoiding duplicate Parquet schema merging > ----------------------------------------- > > Key: SPARK-6037 > URL: https://issues.apache.org/jira/browse/SPARK-6037 > Project: Spark > Issue Type: Improvement > Components: SQL > Reporter: Liang-Chi Hsieh > Assignee: Liang-Chi Hsieh > Priority: Minor > > FilteringParquetRowInputFormat manually merges Parquet schemas before > computing splits. However, it is duplicate because the schemas are already > merged in ParquetRelation2. We don't need to re-merge them at InputFormat. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org