Colin Ma created HIVE-16969:
-------------------------------

             Summary: Improvement performance of MapOperator for Parquet
                 Key: HIVE-16969
                 URL: https://issues.apache.org/jira/browse/HIVE-16969
             Project: Hive
          Issue Type: Improvement
    Affects Versions: 3.0.0
            Reporter: Colin Ma
            Assignee: Colin Ma
             Fix For: 3.0.0


For a table with many partition files, 
MapOperator.cloneConfsForNestedColPruning() will update the 
hive.io.file.readNestedColumn.paths many times. The larger value of 
hive.io.file.readNestedColumn.paths will cause the poor performance for 
ParquetHiveSerDe.processRawPrunedPaths(). 
So, the unnecessary paths should be appended to 
hive.io.file.readNestedColumn.paths.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to