[
https://issues.apache.org/jira/browse/HIVE-9372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rui Li updated HIVE-9372:
-------------------------
Attachment: HIVE-9372.2.patch
Address Xuefu's comment. I also make the perf log to cover the whole
{{CombineHiveInputFormat.getSplits}} method.
The patch can reduce the getSplits time from 1.5s to 1s for an Orc table with
over 1800 input paths.
> Parallel checking non-combinable paths in CombineHiveInputFormat
> ----------------------------------------------------------------
>
> Key: HIVE-9372
> URL: https://issues.apache.org/jira/browse/HIVE-9372
> Project: Hive
> Issue Type: Improvement
> Reporter: Rui Li
> Assignee: Rui Li
> Attachments: HIVE-9372.1.patch, HIVE-9372.2.patch
>
>
> Checking if an input path is combinable is expensive. So we should make it
> parallel.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)