Prasanth Jayachandran created HIVE-15065:
--------------------------------------------
Summary: SimpleFetchOptimizer should decide based on metastore stats when available
Key: HIVE-15065
URL: https://issues.apache.org/jira/browse/HIVE-15065
Project: Hive
Issue Type: Bug
Components: Logical Optimizer
Affects Versions: 2.2.0
Reporter: Prasanth Jayachandran
Assignee: Prasanth Jayachandran
Currently, the decision whether to use the fetch optimizer is made by scanning
the filesystem for file lengths and checking whether the aggregated size is less
than the fetch task threshold. This can be very expensive in cloud environments.
The issue is mitigated to some extent by HIVE-14920, but that still requires a
file system scan. We can instead make the decision based on the stats from the
metastore, falling back to the file system scan when stats are not available.
Since fast stats (numRows and fileSize) are always available, this should work
most of the time.
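
A minimal sketch of the proposed decision order, not the actual
SimpleFetchOptimizer code. The class FetchDecision, its method names, and the
LongSupplier standing in for the file system scan are all hypothetical. It also
assumes the size stat is stored in the table parameters under the key
"totalSize", and it treats a missing, malformed, or non-positive value as "stats
not available" so that the scan is used instead.

```java
import java.util.Map;
import java.util.OptionalLong;
import java.util.function.LongSupplier;

/** Illustrative only: stats-first size check with a file system fallback. */
public class FetchDecision {

    // Assumed parameter key for the on-disk size in bytes.
    static final String TOTAL_SIZE_KEY = "totalSize";

    /** Returns the size recorded in the stats, or empty if it is unusable. */
    public static OptionalLong sizeFromStats(Map<String, String> tableParams) {
        String raw = tableParams.get(TOTAL_SIZE_KEY);
        if (raw == null) {
            return OptionalLong.empty();
        }
        try {
            long size = Long.parseLong(raw.trim());
            // Design choice in this sketch: a non-positive size is treated as missing.
            return size > 0 ? OptionalLong.of(size) : OptionalLong.empty();
        } catch (NumberFormatException e) {
            return OptionalLong.empty();
        }
    }

    /**
     * Decides whether the input is small enough for a fetch task.
     * The (expensive) file system scan is only invoked when stats are unusable.
     */
    public static boolean shouldFetch(Map<String, String> tableParams,
                                      LongSupplier fileSystemScan,
                                      long thresholdBytes) {
        OptionalLong fromStats = sizeFromStats(tableParams);
        long size = fromStats.isPresent()
                ? fromStats.getAsLong()
                : fileSystemScan.getAsLong();
        return size < thresholdBytes;
    }
}
```

Passing the scan as a supplier keeps it lazy: when the metastore stats are
usable, the file system is never touched, which is the saving this issue is
after.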