Yifan Zhang has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/19471


Change subject: IMPALA-11081: Fix incorrect results in partition key scan
......................................................................

IMPALA-11081: Fix incorrect results in partition key scan

This patch fixes incorrect results caused by short-circuit partition
key scan in the case where a file contains multiple blocks. IMPALA-8834
introduced the optimization that generating only one scan range that
corresponding to the first block per file, and backends only read footers
for parquet/ORC files, which leads to incorrect results if the first
block doesn't include a file footer. This bug is fixed by returning
a scan range corresponding to the last block to make sure it contains a
file footer.

Testing:
- Added an e2e test to verify the fix.
- Ran TestPartitionKeyScans to make sure no regressions.

Change-Id: I17331ed6c26a747e0509dcbaf427cd52808943b1
---
M fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
M tests/query_test/test_queries.py
2 files changed, 42 insertions(+), 1 deletion(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/71/19471/1
--
To view, visit http://gerrit.cloudera.org:8080/19471
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I17331ed6c26a747e0509dcbaf427cd52808943b1
Gerrit-Change-Number: 19471
Gerrit-PatchSet: 1
Gerrit-Owner: Yifan Zhang <chinazhangyi...@163.com>

Reply via email to