Alex Behm has posted comments on this change.

Change subject: IMPALA-2373: Extrapolate row counts for HDFS tables.
......................................................................


Patch Set 4:

(2 comments)

I'm going to rebase and fix the remaining tests with SHOW PARTITIONS / SHOW 
TABLE STATS.

http://gerrit.cloudera.org:8080/#/c/6840/4/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
File fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java:

PS4, Line 624: Otherwise, the input cardinality is based on the per-partition 
row count stats
             :    * and/or the table-level row count stats, depending on which 
of those are available.
             :    * Partitions without stats are ignored.
> That part describes the logic in computeCardinalities. Sure you want to lea
Everything under "Row count extrapolation" describes the intended behavior. The 
logic really is spread between the two functions, so I chose to leave this 
comment at the top level.


http://gerrit.cloudera.org:8080/#/c/6840/3/tests/metadata/test_explain.py
File tests/metadata/test_explain.py:

PS3, Line 127: 50
> There are definitely ways to improve this. If you haven't done so, can you 
Filed IMPALA-5360 and IMPALA-5361.


-- 
To view, visit http://gerrit.cloudera.org:8080/6840
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: comment
Gerrit-Change-Id: I972c8a03ed70211734631a7dc9085cb33622ebc4
Gerrit-PatchSet: 4
Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-Owner: Alex Behm <alex.b...@cloudera.com>
Gerrit-Reviewer: Alex Behm <alex.b...@cloudera.com>
Gerrit-Reviewer: Dimitris Tsirogiannis <dtsirogian...@cloudera.com>
Gerrit-Reviewer: Marcel Kornacker <mar...@cloudera.com>
Gerrit-Reviewer: Mostafa Mokhtar <mmokh...@cloudera.com>
Gerrit-HasComments: Yes

Reply via email to