Zoltan Borok-Nagy has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17478 )

Change subject: IMPALA-10709: Min/max filters should be enabled for joins on 
sorted columns in Parquet tables
......................................................................


Patch Set 30:

(4 comments)

Thanks for working on this. I will review it tomorrow, I just had some early 
questions/comments.

http://gerrit.cloudera.org:8080/#/c/17478/30//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/17478/30//COMMIT_MSG@13
PS30, Line 13: only the leading column will be assigned a min/max filter
In case of ZORDER I think we could use the other columns as well.

Fast code path can still work to find the right pages if the BoundaryOrder 
happens to be ASC/DESC.


http://gerrit.cloudera.org:8080/#/c/17478/30//COMMIT_MSG@26
PS30, Line 26: When minmax_filter_sorted_columns is set to false, no min/max 
filters
             : will be specifically assigned to the leading sort by columns.
In that case the min/max filters are not used at all, right?


http://gerrit.cloudera.org:8080/#/c/17478/30//COMMIT_MSG@35
PS30, Line 35: minmax_filter_fast_code_path
If we have good confidence in the new algorithm then I think we can get rid of 
this query option. Maybe keep the VERIFICATION in debug builds.


http://gerrit.cloudera.org:8080/#/c/17478/30/be/src/exec/parquet/hdfs-parquet-scanner.cc
File be/src/exec/parquet/hdfs-parquet-scanner.cc:

http://gerrit.cloudera.org:8080/#/c/17478/30/be/src/exec/parquet/hdfs-parquet-scanner.cc@1196
PS30, Line 1196: ASCENDING
Do we plan to make it work for DESCENDING as well?
Though I'm not sure if it would be really useful, since probably an ordered 
write will always mean ASCENDING order.



--
To view, visit http://gerrit.cloudera.org:8080/17478
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I28c19c4b39b01ffa7d275fb245be85c28e9b2963
Gerrit-Change-Number: 17478
Gerrit-PatchSet: 30
Gerrit-Owner: Qifan Chen <qc...@cloudera.com>
Gerrit-Reviewer: Aman Sinha <amsi...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Qifan Chen <qc...@cloudera.com>
Gerrit-Reviewer: Zoltan Borok-Nagy <borokna...@cloudera.com>
Gerrit-Comment-Date: Thu, 03 Jun 2021 17:27:06 +0000
Gerrit-HasComments: Yes

Reply via email to