lipeng...@apache.org has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19419 )

Change subject: IMPALA-11826: Avoid calling planFiles() on Iceberg V2 tables 
when there are no predicates
......................................................................


Patch Set 2:

(9 comments)

Left some minor comments.

http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/FeIcebergTable.java
File fe/src/main/java/org/apache/impala/catalog/FeIcebergTable.java:

http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/FeIcebergTable.java@754
PS2, Line 754: Iterables.concat(
             :           icebergFiles.dataFilesWithoutDeletes,
             :           icebergFiles.dataFilesWithDeletes,
             :           icebergFiles.deleteFiles)
I'd prefer to put this logic in GroupedContentFiles, perhaps as a method named 
'getAllContentFiles()'?
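
A minimal sketch of what such a method could look like. It is an illustration, not the code under review: GroupedContentFilesSketch is a hypothetical stand-in, String paths replace the Iceberg DataFile/DeleteFile objects, and all three groups are assumed to be plain lists.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical, simplified stand-in for GroupedContentFiles.
// String paths replace the real Iceberg content file objects.
final class GroupedContentFilesSketch {
  final List<String> dataFilesWithoutDeletes = new ArrayList<>();
  final List<String> dataFilesWithDeletes = new ArrayList<>();
  final List<String> deleteFiles = new ArrayList<>();

  // The one place that knows how the three groups are combined, so callers
  // no longer repeat the concatenation themselves.
  List<String> getAllContentFiles() {
    List<String> all = new ArrayList<>(
        dataFilesWithoutDeletes.size() + dataFilesWithDeletes.size()
            + deleteFiles.size());
    all.addAll(dataFilesWithoutDeletes);
    all.addAll(dataFilesWithDeletes);
    all.addAll(deleteFiles);
    return all;
  }
}
```

Note that this sketch copies the elements into a new list, whereas the Iterables.concat() call quoted above returns a lazy view; the method could equally return that view.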


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/IcebergContentFileStore.java
File fe/src/main/java/org/apache/impala/catalog/IcebergContentFileStore.java:

http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/IcebergContentFileStore.java@49
PS2, Line 49: files
nit: file?


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/IcebergContentFileStore.java@149
PS2, Line 149:   public FileDescriptor getFileDescriptor(String pathHash) {
> This method appears to be unused
+1 indeed.


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/IcebergTable.java
File fe/src/main/java/org/apache/impala/catalog/IcebergTable.java:

http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/IcebergTable.java@32
PS2, Line 32: import org.apache.iceberg.DataFile;
            : import org.apache.iceberg.DeleteFile;
nit: unused import.


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/IcebergTable.java@37
PS2, Line 37: FileDescriptor
ditto


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/IcebergTable.java@39
PS2, Line 39: import org.apache.impala.common.Pair;
ditto


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/iceberg/GroupedContentFiles.java
File fe/src/main/java/org/apache/impala/catalog/iceberg/GroupedContentFiles.java:

http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/iceberg/GroupedContentFiles.java@36
PS2, Line 36: GroupedContentFiles
This is a nice encapsulation: the class name is explicit, and it's more elegant 
than 'Pair<List<DataFile>, Set<DeleteFile>>'. I think we should avoid this kind 
of nesting (Pair<Collection<?>, Collection<?>>) in the future and wrap such 
values in a meaningful class instead.
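
To illustrate the difference at the call site, here is a small sketch under stated assumptions: NamedResultSketch, ContentFiles, loadAsPair() and loadAsClass() are hypothetical names, String paths stand in for DataFile/DeleteFile, and java.util.Map.Entry stands in for Impala's Pair.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; none of these names exist in Impala.
final class NamedResultSketch {
  // Pair-style result: the caller has to remember that the key holds the
  // data files and the value holds the delete files.
  static Map.Entry<List<String>, Set<String>> loadAsPair() {
    List<String> data = Arrays.asList("data-1", "data-2");
    Set<String> deletes = new HashSet<>(Arrays.asList("delete-1"));
    return new SimpleEntry<>(data, deletes);
  }

  // Named result: the field names document what each collection contains.
  static final class ContentFiles {
    final List<String> dataFiles;
    final Set<String> deleteFiles;

    ContentFiles(List<String> dataFiles, Set<String> deleteFiles) {
      this.dataFiles = dataFiles;
      this.deleteFiles = deleteFiles;
    }
  }

  static ContentFiles loadAsClass() {
    Map.Entry<List<String>, Set<String>> pair = loadAsPair();
    return new ContentFiles(pair.getKey(), pair.getValue());
  }
}
```

With the pair, a caller writes pair.getKey() and pair.getValue(); with the class, the same data reads as files.dataFiles and files.deleteFiles.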


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/planner/IcebergScanPlanner.java
File fe/src/main/java/org/apache/impala/planner/IcebergScanPlanner.java:

http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/planner/IcebergScanPlanner.java@162
PS2, Line 162:   private void setFileDescriptorsBasedOnFileStore() throws ImpalaException {
> Can delete "throws ImpalaException"
+1 indeed


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/planner/IcebergScanPlanner.java@166
PS2, Line 166:     deleteFiles_ = new HashSet<FileDescriptor>(fileStore.getDeleteFiles());
> Can be "HashSet<>"
+1 indeed
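
For reference, a minimal sketch of the diamond form suggested above. DiamondSketch and copyToSet() are hypothetical names, and String stands in for FileDescriptor.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration; String stands in for FileDescriptor.
final class DiamondSketch {
  static Set<String> copyToSet(List<String> deleteFiles) {
    // Same result as new HashSet<String>(deleteFiles): the compiler infers
    // the type argument from the return type.
    return new HashSet<>(deleteFiles);
  }
}
```

Both spellings produce the same set; the diamond only removes the repeated type argument.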



--
To view, visit http://gerrit.cloudera.org:8080/19419
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia46bd2dce248a9e096fc1c0bd914fc3fa4686fb0
Gerrit-Change-Number: 19419
Gerrit-PatchSet: 2
Gerrit-Owner: Zoltan Borok-Nagy <borokna...@cloudera.com>
Gerrit-Reviewer: Andrew Sherman <asher...@cloudera.com>
Gerrit-Reviewer: Anonymous Coward <lipeng...@apache.org>
Gerrit-Reviewer: Gergely Fürnstáhl <gfurnst...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Tamas Mate <tma...@apache.org>
Gerrit-Comment-Date: Fri, 13 Jan 2023 07:01:18 +0000
Gerrit-HasComments: Yes
