Zoltan Borok-Nagy has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19419 )

Change subject: IMPALA-11826: Avoid calling planFiles() on Iceberg V2 tables 
when there are no predicates
......................................................................


Patch Set 3:

(13 comments)

Thanks for the comments!

http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/FeIcebergTable.java
File fe/src/main/java/org/apache/impala/catalog/FeIcebergTable.java:

http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/FeIcebergTable.java@754
PS2, Line 754: ble, contentFile);
             :         nameToStats.put(name, mergePartitionStats(nameToStats, 
contentFile, name));
             :       }
             :       return nameToStats;
> I prefer to put this logic in GroupedContentFiles, which could be nameed li
Done


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/IcebergContentFileStore.java
File fe/src/main/java/org/apache/impala/catalog/IcebergContentFileStore.java:

http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/IcebergContentFileStore.java@49
PS2, Line 49: file
> nit: file?
Done


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/IcebergContentFileStore.java@149
PS2, Line 149:   public List<FileDescriptor> getDataFilesWithoutDeletes() {
> +1 indeed.
Done


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/IcebergTable.java
File fe/src/main/java/org/apache/impala/catalog/IcebergTable.java:

http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/IcebergTable.java@32
PS2, Line 32: import org.apache.impala.analysis.IcebergPartitionField;
            : import org.apache.impala.analysis.Ice
> nit: unused import.
Done


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/IcebergTable.java@37
PS2, Line 37: ec;
> ditto
Done


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/IcebergTable.java@39
PS2, Line 39: import org.apache.impala.thrift.TGetP
> ditto
Done


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/iceberg/GroupedContentFiles.java
File 
fe/src/main/java/org/apache/impala/catalog/iceberg/GroupedContentFiles.java:

http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/iceberg/GroupedContentFiles.java@34
PS2, Line 34:  * Struct-like object to group different Iceberg content files:
> line has trailing whitespace
Done


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/iceberg/GroupedContentFiles.java@36
PS2, Line 36: es with deleted row
> This is a nice encapsulation, the class name is explicit, and it's more ele
Yeah, I agree. Sometimes a Pair is a handy lazy solution when we only need to 
group two objects, but I agree that it's nicer to introduce a new class instead.


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/catalog/iceberg/GroupedContentFiles.java@42
PS2, Line 42:   public Set<DeleteFile> deleteFiles = new HashSet<>();
> This is a style thing, but it can be neater to do these initializations in
Done


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/planner/IcebergScanPlanner.java
File fe/src/main/java/org/apache/impala/planner/IcebergScanPlanner.java:

http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/planner/IcebergScanPlanner.java@162
PS2, Line 162:   private void setFileDescriptorsBasedOnFileStore() {
> +1 indeed
Done


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/planner/IcebergScanPlanner.java@166
PS2, Line 166:     deleteFiles_ = new HashSet<>(fileStore.getDeleteFiles());
> +1 indeed
Done


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/util/IcebergUtil.java
File fe/src/main/java/org/apache/impala/util/IcebergUtil.java:

http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/main/java/org/apache/impala/util/IcebergUtil.java@558
PS2, Line 558:     try (CloseableIterable<FileScanTask> fileScanTasks = 
planFiles(
> dataFilesWithoutDeletes, dataFilesWithDeletes, deleteFiles are unused now.
Done


http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/test/java/org/apache/impala/catalog/local/LocalCatalogTest.java
File fe/src/test/java/org/apache/impala/catalog/local/LocalCatalogTest.java:

http://gerrit.cloudera.org:8080/#/c/19419/2/fe/src/test/java/org/apache/impala/catalog/local/LocalCatalogTest.java@287
PS2, Line 287:         tblInfo.getIceberg_table().getContent_files(),
> Can add
Good point, thanks.



--
To view, visit http://gerrit.cloudera.org:8080/19419
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ia46bd2dce248a9e096fc1c0bd914fc3fa4686fb0
Gerrit-Change-Number: 19419
Gerrit-PatchSet: 3
Gerrit-Owner: Zoltan Borok-Nagy <borokna...@cloudera.com>
Gerrit-Reviewer: Andrew Sherman <asher...@cloudera.com>
Gerrit-Reviewer: Anonymous Coward <lipeng...@apache.org>
Gerrit-Reviewer: Gergely Fürnstáhl <gfurnst...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Tamas Mate <tma...@apache.org>
Gerrit-Reviewer: Zoltan Borok-Nagy <borokna...@cloudera.com>
Gerrit-Comment-Date: Mon, 16 Jan 2023 17:11:28 +0000
Gerrit-HasComments: Yes

Reply via email to