[GitHub] [hudi] danny0405 commented on a diff in pull request #9122: [HUDI-6477] Lazy fetching partition path & file slice when refresh in…

2023-07-04 Thread via GitHub


danny0405 commented on code in PR #9122:
URL: https://github.com/apache/hudi/pull/9122#discussion_r1252517410


##
hudi-common/src/main/java/org/apache/hudi/BaseHoodieTableFileIndex.java:
##
@@ -144,18 +144,7 @@ public BaseHoodieTableFileIndex(HoodieEngineContext engineContext,
 this.engineContext = engineContext;
 this.fileStatusCache = fileStatusCache;
 
-// The `shouldListLazily` variable controls how we initialize the TableFileIndex:
-//  - non-lazy/eager listing (shouldListLazily=false):  all partitions and file slices will be loaded eagerly during initialization.
-//  - lazy listing (shouldListLazily=true): partitions listing will be done lazily with the knowledge from query predicate on partition
-//columns. And file slices fetching only happens for partitions satisfying the given filter.
-//
-// In SparkSQL, `shouldListLazily` is controlled by option `REFRESH_PARTITION_AND_FILES_IN_INITIALIZATION`.
-// In lazy listing case, if no predicate on partition is provided, all partitions will still be loaded.
-if (shouldListLazily) {
-  this.tableMetadata = createMetadataTable(engineContext, metadataConfig, basePath);

Review Comment:
   Please ignore my question about the removed initialization: `tableMetadata` is created in `doRefresh`.
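
   For readers following the thread, the pattern under discussion can be sketched roughly as below. This is only an illustration of deferring a field's creation from the constructor to a refresh method, not the code in this PR: `LazyFileIndexSketch`, `metadataFactory`, and the `String` stand-in for the metadata object are all hypothetical, and only the names `tableMetadata` and `doRefresh` come from the diff and comment above.

```java
import java.util.function.Supplier;

// Hypothetical stand-in for a file index; NOT Hudi's BaseHoodieTableFileIndex.
class LazyFileIndexSketch {
    // Assumption: stands in for a call such as createMetadataTable(...).
    private final Supplier<String> metadataFactory;
    // Left null by the constructor; populated only by doRefresh().
    private String tableMetadata;

    LazyFileIndexSketch(Supplier<String> metadataFactory) {
        this.metadataFactory = metadataFactory;
        // Intentionally no metadata creation here.
    }

    // Creates (or re-creates) the metadata handle on every refresh.
    void doRefresh() {
        this.tableMetadata = metadataFactory.get();
    }

    String getTableMetadata() {
        return tableMetadata;
    }
}
```

   With this shape, a constructor that no longer assigns `tableMetadata` is harmless as long as `doRefresh` runs before the field is first read.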



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [hudi] danny0405 commented on a diff in pull request #9122: [HUDI-6477] Lazy fetching partition path & file slice when refresh in…

2023-07-04 Thread via GitHub


danny0405 commented on code in PR #9122:
URL: https://github.com/apache/hudi/pull/9122#discussion_r1252517112


##
hudi-common/src/main/java/org/apache/hudi/BaseHoodieTableFileIndex.java:
##
@@ -144,18 +144,7 @@ public BaseHoodieTableFileIndex(HoodieEngineContext engineContext,
 this.engineContext = engineContext;
 this.fileStatusCache = fileStatusCache;
 
-// The `shouldListLazily` variable controls how we initialize the TableFileIndex:
-//  - non-lazy/eager listing (shouldListLazily=false):  all partitions and file slices will be loaded eagerly during initialization.
-//  - lazy listing (shouldListLazily=true): partitions listing will be done lazily with the knowledge from query predicate on partition
-//columns. And file slices fetching only happens for partitions satisfying the given filter.
-//
-// In SparkSQL, `shouldListLazily` is controlled by option `REFRESH_PARTITION_AND_FILES_IN_INITIALIZATION`.
-// In lazy listing case, if no predicate on partition is provided, all partitions will still be loaded.
-if (shouldListLazily) {
-  this.tableMetadata = createMetadataTable(engineContext, metadataConfig, basePath);

Review Comment:
   Is the initialization of `tableMetadata` removed here?


