lw309637554 commented on a change in pull request #3240:
URL: https://github.com/apache/hudi/pull/3240#discussion_r668356708



##########
File path: 
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/table/action/commit/UpsertPartitioner.java
##########
@@ -146,7 +146,7 @@ private int addUpdateBucket(String partitionPath, String 
fileIdHint) {
    * @return smallFiles not in clustering
    */
   private List<SmallFile> filterSmallFilesInClustering(final Set<String> 
pendingClusteringFileGroupsId, final List<SmallFile> smallFiles) {
-    if (this.config.isClusteringEnabled()) {

Review comment:
       @satishkotha @zhangyue19921010 
   Use "if (!pendingClusteringFileGroupsId.isEmpty())" will improve ease of 
use. 
   Another need to  modify. But if this will bring performance loss? 
@satishkotha 
   
   "  private JavaRDD<HoodieRecord<T>> 
clusteringHandleUpdate(JavaRDD<HoodieRecord<T>> inputRecordsRDD) {
       if (config.isClusteringEnabled()) {"




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to