wangxianghu commented on a change in pull request #3213:
URL: https://github.com/apache/hudi/pull/3213#discussion_r663303427
##
File path:
hudi-client/hudi-spark-client/src/main/java/org/apache/hudi/table/action/commit/UpsertPartitioner.java
##
@@ -185,21 +185,26 @@ private void assignInserts(WorkloadProfile profile,
HoodieEngineContext context)
// first try packing this into one of the smallFiles
for (SmallFile smallFile : smallFiles) {
- long recordsToAppend = Math.min((config.getParquetMaxFileSize() -
smallFile.sizeBytes) / averageRecordSize,
- totalUnassignedInserts);
- if (recordsToAppend > 0 && totalUnassignedInserts > 0) {
-// create a new bucket or re-use an existing bucket
-int bucket;
-if
(updateLocationToBucket.containsKey(smallFile.location.getFileId())) {
- bucket =
updateLocationToBucket.get(smallFile.location.getFileId());
- LOG.info("Assigning " + recordsToAppend + " inserts to existing
update bucket " + bucket);
-} else {
- bucket = addUpdateBucket(partitionPath,
smallFile.location.getFileId());
- LOG.info("Assigning " + recordsToAppend + " inserts to new
update bucket " + bucket);
+ if (totalUnassignedInserts > 0) {
+long recordsToAppend = Math.min((config.getParquetMaxFileSize() -
smallFile.sizeBytes) / averageRecordSize,
+totalUnassignedInserts);
+if (recordsToAppend > 0) {
+ // create a new bucket or re-use an existing bucket
+ int bucket;
+ if
(updateLocationToBucket.containsKey(smallFile.location.getFileId())) {
+bucket =
updateLocationToBucket.get(smallFile.location.getFileId());
+LOG.info("Assigning " + recordsToAppend + " inserts to
existing update bucket " + bucket);
+ } else {
+bucket = addUpdateBucket(partitionPath,
smallFile.location.getFileId());
+LOG.info("Assigning " + recordsToAppend + " inserts to new
update bucket " + bucket);
+ }
+ bucketNumbers.add(bucket);
+ recordsPerBucket.add(recordsToAppend);
+ totalUnassignedInserts -= recordsToAppend;
}
-bucketNumbers.add(bucket);
-recordsPerBucket.add(recordsToAppend);
-totalUnassignedInserts -= recordsToAppend;
+ } else {
Review comment:
> if the main thing you want to fix is to break out early. Can we do it
after L203, with a if check, without nesting this 1 level under again in if
block?
>
> ```java
> totalUnassignedInserts -= recordsToAppend;
> if (totalUnassignedInserts <= 0) {
> break;
> }
> ```
Yes, I want to break it early. done
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org