Fokko commented on code in PR #8980:
URL: https://github.com/apache/iceberg/pull/8980#discussion_r1435150752


##########
core/src/main/java/org/apache/iceberg/MicroBatches.java:
##########
@@ -92,7 +92,7 @@ private static List<Pair<ManifestFile, Integer>> 
indexManifests(
 
     for (ManifestFile manifest : manifestFiles) {
       manifestIndexes.add(Pair.of(manifest, currentFileIndex));
-      currentFileIndex += manifest.addedFilesCount() + 
manifest.existingFilesCount();
+      currentFileIndex += manifest.addedFilesCount();

Review Comment:
   I think it is relatively easy to reason about this part because it is used 
only in one path.
   
   The existing entries will be entries of data files that are already part of 
a manifest. For example, when you rewrite the metadata; for example because you 
use a lot of fast-appends. With a fast append, a new manifest will be written 
and added to the manifest-list. At some point, you want to combine these 
manifests into a bigger one to reduce the number of calls to the object store 
and make the planning part faster.
   
   We want to skip when only the metadata has been rewritten, so I think this 
change is correct.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@iceberg.apache.org
For additional commands, e-mail: issues-h...@iceberg.apache.org

Reply via email to