sreejasahithi opened a new pull request, #10145:
URL: https://github.com/apache/ozone/pull/10145

   ## What changes were proposed in this pull request?
   This PR provides logic to determine the specific subset of Iceberg manifest 
files that require path rewriting, avoiding redundant processing of manifests.
   
     - Compute delta snapshots as the difference between the start and end 
table metadata versions. If no start version is provided, all snapshots are 
treated as delta.
     - Iterate over all snapshots in the end-version table and read each 
snapshot's manifest list in parallel.
     - If no start metadata is provided, include all manifests unconditionally.
     - If start metadata is provided, filter at the manifest level — only 
include manifests whose snapshotId belongs to the delta snapshot ID set.
     - Deduplicate manifest paths so that manifests shared across multiple 
snapshots are only collected once.
   
   ## What is the link to the Apache JIRA
   
   [HDDS-14942](https://issues.apache.org/jira/browse/HDDS-14942)
   
   ## How was this patch tested?
   
   https://github.com/sreejasahithi/ozone/actions/runs/24986534925
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to