hemanthboyina opened a new pull request, #15576:
URL: https://github.com/apache/iceberg/pull/15576

   When max-files-to-rewrite truncates a file group to a subset of files, 
inputSplitSize and  expectedOutputFiles were still computed from the full 
group's input size. This caused the sort/zorder rewrite strategy to create too 
many small output files, as numShufflePartitions was overestimated. This fix 
recomputes the input size from the actual truncated file list.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to