Re: [PR] GH-48467: [C++][Parquet] Add configure to limit the row group size in bytes [arrow]

via GitHub Fri, 27 Feb 2026 19:29:45 -0800


wecharyu commented on PR #48468:
URL: https://github.com/apache/arrow/pull/48468#issuecomment-3976245659


   > Sure, but it's still trying to estimate that, and it may very well 
overshoot or undershoot depending on the data being actually written.
   
   @pitrou Sorry, maybe I did not quite get your point. If you mean that the 
first row group still uses the buffered size for estimation and may therefore 
over-estimate, I admit that, and there seems to be no better way for the first 
row group.
   
   Row groups after the first one will not use buffered bytes for estimation, 
which should be what we expect.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
