wecharyu commented on PR #48468: URL: https://github.com/apache/arrow/pull/48468#issuecomment-3976245659
> Sure, but it's still trying to estimate that, and it may very well overshoot or undershoot depending on the data being actually written. @pitrou Sorry, maybe I did not quite get your point. If you mean that the first row group is still using buffered size for estimation and may cause over-estimation, I should admit it and there seems no better way for the first row group. And the row groups excluding the first one will not use buffered bytes for estimation, it should be what we are expected. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
