wiedld commented on issue #15028:
URL: https://github.com/apache/datafusion/issues/15028#issuecomment-2704943619

   If turn back on compression, and set `table_opts.global.max_row_group_size = 
5_000` (this is max row count [used 
here](https://github.com/apache/datafusion/blob/34efd1fbae39eb0441a43ab976fc23001d1f674a/datafusion/datasource-parquet/src/file_format.rs#L1210)),
 then you can avoid the OOM on that reproducer.  If you keep compression off, 
then you have to reduce the max row count to be much smaller (so can buffer it 
all in memory).
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to