I have two parquet related bug fixes and I wonder if we can release them in 12.0.1 1. https://github.com/apache/arrow/pull/35428 2. https://github.com/apache/arrow/pull/35520
Patch 1 can cause BYTE_STREAM_SPLIT unable to be read if the previous parquet page is larger than the incoming one. Patch 2 might cause segment fault when Close row-group meets an exception. Best, Xuwei Fu On 2023/05/18 17:04:12 Weston Pace wrote: > Regrettabl, 12.0.0 had a significant performance regression (I'll take the > blame for not thinking through all the use cases), most easily exposed when > writing datasets from pandas / numpy data, which is being addressed in > [1]. I believe this to be a fairly common use case and it may warrant a > 12.0.1 patch. Are there other issues that would need a patch? Do we feel > this issue is significant enough to justify the work? > > [1] https://github.com/apache/arrow/pull/35565 >