GitHub user dongjoon-hyun commented on the issue:

    https://github.com/apache/spark/pull/16281

This issue was initially about PARQUET-363, but all maintenance fixes (along with new features) are welcome. From my side, how about the following?

- PARQUET-99: Large rows cause unnecessary OOM exceptions
- PARQUET-353: Compressors not getting recycled while writing parquet files, causing memory leak
- PARQUET-363: Cannot construct empty MessageType for ReadContext.requestedSchema
- PARQUET-511: Integer overflow on counting values in column
- PARQUET-569: ParquetMetadataConverter offset filter is broken
- PARQUET-571: Fix potential leak in ParquetFileReader.close()
- PARQUET-623: DeltaByteArrayReader has incorrect skip behaviour
- PARQUET-645: DictionaryFilter incorrectly handles null

@gatorsmile, do you have more?