alamb opened a new issue, #10073:
URL: https://github.com/apache/arrow-rs/issues/10073

   @adriangb  has made some great contributions to the Rust parquet writer to 
support several usecases where parquet writers typically struggle:
   - https://github.com/apache/arrow-rs/pull/9972
   -https://github.com/apache/arrow-rs/pull/10020
   
   I think it would be amazing to write a blog post explaining the challenge 
and the solutions
   
   Challenge 1: Oversized pages for large binary/string columns
   - https://github.com/apache/arrow-rs/issues/10061
   
   Challenge 2: RAM buffering requirements when writing large Row Groups
   - https://github.com/apache/arrow-rs/issues/10071
   
   The idea is to write a blog in https://arrow.apache.org/blog/  (source in 
https://github.com/apache/arrow-site) that explains the challenges and how we 
solved it with Software Engineering rather than a new file format


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to