etseidl opened a new pull request, #7594:
URL: https://github.com/apache/arrow-rs/pull/7594

   # Which issue does this PR close?
   
   Closes #7580.
   
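   # Rationale for this change
   
   Reduces metadata bloat by no longer writing statistics to every Parquet 
page header by default. These statistics are redundant because the same 
page-level values are also written to the column index.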
   
   # What changes are included in this PR?
   
   Changes the meaning of `EnabledStatistics::Page`. Currently, this level 
writes statistics to the column chunk, the page header, and the column index. 
With this PR, `Page` writes statistics only to the column chunk and the column 
index. Writing to the page header can be re-enabled with the newly added 
`write_page_header_statistics` writer option.
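
   The change in meaning can be summarized with a small, self-contained model. 
This is only a sketch and does not use the `parquet` crate: `StatsLevel`, 
`StatsTargets`, `targets_before`, and `targets_after` are hypothetical names 
made up for illustration, and the handling of the flag at levels other than 
`Page` is an assumption rather than something stated in this PR.

```rust
/// Local model of the statistics level. It mirrors the variant names of
/// `EnabledStatistics`, but it is NOT the parquet crate's type.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum StatsLevel {
    None,
    Chunk,
    Page,
}

/// Hypothetical record of where statistics are written.
#[derive(Clone, Copy, Debug, Default, PartialEq, Eq)]
pub struct StatsTargets {
    pub column_chunk: bool,
    pub column_index: bool,
    pub page_header: bool,
}

/// Behavior before this PR: `Page` implies all three locations.
pub fn targets_before(level: StatsLevel) -> StatsTargets {
    match level {
        StatsLevel::None => StatsTargets::default(),
        StatsLevel::Chunk => StatsTargets {
            column_chunk: true,
            ..StatsTargets::default()
        },
        StatsLevel::Page => StatsTargets {
            column_chunk: true,
            column_index: true,
            page_header: true,
        },
    }
}

/// Behavior with this PR: page header statistics are opt-in.
pub fn targets_after(level: StatsLevel, write_page_header_statistics: bool) -> StatsTargets {
    let mut targets = targets_before(level);
    // ASSUMPTION: the flag only has an effect when the level is `Page`.
    targets.page_header = level == StatsLevel::Page && write_page_header_statistics;
    targets
}
```

   In this model, `targets_after(StatsLevel::Page, false)` leaves the page 
header untouched, while passing `true` reproduces the pre-PR behavior.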
   
   Also adds some command line switches to the `parquet-rewrite` tool.
   
   # Are there any user-facing changes?
   
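   No breaking API changes, but there is one added writer option and one 
behavior change: `EnabledStatistics::Page` no longer writes statistics to page 
headers unless `write_page_header_statistics` is enabled.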
   

