rok opened a new issue, #8304:
URL: https://github.com/apache/arrow-rs/issues/8304

   As per the [spec](https://parquet.apache.org/docs/file-format/data-pages/encryption/#55-plaintext-footer-mode):
   
   > In the plaintext footer mode, the optional ColumnMetaData meta_data is set 
in the ColumnChunk structure for all columns, but is stripped of the statistics 
for the sensitive (encrypted) columns. These statistics are available for new 
readers with the column key - they decrypt the encrypted_column_metadata field, 
described in the section 5.3, and parse it to get statistics and all other 
column metadata values. The legacy readers are not aware of the encrypted 
metadata field; they parse the regular (plaintext) field as usual. While they 
can’t read the data of encrypted columns, they read their metadata to extract 
the offset and size of encrypted column data, required for column chunk 
vectorization.
   
   The current writer omits statistics altogether when writing plaintext 
footers. We should write them as the spec describes.
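
   A minimal sketch of the behaviour the spec asks for, to make the expected outcome concrete. The types and the `plaintext_footer_metadata` function are hypothetical, simplified stand-ins and not the `parquet` crate's actual API; the real writer would also have to serialize and encrypt the full metadata into `encrypted_column_metadata`, which is not shown here.

```rust
/// Hypothetical, simplified stand-in for Parquet column statistics.
#[derive(Clone, Debug, PartialEq)]
struct Statistics {
    min: Vec<u8>,
    max: Vec<u8>,
    null_count: u64,
}

/// Hypothetical, simplified stand-in for the Thrift `ColumnMetaData` struct.
#[derive(Clone, Debug, PartialEq)]
struct ColumnMetaData {
    data_page_offset: i64,
    total_compressed_size: i64,
    statistics: Option<Statistics>,
}

/// Builds the metadata that would go into the plaintext `meta_data` field
/// of a `ColumnChunk` in plaintext footer mode (assumed helper, for
/// illustration only).
///
/// Offsets and sizes are always kept so that legacy readers can still
/// locate the column chunk. Statistics are stripped only for encrypted
/// columns; unencrypted columns keep them.
fn plaintext_footer_metadata(
    full: &ColumnMetaData,
    column_is_encrypted: bool,
) -> ColumnMetaData {
    let mut visible = full.clone();
    if column_is_encrypted {
        visible.statistics = None;
    }
    visible
}
```

   Under this sketch, an unencrypted column keeps its statistics in the plaintext footer, whereas the current writer drops them for every column.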

