alamb commented on PR #9678:
URL: https://github.com/apache/arrow-rs/pull/9678#issuecomment-4389558274

   I had a few comments that github won't seem to let me post
   
   It might help to explain what files it helps the most for:
   
   ```suggestion
       /// The `path_in_schema` field in the Thrift metadata is redundant and 
wastes a sometimes
       /// significant amount of space, especially for files with long column 
names or deeply nested
       /// schema. Parquet file footers can be made smaller and easier to
       /// parse by omitting this field.
   ```
   
   
   
   
   It might help to be a little more specific about what readers this effects
   
   ```suggestion
       /// Setting this to `false` will break compatibility with Parquet 
readers that
       /// still expect this field to be present. Many older Parquet readers, 
such as parquet-java
       /// but with the exception
       /// of the one in this crate, expect this field to be present, and will 
terminate execution
       /// if it is not. This will continue to be the case unless/until the 
Parquet format
       /// specification is explicitly changed to allow this field to be 
missing. As a consquence,
       /// users should only set this to `false` if they have verified that any 
reader(s) they plan
       /// to use can tolerate the absence of this field.
       ///
       /// For more context, see [GH-563].
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to