friendlymatthew opened a new issue, #9539:
URL: https://github.com/apache/arrow-rs/issues/9539

   **Is your feature request related to a problem or challenge? Please describe 
what you are trying to do.**
   
   `StatisticsConverter::try_new` cannot extract Parquet statistics for nested 
fields. Internally it calls `parquet_column()`, which explicitly bails out on 
nested types: 
https://github.com/apache/arrow-rs/blob/d3c79006f2595e144d539f56b3054fe916ab184b/parquet/src/arrow/mod.rs#L478-L486
   
   There is no alternative constructor that accepts a Parquet leaf column index 
directly, so callers that have already resolved the mapping (like DataFusion's 
row filter logic) have no way to use `StatisticsConverter` for nested fields.
   
   I'd like to propose adding a 
`StatisticsConverter::from_column_index(parquet_column_index, arrow_field, 
parquet_schema)` constructor that bypasses schema resolution and accepts a 
pre-resolved leaf column index directly.
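
   To make the gap concrete, here is a minimal, self-contained sketch of the 
two lookup styles. It is a toy model only: `Field`, `Leaf`, `Schema`, 
`leaf_index_by_name`, `stats_from_column_index` and `example_schema` are 
hypothetical names made up for illustration and are not the arrow-rs API. 
The name-based lookup gives up on nested fields, as described above, while 
the index-based lookup works for any leaf the caller has already resolved.

```rust
// Toy model only: these types are hypothetical and are NOT the arrow-rs API.

/// One top-level field of the (toy) Arrow schema.
struct Field {
    name: &'static str,
    is_nested: bool,
}

/// One Parquet leaf column: its dotted path, the index of the top-level
/// field it belongs to, and toy min/max statistics for a single row group.
struct Leaf {
    path: &'static str,
    root_idx: usize,
    min: i64,
    max: i64,
}

struct Schema {
    fields: Vec<Field>,
    leaves: Vec<Leaf>,
}

impl Schema {
    /// Name-based resolution: nested fields are rejected, so callers never
    /// get a leaf index for them.
    fn leaf_index_by_name(&self, name: &str) -> Option<usize> {
        let root_idx = self.fields.iter().position(|f| f.name == name)?;
        if self.fields[root_idx].is_nested {
            return None;
        }
        self.leaves.iter().position(|l| l.root_idx == root_idx)
    }

    /// Index-based access: the caller supplies a pre-resolved leaf index,
    /// so no schema resolution is needed and nested leaves work too.
    fn stats_from_column_index(&self, leaf_idx: usize) -> Option<(&'static str, i64, i64)> {
        self.leaves.get(leaf_idx).map(|l| (l.path, l.min, l.max))
    }
}

/// A schema with one flat field (`id`) and one nested struct (`point`)
/// that expands to two leaves (`point.x`, `point.y`).
fn example_schema() -> Schema {
    Schema {
        fields: vec![
            Field { name: "id", is_nested: false },
            Field { name: "point", is_nested: true },
        ],
        leaves: vec![
            Leaf { path: "id", root_idx: 0, min: 1, max: 100 },
            Leaf { path: "point.x", root_idx: 1, min: -5, max: 5 },
            Leaf { path: "point.y", root_idx: 1, min: 0, max: 9 },
        ],
    }
}
```

   In this sketch `leaf_index_by_name("point")` yields `None`, whereas a caller 
that already knows `point.y` is leaf 2 can still read its statistics through 
`stats_from_column_index(2)`. The proposed constructor would play the role of 
the second method.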


