jasonkhadka opened a new issue #9194: URL: https://github.com/apache/arrow/issues/9194
Currently there is no way to handle the error raised when a requested column is missing from a Parquet file. If a column passed in `columns=[item1, item2, item3]` is missing, the read simply raises `ArrowInvalid: Field named 'item3' not found or not unique in the schema.` The `ArrowInvalid` exception does not carry any information that exposes the field name, so the caller cannot catch it and retry with that field left out.

https://github.com/apache/arrow/blob/ec18db9dbde801781109095dc4c7198dc35bbc07/python/pyarrow/parquet.py#L1657
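
For illustration, one possible workaround is to read the file's schema first and request only the columns that actually exist. This is a minimal sketch, not a proposed patch: the helper names `split_columns` and `read_existing_columns` are hypothetical, and it assumes that checking top-level field names via `pyarrow.parquet.read_schema` is sufficient for the use case.

```python
def split_columns(requested, available):
    """Partition requested column names into (present, missing), keeping request order."""
    available = set(available)
    present = [name for name in requested if name in available]
    missing = [name for name in requested if name not in available]
    return present, missing


def read_existing_columns(path, columns):
    """Hypothetical helper: read only the requested columns that exist in the file.

    Returns the table together with the list of column names that were skipped.
    """
    # Imported here so split_columns stays usable without pyarrow installed.
    import pyarrow.parquet as pq

    # Read the schema from the file, then compare top-level field names.
    schema = pq.read_schema(path)
    present, missing = split_columns(columns, schema.names)

    table = pq.read_table(path, columns=present)
    return table, missing
```

With this, `read_existing_columns("data.parquet", ["item1", "item2", "item3"])` (the path is a placeholder) would return the table for the columns found plus a list of the ones skipped, which the caller can log or act on. It costs an extra schema read per file, which is the kind of overhead a built-in option could avoid.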
