pitrou commented on issue #45969: URL: https://github.com/apache/arrow/issues/45969#issuecomment-2765508966
This would probably need to be added to the Arrow-Parquet C++ APIs first. I see two possible kinds of API: 1) expose an additional parameter to pass the full Parquet schema (we would need to validate the Parquet schema is compatible with the provided Arrow data) 2) allow specifying the Parquet logical type of a Arrow column, using dedicated key-value metadata: we need to devise a binary encoding for a Parquet node description (can we use Thrift?) @wgtmac @mapleFU -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
