zeroshade commented on PR #520: URL: https://github.com/apache/arrow-go/pull/520#issuecomment-3339971017
> Does pqarrow smuggle Arrow schemas inside Parquet metadata so users can round-trip? If you set the `StoreSchema` option, then yes it does add the Arrow schema inside the parquet metadata. > This PR doesn't introduce any breakage of course but it strikes me that arrow -> parquet -> arrow won't round-trip well if the arrow input has to be cast like this. This is the case no matter what type you pass (even if it's not dictionary encoded) because Parquet only has int32/int64 as physical storage types. In the case where the original Arrow schema is in the metadata, we properly cast on the way out so there isn't an issue. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
