pitrou commented on code in PR #13901: URL: https://github.com/apache/arrow/pull/13901#discussion_r1752271358
########## cpp/src/parquet/properties.h: ########## @@ -941,6 +942,18 @@ class PARQUET_EXPORT ArrowReaderProperties { return coerce_int96_timestamp_unit_; } + /// Enable Parquet supported Arrow ExtensionTypes. + /// + /// When enabled, Parquet will use supported Arrow ExtensionTypes by mapping correctly + /// mapping them to Arrow types at read time. Currently only arrow::extension::json() + /// extension type is supported. Columns whose LogicalType is JSON will be interpreted + /// as arrow::extension::json() ExtensionType with storage type utf8, large_utf8 or Review Comment: I don't think it makes sense to expose such a setting specifically for JSON (rather than generically for all string-like columns). Also, if you store the Arrow schema, the original storage type should be restored (but that's not implemented for `utf8_view` AFAICT). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org