wgtmac commented on code in PR #45360:
URL: https://github.com/apache/arrow/pull/45360#discussion_r2086194655
##########
cpp/src/parquet/properties.h:
##########
@@ -245,6 +245,34 @@ class PARQUET_EXPORT ColumnProperties {
bool page_index_enabled_;
};
+// EXPERIMENTAL: Options for content-defined chunking.
+struct PARQUET_EXPORT CdcOptions {
+ /// Minimum chunk size in bytes, default is 256 KiB
Review Comment:
> A chunk is always a data page, right?
IIUC, a single chunk may contain many pages if the configured data page
size is significantly smaller than the min chunk size. If so, it would be much
clearer to explain the relationship between the chunk size and data page
size configurations.
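
To make the question concrete, here is a rough sketch of the arithmetic. It assumes a chunk is split into pages purely by the data page size limit, which is my reading and not confirmed behavior of this PR. `PagesPerChunk` is a hypothetical helper, and the 64 KiB and 1 MiB page sizes are illustrative values only.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper, not part of the PR: number of data pages needed to
// hold one chunk, ASSUMING pages are cut purely by the page size limit.
constexpr int64_t PagesPerChunk(int64_t chunk_size_bytes, int64_t page_size_bytes) {
  assert(page_size_bytes > 0);
  // Ceiling division: a partially filled last page still counts as a page.
  return (chunk_size_bytes + page_size_bytes - 1) / page_size_bytes;
}

// 256 KiB, the default minimum chunk size stated in the doc comment under review.
constexpr int64_t kMinChunkSize = 256 * 1024;

// Illustrative 64 KiB page size limit: a minimum-sized chunk spans 4 pages.
static_assert(PagesPerChunk(kMinChunkSize, 64 * 1024) == 4, "chunk spans 4 pages");

// Illustrative 1 MiB page size limit: the whole chunk fits in a single page.
static_assert(PagesPerChunk(kMinChunkSize, 1024 * 1024) == 1, "chunk fits in 1 page");
```

If that assumption holds, whether a chunk maps to one page or to several depends entirely on how the two settings compare, which is what the doc comment should spell out.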
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.