[
https://issues.apache.org/jira/browse/PARQUET-2413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Gang Wu resolved PARQUET-2413.
------------------------------
Fix Version/s: 1.14.0
Assignee: Claire McGinty
Resolution: Fixed
> Support custom file footer metadata via ParquetWriter
> -----------------------------------------------------
>
> Key: PARQUET-2413
> URL: https://issues.apache.org/jira/browse/PARQUET-2413
> Project: Parquet
> Issue Type: Improvement
> Reporter: Claire McGinty
> Assignee: Claire McGinty
> Priority: Minor
> Fix For: 1.14.0
>
>
> Currently Parquet supports extra metadata keys passed via
> [WriteContext|https://github.com/apache/parquet-mr/blob/945836c79b5bd3003512ace9e2d30d4cd03422f3/parquet-hadoop/src/main/java/org/apache/parquet/hadoop/api/WriteSupport.java#L49-L54].
> However, in practice these are limited to implementation-specific values
> (e.g. AvroWriteSupport adds a
> [key|https://github.com/apache/parquet-mr/blob/945836c79b5bd3003512ace9e2d30d4cd03422f3/parquet-avro/src/main/java/org/apache/parquet/avro/AvroWriteSupport.java#L153-L154]
> containing the serialized Avro schema), and cannot be overridden or extended
> by the end user.
>
> Maybe we could extend the ParquetWriter builder to support this? For example:
>
> ```
> ParquetWriter<T> writer =
>     AvroParquetWriter.<T>builder(outputFile)
>         .withConf(...)
>         .withPageSize(...)
> +       .withExtraMetadata(Map.of(..))
>         .build();
> ```
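
To make the request concrete, here is a minimal, dependency-free sketch of how user-supplied footer metadata might be combined with the keys a WriteSupport already contributes. It does not use the Parquet API: the class and method names (FooterMetadataSketch, mergeFooterMetadata) are hypothetical, and rejecting a key that collides with a write-support key is an assumption made for illustration, not something this message specifies.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of parquet-mr: illustrates one possible
// policy for combining write-support metadata with user-supplied metadata.
public class FooterMetadataSketch {

    // Returns a new map holding the write-support entries followed by the
    // user entries. Assumption: a user key that collides with a
    // write-support key is rejected instead of silently overriding it.
    static Map<String, String> mergeFooterMetadata(
            Map<String, String> writeSupportMetadata,
            Map<String, String> userMetadata) {
        Map<String, String> merged = new LinkedHashMap<>(writeSupportMetadata);
        for (Map.Entry<String, String> entry : userMetadata.entrySet()) {
            if (merged.containsKey(entry.getKey())) {
                throw new IllegalArgumentException(
                        "Metadata key already set by the write support: " + entry.getKey());
            }
            merged.put(entry.getKey(), entry.getValue());
        }
        return merged;
    }

    public static void main(String[] args) {
        // Stand-in for the implementation-specific key described above
        // (the key name and value here are illustrative only).
        Map<String, String> fromWriteSupport =
                Map.of("parquet.avro.schema", "{\"type\":\"record\"}");
        // Stand-in for what an end user would pass to the builder.
        Map<String, String> fromUser = Map.of("pipeline.version", "42");

        Map<String, String> footer = mergeFooterMetadata(fromWriteSupport, fromUser);
        System.out.println(footer);
    }
}
```

Failing fast on a collision keeps implementation-specific keys, such as the serialized Avro schema, from being clobbered by accident. A design that lets the user override those keys would replace the exception with a plain put.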