[ https://issues.apache.org/jira/browse/PARQUET-1951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gabor Szadovszky reassigned PARQUET-1951: ----------------------------------------- Assignee: satish > Allow different strategies to combine key values when merging parquet files > --------------------------------------------------------------------------- > > Key: PARQUET-1951 > URL: https://issues.apache.org/jira/browse/PARQUET-1951 > Project: Parquet > Issue Type: Improvement > Reporter: satish > Assignee: satish > Priority: Minor > > I work on Apache Hudi project. We store some additional metadata in parquet > files (key range in the file, for example). So the metadata is different in > different parquet files that we want to merge these files. > Here is what I'm thinking: > 1) Merge command takes additional command line option: --strategy > <StrategyClassName>. > 2) We introduce new strategy class in parquet-hadoop to keep the same > behavior as today. > We can extend that class and provide our custom implementation. -- This message was sent by Atlassian Jira (v8.3.4#803005)