[ https://issues.apache.org/jira/browse/PARQUET-1685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Gabor Szadovszky resolved PARQUET-1685. --------------------------------------- Resolution: Fixed > Truncate the stored min and max for String statistics to reduce the footer > size > -------------------------------------------------------------------------------- > > Key: PARQUET-1685 > URL: https://issues.apache.org/jira/browse/PARQUET-1685 > Project: Parquet > Issue Type: Improvement > Components: parquet-mr > Affects Versions: 1.10.1 > Reporter: Xinli Shang > Assignee: Xinli Shang > Priority: Major > Labels: pull-request-available > Fix For: 1.11.0 > > > Iceberg has a cool feature that truncates the stored min, max statistics to > minimize the metadata size. We can borrow to truncate them in Parquet also to > reduce the size of the footer, or even the page header. Here is the code in > IceBerg > [https://github.com/apache/incubator-iceberg/blob/master/api/src/main/java/org/apache/iceberg/util/UnicodeUtil.java]. > > > > > > > -- This message was sent by Atlassian Jira (v8.3.4#803005)