[ https://issues.apache.org/jira/browse/PARQUET-1801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17038172#comment-17038172 ]
Gabor Szadovszky commented on PARQUET-1801: ------------------------------------------- Currently, only column indexes are the special data that does not belong to the row groups/pages neither to the footer. But, bloom filter is also on the way which will be similar to the column indexes. Also, there are more than one feature requires to copy existing column chunks and column indexes untouched and some others to be rewritten. I think, it would be a good idea to have these requirements implemented in a way that it does not belong to parquet-tools or parquet-cli and can be used for both the 'prune' and the 'mask' features (and maybe the properly implemented merge feature as well). > Add column index support for 'prune' command in Parquet-tools/cli > ----------------------------------------------------------------- > > Key: PARQUET-1801 > URL: https://issues.apache.org/jira/browse/PARQUET-1801 > Project: Parquet > Issue Type: Improvement > Components: parquet-cli, parquet-mr > Affects Versions: 1.12.0 > Reporter: Xinli Shang > Assignee: Xinli Shang > Priority: Major > Fix For: 1.12.0 > > -- This message was sent by Atlassian Jira (v8.3.4#803005)