[ 
https://issues.apache.org/jira/browse/PARQUET-2252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17695022#comment-17695022
 ] 

ASF GitHub Bot commented on PARQUET-2252:
-----------------------------------------

zhongyujiang opened a new pull request, #1038:
URL: https://github.com/apache/parquet-mr/pull/1038

   …implement page skipping.
   
   Issue: [PARQUET-2252](https://issues.apache.org/jira/browse/PARQUET-2252)
   
   This PR makes public some of the methods required to implement column index 
filtering, so that Iceberg can build its own column index filter. Since Iceberg 
is going to calculate `RowRanges` itself, this PR also adds a public method to 
`ParquetFileReader` that lets users pass in `RowRanges` to read a filtered 
row group. For an example of how these changes are used, see this 
[PR](https://github.com/apache/iceberg/pull/6967), which currently uses 
reflection as a workaround.
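
To illustrate why reflection is only a stopgap, here is a minimal, self-contained sketch of that kind of workaround. It does not use the Parquet API: `HiddenRanges` and its `rowCount` method are hypothetical stand-ins for a library class whose useful method is not public, and `rowCountViaReflection` is an illustrative helper name.

```java
import java.lang.reflect.Method;

// Hypothetical stand-in for a library class with a non-public method.
// It is NOT Parquet's RowRanges; the names here are for illustration only.
class HiddenRanges {
    private final long from;
    private final long to;

    HiddenRanges(long from, long to) {
        this.from = from;
        this.to = to;
    }

    // Private, so code in other classes cannot call it directly.
    private long rowCount() {
        return to - from + 1;
    }
}

public class ReflectionWorkaround {

    // Looks the private method up by name and forces access to it.
    // The compiler cannot check the name or signature, so a rename in the
    // library only surfaces as a runtime exception.
    static long rowCountViaReflection(HiddenRanges ranges) throws ReflectiveOperationException {
        Method method = HiddenRanges.class.getDeclaredMethod("rowCount");
        method.setAccessible(true);
        return (Long) method.invoke(ranges);
    }

    public static void main(String[] args) throws ReflectiveOperationException {
        System.out.println(rowCountViaReflection(new HiddenRanges(10, 19)));
    }
}
```

Once the needed methods are public, a caller can invoke them directly, and any later signature change is caught at compile time rather than at runtime.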




> Make some methods public to allow external projects to implement page skipping
> ------------------------------------------------------------------------------
>
>                 Key: PARQUET-2252
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2252
>             Project: Parquet
>          Issue Type: New Feature
>            Reporter: Yujiang Zhong
>            Priority: Major
>
> Iceberg hopes to implement a column index filter based on Iceberg's own 
> expressions. To do so, we would like to use some of the methods in the Parquet 
> repo, for example methods in `RowRanges` and `IndexIterator`. However, these 
> are not currently public, so for now we can only use them through 
> reflection.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
