[ https://issues.apache.org/jira/browse/PARQUET-2373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17789296#comment-17789296 ]
ASF GitHub Bot commented on PARQUET-2373: ----------------------------------------- wgtmac commented on PR #1184: URL: https://github.com/apache/parquet-mr/pull/1184#issuecomment-1825081874 @zhangjiashen This can be rebased to adopt parquet-format 2.10.0 > Improve I/O performance with bloom_filter_length > ------------------------------------------------ > > Key: PARQUET-2373 > URL: https://issues.apache.org/jira/browse/PARQUET-2373 > Project: Parquet > Issue Type: Improvement > Reporter: Jiashen Zhang > Priority: Minor > > The spec PARQUET-2257 has added bloom_filter_length for reader to load the > bloom filter in a single shot. This implementation alters the code to make > use of the 'bloom_filter_length' field for loading the bloom filter > (consisting of the header and bitset) in order to enhance I/O scheduling. -- This message was sent by Atlassian Jira (v8.20.10#820010)