[ https://issues.apache.org/jira/browse/PARQUET-1061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091799#comment-16091799 ]
Ryan Blue commented on PARQUET-1061: ------------------------------------ Did you set the property to enable dictionary filtering? https://github.com/apache/parquet-mr/blob/master/parquet-hadoop/src/main/java/org/apache/parquet/filter2/compat/RowGroupFilter.java#L95 > parquet dictionary filter does not work. > ---------------------------------------- > > Key: PARQUET-1061 > URL: https://issues.apache.org/jira/browse/PARQUET-1061 > Project: Parquet > Issue Type: Bug > Components: parquet-mr > Affects Versions: 1.9.0 > Environment: Hive 2.2.0 + Parquet-mr 1.9.0/master > Reporter: Junjie Chen > > When perform selective query, we observed that dictionary filter was not > applied. Please see following code snippet. > if (rowGroupOffsets != null) { > // verify a row group was found for each offset > List<BlockMetaData> blocks = reader.getFooter().getBlocks(); > if (blocks.size() != rowGroupOffsets.length) { > throw new IllegalStateException( > "All of the offsets in the split should be found in the file." > + " expected: " + Arrays.toString(rowGroupOffsets) > + " found: " + blocks); > } > } else { > *Why apply data filter when row group offset equal to null? * > // apply data filters > reader.filterRowGroups(getFilter(configuration)); > } > I can enable filter after move else block code into second layer if. -- This message was sent by Atlassian JIRA (v6.4.14#64029)