[ https://issues.apache.org/jira/browse/HUDI-7569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated HUDI-7569:
---------------------------------
    Labels: hudi-1.0.0-beta2 pull-request-available  (was: hudi-1.0.0-beta2)

> Fix wrong result while using RLI for pruning files
> --------------------------------------------------
>
>                 Key: HUDI-7569
>                 URL: https://issues.apache.org/jira/browse/HUDI-7569
>             Project: Apache Hudi
>          Issue Type: Bug
>            Reporter: Vinaykumar Bhat
>            Priority: Major
>              Labels: hudi-1.0.0-beta2, pull-request-available
>             Fix For: 1.0.0
>
>
> Data skipping (pruning files) for RLI is supported only when the query
> predicate has `EqualTo` or `In` expressions/filters on the record-key column.
> However, the logic for detecting a valid `In` expression/filter on the
> record-key column is buggy: it prunes files on the assumption that an `In`
> expression/filter always references the record-key column, even when the
> `In` filter is on some other column.
>
> For example, a query of the form `select * from trips_table where driver in
> ('abc', 'xyz')` can return wrong results if some rows of the table also have
> record-key values 'abc' or 'xyz'.

--
This message was sent by Atlassian Jira (v8.20.10#820010)
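
The failure mode described in the issue can be reproduced with a small toy model. This is only an illustrative sketch and not Hudi's actual code: the `uuid` record-key column, the `InFilter` class, the in-memory `FILES` table, the `RLI` dictionary, and the `prune_buggy`/`prune_fixed`/`scan` helpers are all hypothetical names made up for this example. It assumes that a record-level index maps each record key to the file that holds it, and that pruning returns the set of files to scan.

```python
from dataclasses import dataclass

RECORD_KEY = "uuid"  # hypothetical record-key column for the toy table


@dataclass(frozen=True)
class InFilter:
    """Toy stand-in for an `In` predicate: column IN (values...)."""
    column: str
    values: tuple


# Toy table: file name -> rows stored in that file.
FILES = {
    "f1": [{"uuid": "abc", "driver": "joe"}],
    "f2": [{"uuid": "k2", "driver": "abc"}],
    "f3": [{"uuid": "k3", "driver": "xyz"}],
}

# Toy record-level index: record key -> file containing that key.
RLI = {row[RECORD_KEY]: name for name, rows in FILES.items() for row in rows}


def prune_buggy(f: InFilter) -> set:
    # Bug being illustrated: the literals are looked up as record keys
    # without checking which column the filter refers to.
    return {RLI[v] for v in f.values if v in RLI}


def prune_fixed(f: InFilter) -> set:
    # Only use the index when the filter is on the record-key column;
    # otherwise fall back to scanning every file (no pruning).
    if f.column != RECORD_KEY:
        return set(FILES)
    return {RLI[v] for v in f.values if v in RLI}


def scan(files: set, f: InFilter) -> list:
    # Read only the candidate files and apply the filter to their rows.
    return [row for name in sorted(files) for row in FILES[name]
            if row[f.column] in f.values]


driver_query = InFilter("driver", ("abc", "xyz"))
key_query = InFilter(RECORD_KEY, ("abc",))
```

In this toy table, `prune_buggy(driver_query)` keeps only `f1` (because `'abc'` happens to be a record key there), so the scan misses the matching rows in `f2` and `f3`. `prune_fixed` still prunes for `key_query` but leaves the `driver` query unpruned.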