[ https://issues.apache.org/jira/browse/HUDI-1188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated HUDI-1188:
---------------------------------
    Labels: pull-request-available  (was: )

> MOR hbase index tables not deduplicating records
> ------------------------------------------------
>
>                 Key: HUDI-1188
>                 URL: https://issues.apache.org/jira/browse/HUDI-1188
>             Project: Apache Hudi
>          Issue Type: Bug
>            Reporter: Ryan Pifer
>            Assignee: Ryan Pifer
>            Priority: Major
>              Labels: pull-request-available
>
> After fetching the hbase index entry for a record, Hudi validates that the
> commit timestamp stored in hbase for that record is a commit on the timeline.
> As a result, any record stored to the hbase index during a deltacommit
> (upsert on a MOR table) is considered to have an invalid commit and is
> treated as a new record. The hbase index is therefore updated on every
> write, so a record can end up in multiple partitions, and even in different
> file groups within the same partition.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
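
The validation described in the issue can be illustrated with a minimal, self-contained sketch. This is not Hudi's code: the class name IndexCommitCheckSketch, both method names, and the representation of the timeline as a map from timestamp to action are hypothetical, chosen only for illustration. The second method shows one possible direction for a fix (also accepting deltacommit instants), which is an assumption and not something stated in the issue.

```java
import java.util.Map;

// Hypothetical sketch, not Hudi's actual implementation.
// The timeline is modelled as a map from instant timestamp to action name.
final class IndexCommitCheckSketch {

    // Check as described in the issue: a timestamp fetched from the index is
    // valid only if it belongs to a "commit" instant on the timeline.
    static boolean isValidCommitOnly(Map<String, String> timeline, String indexedTs) {
        return "commit".equals(timeline.get(indexedTs));
    }

    // Assumed alternative: also accept "deltacommit" instants, which is the
    // action an upsert on a MOR table produces according to the issue.
    static boolean isValidCommitOrDeltaCommit(Map<String, String> timeline, String indexedTs) {
        String action = timeline.get(indexedTs);
        return "commit".equals(action) || "deltacommit".equals(action);
    }

    public static void main(String[] args) {
        // Made-up timestamps: "001" is a commit, "002" is a deltacommit.
        Map<String, String> timeline = Map.of("001", "commit", "002", "deltacommit");

        // The record was indexed during deltacommit "002".
        System.out.println(isValidCommitOnly(timeline, "002"));          // prints false
        System.out.println(isValidCommitOrDeltaCommit(timeline, "002")); // prints true
    }
}
```

With the commit-only check, the index entry written at "002" is rejected, so the record would be tagged as new on the next upsert, which matches the duplication reported above.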