Vladimir Rodionov created HBASE-15442:
-----------------------------------------

             Summary: HBase Backup Phase 2: Potential data loss and or data 
duplication in incremental backup
                 Key: HBASE-15442
                 URL: https://issues.apache.org/jira/browse/HBASE-15442
             Project: HBase
          Issue Type: Bug
            Reporter: Vladimir Rodionov
            Assignee: Vladimir Rodionov
            Priority: Critical


Suppose we have two tables T1 and T2

# Create full backup T1 with backup id = B1
# Create full backup T2 backupId = B2
# New data arrived into file WAL1
# Create incremental backup of T1 with backupId = B3
# Create incremental backup of T2 with backupid = B4

The directory structure for backup site after this steps

BACKUP_ROOT/WALs/B3
BACKUP_ROOT/WALs/B4
BACKUP_ROOT/T1/B1
BACKUP_ROOT/T2/B2

File WAL1 may end up either in BACKUP_ROOT/WALs/B3 or in both: 
BACKUP_ROOT/WALs/B3 and BACKUP_ROOT/WALs/B4 location. Both are bad: in first 
case we lose data for backup B4 in second case we have duplicate copies of WAL1









--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to