Vinayak Hegde created HBASE-29220:
-------------------------------------
Summary: Track the Age/Timestamp of the Last Successfully
Backed-Up WAL Entry in Continuous Backup Replication Endpoint
Key: HBASE-29220
URL: https://issues.apache.org/jira/browse/HBASE-29220
Project: HBase
Issue Type: Task
Components: backup&restore
Reporter: Vinayak Hegde
We use HBase’s replication framework for Continuous Backup through
{{ContinuousBackupReplicationEndpoint}}. This replicates WAL entries to the
backup location, which are then used for Point-In-Time Recovery (PITR) and
Incremental Backup (an optimization technique that collects WALs and generates
HFiles for faster recovery).
However, the {{ReplicationEndpoint}} can lag behind the WAL entries being
written. For example, if replication is one hour behind,
{{ContinuousBackupReplicationEndpoint}} will currently be writing WAL entries
that are one hour old. This means that if a user requests a PITR for the
current time or attempts an incremental backup, the result will be missing the
most recent hour of data.
To prevent this, we need to ensure that users can only request data that has
been fully backed up. Therefore, we must track the timestamp of the last
successfully backed-up WAL entry:
* For PITR: Users should only be allowed to restore to a point before this
timestamp.
* For Incremental Backup: The incremental backup process should store this
timestamp as the backup time to maintain data consistency.
This ensures data integrity and prevents users from requesting backups that
include unprocessed WAL entries.
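
The tracking described above could look roughly like the following. This is only a minimal sketch, not the proposed implementation: the class {{BackupWatermark}} and all of its method names are hypothetical and do not exist in HBase, and it assumes the endpoint can report the largest WAL entry timestamp of each batch once that batch has been durably written to the backup location.

```java
import java.util.concurrent.atomic.AtomicLong;

/** Hypothetical helper for illustration only; not part of HBase. */
final class BackupWatermark {
  /** Sentinel meaning "no WAL entry has been backed up yet". */
  static final long NONE = -1L;

  private final AtomicLong lastBackedUpTs = new AtomicLong(NONE);

  /**
   * Assumed to be called by the endpoint after a batch has been durably
   * written. Taking the max keeps the watermark from moving backwards if
   * batches are reported out of order.
   */
  void onBatchBackedUp(long maxEntryTimestampMs) {
    lastBackedUpTs.accumulateAndGet(maxEntryTimestampMs, Math::max);
  }

  /** Timestamp of the last successfully backed-up WAL entry, or NONE. */
  long lastBackedUpTimestamp() {
    return lastBackedUpTs.get();
  }

  /** PITR: only targets strictly before the watermark are accepted. */
  void checkPitrTarget(long targetMs) {
    long watermark = lastBackedUpTs.get();
    if (watermark == NONE || targetMs >= watermark) {
      throw new IllegalArgumentException(
        "PITR target " + targetMs + " is not before the last backed-up WAL timestamp " + watermark);
    }
  }

  /** Incremental backup: the watermark is what gets recorded as the backup time. */
  long incrementalBackupTime() {
    long watermark = lastBackedUpTs.get();
    if (watermark == NONE) {
      throw new IllegalStateException("No WAL entries have been backed up yet");
    }
    return watermark;
  }
}
```

The sketch keeps a single in-memory value for simplicity. A real implementation would presumably need to persist the timestamp and combine the values from all sources that replicate WALs, for example by taking the minimum across them, but those details are outside what this issue specifies.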