[ 
https://issues.apache.org/jira/browse/HBASE-14030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vladimir Rodionov updated HBASE-14030:
--------------------------------------
    Release Note: 
This experimental feature allows to perform backup/restore operations, 
including incremental ones, on a set of HBase tables. 

Key features and Use Cases

A common practice of backup and restore in database is to first take full 
baseline backup, and
then periodically take incremental backup that capture the changes since the 
full baseline
backup. HBase cluster can store massive amount data. Therefore we want use full 
backup in
combination with incremental backups for HBase as well.
The following is a typical use case scenario for full and incremental backup:

● The user takes a full backup of a table or a set of tables in HBase.
● The user schedules periodical incremental backups to capture the changes from 
the full
backup, or from last incremental backup.
● The user needs to restore table data to a past point in time.
● The full backup is restored to the table(s) or to different table name(s). 
Then the
incremental backups that are up to the desired point in time are applied on top 
of the full
backup.
We would support the following key features and capabilities.
● Backup to DFS FileSystem across clusters and possibly to other storage media 
or
servers.
● Support single table or a set of tables backup and restore (full and 
incremental).
● Restore to different table names and to different clusters. 
● Support adding and removing tables to and from backup set without 
interruption of
incremental backup schedule.
● Support merge of incremental backups into longer period and bigger incremental
backups for easy storage and restore.
● Support scheduled backups.
● Unified command line interface for all the above.

To illustrate these key capabilities, the following are two more detailed use 
case examples.

Use case example 1:

1. User takes a full backup of a set of tables (i.e. table1 and table2) in 
HBase.
2. User takes incremental backups. The incremental backup will only track 
table1 and
table2.
3. User adds other tables (i.e. table3 and table4) in HBase, and an implicit 
full backup is
executed during the add process
4. User continues to take incremental backups. The incremental backup data 
would cover
table1, table2, table3 and table4.
5. User wants to restore table3 and table4 to a past PIT (point-in-time).
6. Full backup in 3. is restored onto HBase cluster. Then the incremental 
backups after that
full backup are applied on top of the full restore until the PIT.

Use case example 2:

1. User takes a full backup of a set of tables in HBase.
2. User takes daily incremental backups.
3. User merges the daily incremental backups into weekly incremental backups.
4. User combines/rolls up the weekly incremental backup into monthly incremental
backups.
5. User wants to restore the tables to a past PIT.
6. Full backup is restored onto HBase cluster.
7. Monthly incremental backups before the desired PIT are applied.
8. Closest daily backups up to the PIT are applied.





  was:
This experimental feature allows to perform backup/restore operations, 
including incremental ones, on a set of HBase tables. 




> HBase Backup/Restore Phase 1
> ----------------------------
>
>                 Key: HBASE-14030
>                 URL: https://issues.apache.org/jira/browse/HBASE-14030
>             Project: HBase
>          Issue Type: Umbrella
>    Affects Versions: 2.0.0
>            Reporter: Vladimir Rodionov
>            Assignee: Vladimir Rodionov
>             Fix For: 2.0.0
>
>         Attachments: HBASE-14030-v0.patch, HBASE-14030-v1.patch, 
> HBASE-14030-v10.patch, HBASE-14030-v11.patch, HBASE-14030-v12.patch, 
> HBASE-14030-v13.patch, HBASE-14030-v14.patch, HBASE-14030-v15.patch, 
> HBASE-14030-v17.patch, HBASE-14030-v18.patch, HBASE-14030-v2.patch, 
> HBASE-14030-v20.patch, HBASE-14030-v21.patch, HBASE-14030-v22.patch, 
> HBASE-14030-v23.patch, HBASE-14030-v24.patch, HBASE-14030-v25.patch, 
> HBASE-14030-v3.patch, HBASE-14030-v4.patch, HBASE-14030-v5.patch, 
> HBASE-14030-v6.patch, HBASE-14030-v7.patch, HBASE-14030-v8.patch
>
>
> This is the umbrella ticket for Backup/Restore Phase 1. See HBASE-7912 design 
> doc for the phase description.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to