The tool I used is Hbackup: https://github.com/urbanairship/hbackup
I will look into S3distcp. The name suggests it should be sufficient for me to load the data.

However, I have a more general question: how do people who back up HBase tables to S3 test the restore? My backup ran for about a day, and there were a couple of exceptions in the logs. How do I test the table? Do I need to recreate the Hadoop/HBase cluster to check whether everything went well?

regards,
Prem

On Wed, Aug 8, 2012 at 6:54 PM, Dan Young <danoyo...@gmail.com> wrote:

> Have you looked into s3distcp?
>
> Regards,
>
> Dano
> On Aug 8, 2012 7:21 AM, "prem yadav" <ipremya...@gmail.com> wrote:
>
>> Hi,
>> I recently used a backup tool to back up all my HDFS data to S3. The data
>> is on S3 in multiparts.
>> I need to test the restore now. Could you please give me some pointers on
>> how to test this.
>>
>> 1) Do I need to create another cluster? The data is around 3 TB in size.
>> 2) How do I upload multipart data from S3 to HDFS cluster?
>>
>>
>> regards,
>> Prem