Hi,

I'm wondering if it's possible to export all data from one HBase cluster
and import it into another.  We have a lot of data that we've imported
into our staging HBase environment, and rather than repeating the
lengthy import process in our production environment we would prefer to
just copy all the data directly from HBase/HDFS in staging into
production.  Is there an easy way to do this?  I know Hadoop has some
distributed copy functionality, but I don't know if this will work with
HBase.  The number of region servers and the replication factor will be
the same in the source and destination environments, but the
hostnames/IPs will be different.  The production environment is
completely empty right now, so we don't need to worry about overwriting
data.

I came across these links while searching for information on HBase
export/import:

http://issues.apache.org/jira/browse/HBASE-897
http://issues.apache.org/jira/browse/HBASE-1684
http://hadoop.apache.org/hbase/docs/current/api/org/apache/hadoop/hbase/mapreduce/Export.html

Has anyone used these tools?  Is there a better way?

Thanks,
James

