[ https://issues.apache.org/jira/browse/HBASE-13042?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320564#comment-14320564 ]
Ian Friedman commented on HBASE-13042: -------------------------------------- it's crazy. We're crazy. :O > MR Job to export HFiles directly from an online cluster > ------------------------------------------------------- > > Key: HBASE-13042 > URL: https://issues.apache.org/jira/browse/HBASE-13042 > Project: HBase > Issue Type: New Feature > Reporter: Dave Latham > > We're looking at the best way to bootstrap a new remote cluster. The source > cluster has a a large table of compressed data using more than 50% of the > HDFS capacity and we have a WAN link to the remote cluster. Ideally we would > set up replication to a new table remotely, snapshot the source table, copy > the snapshot across, then bulk load it into the new table. However the > amount of time to copy the data remotely is greater than the major compaction > interval so the source cluster would run out of storage. > One approach is HBASE-13031 to allow the operators to snapshot and copy one > key range at a time. Here's another idea: > Create a MR job that tries to do a robust remote HFile copy directly: > * Each split is responsible for a key range. > * Map task lookups up that key range and maps it to a set of HDFS store > directories (one for each region/family) > * For each store: > ** List HFiles in store (needs to be less than 1000 files to guarantee > atomic listing) > ** Attempt to copy store files (copy in increasing size order to minimize > likelihood of compaction removing a file during copy) > ** If some of the files disappear (compaction), retry directory list / copy > * If any of the stores disappear (region split / merge) then retry map task > (and remap key range to stores) > Or maybe there are some HBase locking mechanisms for a region or store that > would be better. Otherwise the question is how often would compactions or > region splits force retries. > Is this crazy? -- This message was sent by Atlassian JIRA (v6.3.4#6332)