In HBase 0.94, bulkload would always copy the files:

// Move the file if it's on another filesystem
FileSystem srcFs = srcPath.getFileSystem(conf);
if (!srcFs.equals(fs)) {
   LOG.info("File " + srcPath + " on different filesystem than " +
      "destination store - moving to this filesystem.");
    ......

Since fs here is an instance of HFileSystem. This would result in the
load taking long time to complete if the HFiles are in the destination
cluster.

This is fixed in trunk:

FileSystem srcFs = srcPath.getFileSystem(conf);
FileSystem desFs = fs instanceof HFileSystem ?
((HFileSystem)fs).getBackingFs() : fs;
    if (!srcFs.equals(desFs)) {
    ... ...

My question is: should we back port the fix to 0.94?

Reply via email to