Hans Zeller created TRAFODION-1820:
--------------------------------------

             Summary: HBase error after a failed bulk load in Trafodion
                 Key: TRAFODION-1820
                 URL: https://issues.apache.org/jira/browse/TRAFODION-1820
             Project: Apache Trafodion
          Issue Type: Bug
          Components: sql-general
    Affects Versions: 1.1 (pre-incubation)
         Environment: Any
            Reporter: Hans Zeller


When using the Trafodion bulk loader (LOAD statement), if the load fails in the 
middle, it may leave HBase in a state where the HMaster produces 
security-related errors. In the case I saw, the HMaster failed to start, due to 
these errors. Here is an example:

Unhandled exception. Starting shutdown.
java.io.IOException: Received error when attempting to archive files ([class 
org.apache.hadoop.hbase.backup.HFileArchiver$FileablePath, 
file:hdfs://ip-10-0-0-132.ec2.internal:8020/hbase/.tmp/data/default/TRAFODION.SEABASE.LSTATS1/03a60516389fcc3e02e5de07f936cbb1/#1,
 class org.apache.hadoop.hbase.backup.HFileArchiver$FileablePath, 
file:hdfs://ip-10-0-0-132.ec2.internal:8020/hbase/.tmp/data/default/TRAFODION.SEABASE.LSTATS1/03a60516389fcc3e02e5de07f936cbb1/mt_,
 class org.apache.hadoop.hbase.backup.HFileArchiver$FileablePath, 
file:hdfs://ip-10-0-0-132.ec2.internal:8020/hbase/.tmp/data/default/TRAFODION.SEABASE.LSTATS1/03a60516389fcc3e02e5de07f936cbb1/recovered.edits]),
 cannot delete region directory. 
        at 
org.apache.hadoop.hbase.backup.HFileArchiver.archiveRegion(HFileArchiver.java:148)
        at 
org.apache.hadoop.hbase.master.MasterFileSystem.checkTempDir(MasterFileSystem.java:508)
        at 
org.apache.hadoop.hbase.master.MasterFileSystem.createInitialFileSystemLayout(MasterFileSystem.java:149)
        at 
org.apache.hadoop.hbase.master.MasterFileSystem.<init>(MasterFileSystem.java:126)
        at 
org.apache.hadoop.hbase.master.HMaster.finishActiveMasterInitialization(HMaster.java:611)
        at org.apache.hadoop.hbase.master.HMaster.access$500(HMaster.java:169)
        at org.apache.hadoop.hbase.master.HMaster$1.run(HMaster.java:1484)
        at java.lang.Thread.run(Thread.java:745)


The solution is to do a "chown hbase:hbase" command in HDFS, to give the files 
to the hbase user. A fix for this problem might do this chown operation earlier 
in the Trafodion code path, or use some other method to avoid the security 
error.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to