Region server references deleted compaction log
-----------------------------------------------
Key: HBASE-1894
URL: https://issues.apache.org/jira/browse/HBASE-1894
Project: Hadoop HBase
Issue Type: Bug
Components: regionserver
Affects Versions: 0.20.0
Environment: HBase Version 0.20.0, r805538
HBase Compiled Wed Aug 19 14:38:36 PDT 2009, root
Hadoop Version 0.20.0-plus4681, r767961
Hadoop Compiled Mon Jun 8 18:18:06 UTC 2009, stack
Reporter: elsif
The region server appears to reference blocks from deleted log files after
compaction. Stopping the affected region server causes the region to be
reassigned, after which the region appears to function properly again.
We have seen two instances this week where this caused the region server to get
stuck with the following error:
hbase-root-regionserver-hfs-030015.log:2009-10-07 12:24:41,853 INFO
org.apache.hadoop.hdfs.DFSClient: Could not obtain block
blk_6092375271544310747_250240 from any node: java.io.IOException: No live
nodes contain current block
.
.
.
/opt/hbase/logs/hbase-root-regionserver-hfs-030015.log:java.io.IOException:
Cannot open filename /hbase/fc_test/1209236691/json/9131415140626575165
/opt/hbase/logs/hbase-root-regionserver-hfs-030015.log:2009-10-07 13:09:47,343
INFO org.apache.hadoop.hdfs.DFSClient: Could not obtain block
blk_6092375271544310747_250240 from any node: java.io.IOException: No live
nodes contain current block
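To check how many distinct blocks are affected, the region server log can be scanned for the DFSClient message quoted above. The following is a minimal sketch, not part of the original report: the function name `missing_blocks` is hypothetical, and the regular expression assumes the message wording stays exactly as it appears in this excerpt.

```python
import re
from collections import Counter

# Pattern taken from the DFSClient message quoted in this report.
# \s+ tolerates the line wrapping introduced by the email.
MISSING_BLOCK = re.compile(
    r"Could not obtain block\s+(blk_-?\d+_\d+)\s+from any node"
)

def missing_blocks(log_text):
    """Count how often each block ID appears in 'Could not obtain block' errors."""
    return Counter(MISSING_BLOCK.findall(log_text))
```

Calling `missing_blocks(open(path).read())` on a region server log would give one counter entry per block ID the client failed to read, which makes it easy to see whether a single block or several are involved.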
From the HDFS log we can see that the block in question was part of a
compaction log:
hadoop-root-namenode-hfs-030010.log:2009-10-07 08:48:31,271 INFO
org.apache.hadoop.hdfs.StateChange: BLOCK* NameSystem.allocateBlock:
/hbase/fc_test/compaction.dir/1209236691/2099727166320402414.
blk_6092375271544310747_250240
hadoop-root-namenode-hfs-030010.log:2009-10-07 08:48:31,337 INFO
org.apache.hadoop.hdfs.StateChange: BLOCK* NameSystem.addStoredBlock: blockMap
updated: 10.4.30.39:50010 is added to blk_6092375271544310747_250240 size 360813
hadoop-root-namenode-hfs-030010.log:2009-10-07 08:48:31,338 INFO
org.apache.hadoop.hdfs.StateChange: BLOCK* NameSystem.addStoredBlock: blockMap
updated: 10.4.30.23:50010 is added to blk_6092375271544310747_250240 size 360813
hadoop-root-namenode-hfs-030010.log:2009-10-07 08:48:31,339 INFO
org.apache.hadoop.hdfs.StateChange: BLOCK* NameSystem.addStoredBlock: blockMap
updated: 10.4.30.16:50010 is added to blk_6092375271544310747_250240 size 360813
From the same log we see the block deleted:
hadoop-root-namenode-hfs-030010.log:2009-10-07 12:13:20,738 INFO
org.apache.hadoop.hdfs.StateChange: BLOCK* ask 10.4.30.39:50010 to delete
blk_-8366798629992987785_250239 blk_-278212026264018289_224708
blk_4131756856877489141_230749 blk_6092375271544310747_250240
blk_-1303356519220271320_144130
hadoop-root-namenode-hfs-030010.log:2009-10-07 12:13:26,739 INFO
org.apache.hadoop.hdfs.StateChange: BLOCK* ask 10.4.30.23:50010 to delete
blk_3405064566022477311_135928 blk_-7806500863137897827_139699
blk_6092375271544310747_250240
hadoop-root-namenode-hfs-030010.log:2009-10-07 12:13:29,739 INFO
org.apache.hadoop.hdfs.StateChange: BLOCK* ask 10.4.30.16:50010 to delete
blk_-3938002954120963098_139149 blk_2284074818158875873_144134
blk_519099260905493191_222718 blk_7684159545746656462_144137
blk_7857071994919747100_144127 blk_6311161030310597085_135919
blk_-8366798629992987785_250239 blk_5638304438869005376_142347
blk_-1019479064035180992_143997 blk_4131756856877489141_230749
blk_6092375271544310747_250240 blk_-1520990752128999182_192595
blk_1714900485284856973_230761 blk_-1303356519220271320_144130
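The manual trace above (allocation, replica registration, delete request) can be repeated for any block ID with a small script. This is a rough sketch under stated assumptions: the function name `block_history` is hypothetical, and the patterns are derived only from the three namenode message shapes quoted in this report, so other Hadoop versions may need different expressions.

```python
import re

# Timestamp plus the StateChange prefix shared by all three message types.
TS = r"(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) INFO \S+StateChange: BLOCK\* "
BLK = r"blk_-?\d+_\d+"

ALLOCATE = re.compile(TS + r"NameSystem\.allocateBlock: (\S+)\. (" + BLK + ")")
ADD = re.compile(
    TS + r"NameSystem\.addStoredBlock: blockMap updated: (\S+) is added to (" + BLK + ")"
)
DELETE = re.compile(TS + r"ask (\S+) to delete((?: " + BLK + ")+)")

def block_history(log_text, block_id):
    """Return sorted (timestamp, event, detail) tuples for one block ID."""
    # Collapse all whitespace so wrapped log lines match as a single line.
    text = " ".join(log_text.split())
    events = []
    for ts, path, blk in ALLOCATE.findall(text):
        if blk == block_id:
            events.append((ts, "allocated", path))
    for ts, node, blk in ADD.findall(text):
        if blk == block_id:
            events.append((ts, "stored", node))
    for ts, node, blks in DELETE.findall(text):
        if block_id in blks.split():
            events.append((ts, "delete requested", node))
    # Timestamps share one fixed-width format, so string order is time order.
    return sorted(events)
```

Run against the namenode log with `blk_6092375271544310747_250240`, this should list the file path the block was allocated for, followed by the datanodes that stored it and the datanodes later asked to delete it.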