[ 
https://issues.apache.org/jira/browse/HBASE-8912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13810240#comment-13810240
 ] 

Sergey Kirichenko commented on HBASE-8912:
------------------------------------------

May be this helps (HBase from cloudera - 0.94.6-cdh4.4.0):

grep by region caused exception on master:
{noformat}
2013-10-31 00:07:52,871 WARN org.apache.hadoop.hbase.master.AssignmentManager: 
Region 3a476d37da81f620a3e53179d7d9192b has null regionLocation. But its table 
table_x isn't in ENABLING state.
2013-10-31 00:07:53,057 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
master:60000-0x242045137a20070 Async create of unassigned node for 
3a476d37da81f620a3e53179d7d9192b with OFFLINE state
2013-10-31 00:07:53,467 DEBUG 
org.apache.hadoop.hbase.master.AssignmentManager$CreateUnassignedAsyncCallback: 
rs=table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 state=OFFLINE, ts=1383163673057, server=null, server=xxx100,60020,1383163665902
2013-10-31 00:07:53,495 DEBUG 
org.apache.hadoop.hbase.master.AssignmentManager$ExistsUnassignedAsyncCallback: 
rs=table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 state=OFFLINE, ts=1383163673057, server=null
2013-10-31 00:07:54,834 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Handling transition=RS_ZK_REGION_OPENING, server=xxx100,60020,1383163665902, 
region=3a476d37da81f620a3e53179d7d9192b
2013-10-31 00:07:56,953 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Handling transition=RS_ZK_REGION_FAILED_OPEN, 
server=xxx100,60020,1383163665902, region=3a476d37da81f620a3e53179d7d9192b
2013-10-31 00:07:56,953 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Found an existing plan for 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 destination server is xxx100,60020,1383163665902
2013-10-31 00:07:56,953 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
No previous transition plan was found (or we are ignoring an existing plan) for 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 so generated a random one; 
hri=table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.,
 src=, dest=xxx108,60020,1383163666006; 9 (online=9, available=8) available 
servers
2013-10-31 00:07:56,955 DEBUG 
org.apache.hadoop.hbase.master.handler.ClosedRegionHandler: Handling CLOSED 
event for 3a476d37da81f620a3e53179d7d9192b
2013-10-31 00:07:56,956 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Forcing OFFLINE; 
was=table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 state=CLOSED, ts=1383163675624, server=xxx100,60020,1383163665902
2013-10-31 00:07:56,956 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
master:60000-0x242045137a20070 Creating (or updating) unassigned node for 
3a476d37da81f620a3e53179d7d9192b with OFFLINE state
2013-10-31 00:07:57,003 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Found an existing plan for 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 destination server is xxx108,60020,1383163666006
2013-10-31 00:07:57,003 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Using pre-existing plan for region 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.;
 
plan=hri=table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.,
 src=, dest=xxx108,60020,1383163666006
2013-10-31 00:07:57,003 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Assigning region 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 to xxx108,60020,1383163666006
2013-10-31 00:07:58,545 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Handling transition=RS_ZK_REGION_FAILED_OPEN, 
server=xxx108,60020,1383163666006, region=3a476d37da81f620a3e53179d7d9192b
2013-10-31 00:07:58,545 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Found an existing plan for 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 destination server is xxx108,60020,1383163666006
2013-10-31 00:07:58,545 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
No previous transition plan was found (or we are ignoring an existing plan) for 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 so generated a random one; 
hri=table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.,
 src=, dest=xxx106,60020,1383163666003; 9 (online=9, available=8) available 
servers
2013-10-31 00:07:58,546 DEBUG 
org.apache.hadoop.hbase.master.handler.ClosedRegionHandler: Handling CLOSED 
event for 3a476d37da81f620a3e53179d7d9192b
2013-10-31 00:07:58,546 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Forcing OFFLINE; 
was=table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 state=CLOSED, ts=1383163677110, server=xxx108,60020,1383163666006
2013-10-31 00:07:58,546 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
master:60000-0x242045137a20070 Creating (or updating) unassigned node for 
3a476d37da81f620a3e53179d7d9192b with OFFLINE state
2013-10-31 00:07:58,553 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Handling transition=RS_ZK_REGION_FAILED_OPEN, 
server=xxx108,60020,1383163666006, region=3a476d37da81f620a3e53179d7d9192b
2013-10-31 00:07:58,554 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Found an existing plan for 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 destination server is xxx106,60020,1383163666003
2013-10-31 00:07:58,554 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
No previous transition plan was found (or we are ignoring an existing plan) for 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 so generated a random one; 
hri=table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.,
 src=, dest=xxx104,60020,1383163665976; 9 (online=9, available=8) available 
servers
2013-10-31 00:07:58,554 DEBUG 
org.apache.hadoop.hbase.master.handler.ClosedRegionHandler: Handling CLOSED 
event for 3a476d37da81f620a3e53179d7d9192b
2013-10-31 00:07:58,554 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Forcing OFFLINE; 
was=table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 state=CLOSED, ts=1383163677110, server=xxx108,60020,1383163666006
2013-10-31 00:07:58,571 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Found an existing plan for 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 destination server is xxx104,60020,1383163665976
2013-10-31 00:07:58,571 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Using pre-existing plan for region 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.;
 
plan=hri=table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.,
 src=, dest=xxx104,60020,1383163665976
2013-10-31 00:07:58,571 DEBUG org.apache.hadoop.hbase.master.AssignmentManager: 
Assigning region 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 to xxx104,60020,1383163665976
2013-10-31 00:07:58,595 FATAL org.apache.hadoop.hbase.master.HMaster: 
Unexpected state : 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 state=PENDING_OPEN, ts=1383163678594, server=xxx104,60020,1383163665976 .. 
Cannot transit it to OFFLINE.
java.lang.IllegalStateException: Unexpected state : 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
 state=PENDING_OPEN, ts=1383163678594, server=xxx104,60020,1383163665976 .. 
Cannot transit it to OFFLINE.
        at 
org.apache.hadoop.hbase.master.AssignmentManager.setOfflineInZooKeeper(AssignmentManager.java:1831)
        at 
org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1661)
        at 
org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1426)
        at 
org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1398)
        at 
org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1393)
        at 
org.apache.hadoop.hbase.master.handler.ClosedRegionHandler.process(ClosedRegionHandler.java:105)
        at 
org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:175)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
        at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
        at java.lang.Thread.run(Thread.java:662)
{noformat}


grep by region caused exception on xxx100:
{noformat}
2013-10-31 00:07:54,000 INFO 
org.apache.hadoop.hbase.regionserver.HRegionServer: Received request to open 
region: 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
2013-10-31 00:07:54,000 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
regionserver:60020-0x242045137a20071 Attempting to transition node 
3a476d37da81f620a3e53179d7d9192b from M_ZK_REGION_OFFLINE to 
RS_ZK_REGION_OPENING
2013-10-31 00:07:54,029 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
regionserver:60020-0x242045137a20071 Successfully transitioned node 
3a476d37da81f620a3e53179d7d9192b from M_ZK_REGION_OFFLINE to 
RS_ZK_REGION_OPENING
2013-10-31 00:07:55,439 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: 
Opening region: {NAME => 
'table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.',
 STARTKEY => '6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8', ENDKEY => 
'71c71c71c71c71c71c71c71c71c71c71c71c71c0', ENCODED => 
3a476d37da81f620a3e53179d7d9192b,}
2013-10-31 00:07:55,439 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: 
Instantiated 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
2013-10-31 00:07:55,447 DEBUG org.apache.hadoop.hbase.regionserver.StoreFile: 
Store file 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/A/table_y=abca20169d0fdd22f2a13d6caf41d83d-0729d4dbd652443bbd44381db5b7b26b
 is a link
2013-10-31 00:07:55,501 DEBUG org.apache.hadoop.hbase.regionserver.Store: 
loaded 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/A/table_y=abca20169d0fdd22f2a13d6caf41d83d-0729d4dbd652443bbd44381db5b7b26b,
 isReference=false, isBulkLoadResult=false, seqid=46816, majorCompaction=true
2013-10-31 00:07:55,546 DEBUG org.apache.hadoop.hbase.regionserver.StoreFile: 
Store file 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/B/table_y=abca20169d0fdd22f2a13d6caf41d83d-8606885898507153833
 is a link
2013-10-31 00:07:55,602 DEBUG org.apache.hadoop.hbase.regionserver.StoreFile: 
Store file 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/B/table_y=abca20169d0fdd22f2a13d6caf41d83d-b2449476f78e42b7ba0bba2ac69a24b8
 is a link
2013-10-31 00:07:55,613 DEBUG org.apache.hadoop.hbase.regionserver.Store: 
loaded 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/B/table_y=abca20169d0fdd22f2a13d6caf41d83d-b2449476f78e42b7ba0bba2ac69a24b8,
 isReference=false, isBulkLoadResult=false, seqid=46816, majorCompaction=false
2013-10-31 00:07:55,618 ERROR 
org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler: Failed open of 
region=table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.,
 starting to roll back the global memstore size.
2013-10-31 00:07:55,621 INFO 
org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler: Opening of 
region {NAME => 
'table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.',
 STARTKEY => '6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8', ENDKEY => 
'71c71c71c71c71c71c71c71c71c71c71c71c71c0', ENCODED => 
3a476d37da81f620a3e53179d7d9192b,} failed, marking as FAILED_OPEN in ZK
2013-10-31 00:07:55,621 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
regionserver:60020-0x242045137a20071 Attempting to transition node 
3a476d37da81f620a3e53179d7d9192b from RS_ZK_REGION_OPENING to 
RS_ZK_REGION_FAILED_OPEN
2013-10-31 00:07:55,630 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
regionserver:60020-0x242045137a20071 Successfully transitioned node 
3a476d37da81f620a3e53179d7d9192b from RS_ZK_REGION_OPENING to 
RS_ZK_REGION_FAILED_OPEN
{noformat}

grep by region caused exception on xxx108:
{noformat}
2013-10-31 00:07:57,003 INFO 
org.apache.hadoop.hbase.regionserver.HRegionServer: Received request to open 
region: 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
2013-10-31 00:07:57,010 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
regionserver:60020-0x242045137a20074 Attempting to transition node 
3a476d37da81f620a3e53179d7d9192b from M_ZK_REGION_OFFLINE to 
RS_ZK_REGION_OPENING
2013-10-31 00:07:57,042 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
regionserver:60020-0x242045137a20074 Successfully transitioned node 
3a476d37da81f620a3e53179d7d9192b from M_ZK_REGION_OFFLINE to 
RS_ZK_REGION_OPENING
2013-10-31 00:07:57,043 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: 
Opening region: {NAME => 
'table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.',
 STARTKEY => '6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8', ENDKEY => 
'71c71c71c71c71c71c71c71c71c71c71c71c71c0', ENCODED => 
3a476d37da81f620a3e53179d7d9192b,}
2013-10-31 00:07:57,043 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: 
Instantiated 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
2013-10-31 00:07:57,049 DEBUG org.apache.hadoop.hbase.regionserver.StoreFile: 
Store file 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/A/table_y=abca20169d0fdd22f2a13d6caf41d83d-0729d4dbd652443bbd44381db5b7b26b
 is a link
2013-10-31 00:07:57,060 DEBUG org.apache.hadoop.hbase.regionserver.Store: 
loaded 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/A/table_y=abca20169d0fdd22f2a13d6caf41d83d-0729d4dbd652443bbd44381db5b7b26b,
 isReference=false, isBulkLoadResult=false, seqid=46816, majorCompaction=true
2013-10-31 00:07:57,065 DEBUG org.apache.hadoop.hbase.regionserver.StoreFile: 
Store file 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/B/table_y=abca20169d0fdd22f2a13d6caf41d83d-8606885898507153833
 is a link
2013-10-31 00:07:57,095 DEBUG org.apache.hadoop.hbase.regionserver.StoreFile: 
Store file 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/B/table_y=abca20169d0fdd22f2a13d6caf41d83d-b2449476f78e42b7ba0bba2ac69a24b8
 is a link
2013-10-31 00:07:57,105 DEBUG org.apache.hadoop.hbase.regionserver.Store: 
loaded 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/B/table_y=abca20169d0fdd22f2a13d6caf41d83d-b2449476f78e42b7ba0bba2ac69a24b8,
 isReference=false, isBulkLoadResult=false, seqid=46816, majorCompaction=false
2013-10-31 00:07:57,107 ERROR 
org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler: Failed open of 
region=table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.,
 starting to roll back the global memstore size.
2013-10-31 00:07:57,108 INFO 
org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler: Opening of 
region {NAME => 
'table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.',
 STARTKEY => '6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8', ENDKEY => 
'71c71c71c71c71c71c71c71c71c71c71c71c71c0', ENCODED => 
3a476d37da81f620a3e53179d7d9192b,} failed, marking as FAILED_OPEN in ZK
2013-10-31 00:07:57,108 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
regionserver:60020-0x242045137a20074 Attempting to transition node 
3a476d37da81f620a3e53179d7d9192b from RS_ZK_REGION_OPENING to 
RS_ZK_REGION_FAILED_OPEN
2013-10-31 00:07:57,125 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
regionserver:60020-0x242045137a20074 Successfully transitioned node 
3a476d37da81f620a3e53179d7d9192b from RS_ZK_REGION_OPENING to 
RS_ZK_REGION_FAILED_OPEN
{noformat}

grep by region caused exception on xxx104:
{noformat}
2013-10-31 00:07:58,581 INFO 
org.apache.hadoop.hbase.regionserver.HRegionServer: Received request to open 
region: 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
2013-10-31 00:07:58,587 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
regionserver:60020-0x420451326a0070 Attempting to transition node 
3a476d37da81f620a3e53179d7d9192b from M_ZK_REGION_OFFLINE to 
RS_ZK_REGION_OPENING
2013-10-31 00:07:58,602 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
regionserver:60020-0x420451326a0070 Successfully transitioned node 
3a476d37da81f620a3e53179d7d9192b from M_ZK_REGION_OFFLINE to 
RS_ZK_REGION_OPENING
2013-10-31 00:07:58,603 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: 
Opening region: {NAME => 
'table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.',
 STARTKEY => '6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8', ENDKEY => 
'71c71c71c71c71c71c71c71c71c71c71c71c71c0', ENCODED => 
3a476d37da81f620a3e53179d7d9192b,}
2013-10-31 00:07:58,604 DEBUG org.apache.hadoop.hbase.regionserver.HRegion: 
Instantiated 
table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.
2013-10-31 00:07:58,610 DEBUG org.apache.hadoop.hbase.regionserver.StoreFile: 
Store file 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/A/table_y=abca20169d0fdd22f2a13d6caf41d83d-0729d4dbd652443bbd44381db5b7b26b
 is a link
2013-10-31 00:07:58,621 DEBUG org.apache.hadoop.hbase.regionserver.Store: 
loaded 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/A/table_y=abca20169d0fdd22f2a13d6caf41d83d-0729d4dbd652443bbd44381db5b7b26b,
 isReference=false, isBulkLoadResult=false, seqid=46816, majorCompaction=true
2013-10-31 00:07:58,627 DEBUG org.apache.hadoop.hbase.regionserver.StoreFile: 
Store file 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/B/table_y=abca20169d0fdd22f2a13d6caf41d83d-8606885898507153833
 is a link
2013-10-31 00:07:58,639 DEBUG org.apache.hadoop.hbase.regionserver.StoreFile: 
Store file 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/B/table_y=abca20169d0fdd22f2a13d6caf41d83d-b2449476f78e42b7ba0bba2ac69a24b8
 is a link
2013-10-31 00:07:58,650 DEBUG org.apache.hadoop.hbase.regionserver.Store: 
loaded 
hdfs://xxx-master/hbase/table_x/3a476d37da81f620a3e53179d7d9192b/B/table_y=abca20169d0fdd22f2a13d6caf41d83d-b2449476f78e42b7ba0bba2ac69a24b8,
 isReference=false, isBulkLoadResult=false, seqid=46816, majorCompaction=false
2013-10-31 00:07:58,652 ERROR 
org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler: Failed open of 
region=table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.,
 starting to roll back the global memstore size.
2013-10-31 00:07:58,653 INFO 
org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler: Opening of 
region {NAME => 
'table_x,6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8,1382557475861.3a476d37da81f620a3e53179d7d9192b.',
 STARTKEY => '6eeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee8', ENDKEY => 
'71c71c71c71c71c71c71c71c71c71c71c71c71c0', ENCODED => 
3a476d37da81f620a3e53179d7d9192b,} failed, marking as FAILED_OPEN in ZK
2013-10-31 00:07:58,653 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
regionserver:60020-0x420451326a0070 Attempting to transition node 
3a476d37da81f620a3e53179d7d9192b from RS_ZK_REGION_OPENING to 
RS_ZK_REGION_FAILED_OPEN
2013-10-31 00:07:58,670 DEBUG org.apache.hadoop.hbase.zookeeper.ZKAssign: 
regionserver:60020-0x420451326a0070 Successfully transitioned node 
3a476d37da81f620a3e53179d7d9192b from RS_ZK_REGION_OPENING to 
RS_ZK_REGION_FAILED_OPEN
{noformat}

1) Initially AM try to assign region with 'bulk assign' on xxx100 (PENDING_OPEN 
=> RS_ZK_REGION_OPENING); but xxx100 failed to open region and AM handles this 
event (RS_ZK_REGION_FAILED_OPEN => CLOSED => OFFLINE)
2) AM try to assign region in ClosedRegionHandler on xxx108 (there is no 
RS_ZK_REGION_OPENING event in master's logs, but we see it in regionserver's 
logs); it fails again
3) AM chose xxx106 for region assignment but receives RS_ZK_REGION_FAILED_OPEN 
before sending request => CLOSED => ClosedRegionHandler => xxx104 => exception

> [0.94] AssignmentManager throws IllegalStateException from PENDING_OPEN to 
> OFFLINE
> ----------------------------------------------------------------------------------
>
>                 Key: HBASE-8912
>                 URL: https://issues.apache.org/jira/browse/HBASE-8912
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Enis Soztutar
>             Fix For: 0.94.14
>
>         Attachments: HBase-0.94 #1036 test - testRetrying [Jenkins].html
>
>
> AM throws this exception which subsequently causes the master to abort: 
> {code}
> java.lang.IllegalStateException: Unexpected state : 
> testRetrying,jjj,1372891751115.9b828792311001062a5ff4b1038fe33b. 
> state=PENDING_OPEN, ts=1372891751912, 
> server=hemera.apache.org,39064,1372891746132 .. Cannot transit it to OFFLINE.
>       at 
> org.apache.hadoop.hbase.master.AssignmentManager.setOfflineInZooKeeper(AssignmentManager.java:1879)
>       at 
> org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1688)
>       at 
> org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1424)
>       at 
> org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1399)
>       at 
> org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1394)
>       at 
> org.apache.hadoop.hbase.master.handler.ClosedRegionHandler.process(ClosedRegionHandler.java:105)
>       at 
> org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:175)
>       at 
> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
>       at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
>       at java.lang.Thread.run(Thread.java:662)
> {code}
> This exception trace is from the failing test TestMetaReaderEditor which is 
> failing pretty frequently, but looking at the test code, I think this is not 
> a test-only issue, but affects the main code path. 
> https://builds.apache.org/job/HBase-0.94/1036/testReport/junit/org.apache.hadoop.hbase.catalog/TestMetaReaderEditor/testRetrying/



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to