Liu Shaohui created HBASE-14237:
-----------------------------------

             Summary: Meta region may be onlined on multi regonservers for bugs 
of assigning meta
                 Key: HBASE-14237
                 URL: https://issues.apache.org/jira/browse/HBASE-14237
             Project: HBase
          Issue Type: Bug
    Affects Versions: 0.94.11
            Reporter: Liu Shaohui
            Assignee: Liu Shaohui
            Priority: Critical


When a regionserver failed to open the meta region and crash after setting the 
RS_ZK_REGION_FAILED_OPEN state of meta region in zookeeper, the master will 
handle the event of RS_ZK_REGION_FAILED_OPEN and try to assign the meta region 
again in AssignmentManager#handleRegion. But at the same time, the master will 
handle the regionserver expired event and start a MetaServerShutdownHandler for 
the regionserver, because the servername of regionserver is same as the 
servername of the unassigned node of meta region. In the 
MetaServerShutdownHandler, the meta region may be assigned for second time.

[~heliangliang]
We have encountered this problem in our production cluster which resulted in 
inconsistency of region location in meta table. You can see the log from the 
attachment.

The code of AssignmentManager is so complex and I have not get a solution to 
fix this problem. Could someone kindly help to give some suggestions? Thanks




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to