[ https://issues.apache.org/jira/browse/HBASE-14190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14662447#comment-14662447 ]
Ted Yu commented on HBASE-14190: -------------------------------- Did some more cluster testing (based on patch v2): 1. dropped server column for namespace table: {code} hbase(main):001:0> delete 'hbase:meta', 'hbase:namespace,,1436988213845.553f6143557b29a57346c6172a401953.', 'info:server' 0 row(s) in 0.4580 seconds {code} 2. restarted master 3. in master log, observed the following: {code} 2015-08-07 20:56:59,184 DEBUG [bloomberg-perf-eval-test-2:20000.activeMasterManager] master.HMaster: Assigning hbase:namespace to some server {code} Master came up fine. Checking test failure in patch v3. > Assign system tables ahead of user region assignment > ---------------------------------------------------- > > Key: HBASE-14190 > URL: https://issues.apache.org/jira/browse/HBASE-14190 > Project: HBase > Issue Type: Bug > Reporter: Ted Yu > Assignee: Ted Yu > Priority: Critical > Attachments: 14190-v1.txt, 14190-v2.txt, 14190-v3.txt > > > Currently the namespace table region is assigned like user regions. > I spent several hours working with a customer where master couldn't finish > initialization. > Even though master was restarted quite a few times, it went down with the > following: > {code} > 2015-08-05 17:16:57,530 FATAL [hdpmaster1:60000.activeMasterManager] > master.HMaster: Master server abort: loaded coprocessors are: [] > 2015-08-05 17:16:57,530 FATAL [hdpmaster1:60000.activeMasterManager] > master.HMaster: Unhandled exception. Starting shutdown. > java.io.IOException: Timedout 300000ms waiting for namespace table to be > assigned > at > org.apache.hadoop.hbase.master.TableNamespaceManager.start(TableNamespaceManager.java:104) > at org.apache.hadoop.hbase.master.HMaster.initNamespace(HMaster.java:985) > at > org.apache.hadoop.hbase.master.HMaster.finishActiveMasterInitialization(HMaster.java:779) > at org.apache.hadoop.hbase.master.HMaster.access$500(HMaster.java:182) > at org.apache.hadoop.hbase.master.HMaster$1.run(HMaster.java:1646) > at java.lang.Thread.run(Thread.java:744) > {code} > During previous run(s), namespace table was created, hence leaving an entry > in hbase:meta. > The following if block in TableNamespaceManager#start() was skipped: > {code} > if (!MetaTableAccessor.tableExists(masterServices.getConnection(), > TableName.NAMESPACE_TABLE_NAME)) { > {code} > TableNamespaceManager#start() spins, waiting for namespace region to be > assigned. > There was issue in master assigning user regions. > We tried issuing 'assign' command from hbase shell which didn't work because > of the following check in MasterRpcServices#assignRegion(): > {code} > master.checkInitialized(); > {code} > This scenario can be avoided if we assign hbase:namespace table after > hbase:meta is assigned but before user table region assignment. -- This message was sent by Atlassian JIRA (v6.3.4#6332)