[ https://issues.apache.org/jira/browse/HBASE-6625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13439928#comment-13439928 ]
Jonathan Hsieh commented on HBASE-6625: --------------------------------------- I've feel that we should make merging regions a robust and constantly tested feature instead of just a script. There was some discussion about this with 0.92 becuase of HFile v2. When we have that then we can make a long running system test to merge/split/merge/split while read/write load is going on. > If we have hundreds of thousands of regions getChildren will encouter zk > exception > ----------------------------------------------------------------------------------- > > Key: HBASE-6625 > URL: https://issues.apache.org/jira/browse/HBASE-6625 > Project: HBase > Issue Type: Bug > Reporter: Zhou wenjian > Assignee: Zhou wenjian > > 2012-05-13 19:37:37,528 DEBUG > org.apache.hadoop.hbase.master.AssignmentManager$ExistsUnassignedAsyncCallback: > rs=CreateNewTableWith100000Regions,\x05\xB3\x06 > g\xE8r\xBB]\x09\xCF,1336724029944.079cb2f8a375e66fa089291b82f2a03f. > state=OFFLINE, ts=1336909053108 > 2012-05-13 19:37:37,528 DEBUG > org.apache.hadoop.hbase.master.AssignmentManager$CreateUnassignedAsyncCallback: > rs=CreateNewTableWith100000Regions,\x08s\x84\x8 > 8$7\xB1\xC4\xFCg,1336724030660.76c07780231942231013c7feb5e5eb14. > state=OFFLINE, ts=1336909055089, server=dw76.kgb.sqa.cm4,60020,1336908983944 > 2012-05-13 19:37:37,528 DEBUG > org.apache.hadoop.hbase.master.AssignmentManager$CreateUnassignedAsyncCallback: > rs=CreateNewTableWith100000Regions,\x08s\x89\xC > B\x9B\xF0\xE4\xCA\x97\xB0,1336724030660.fa38b9d8367387a64a327087cb43b3e0. > state=OFFLINE, ts=1336909055089, server=dw76.kgb.sqa.cm4,60020,1336908983944 > 2012-05-13 19:37:37,528 INFO > org.apache.hadoop.hbase.master.AssignmentManager: > dw76.kgb.sqa.cm4,60020,1336908983944 unassigned znodes=58464 of total=120002 > 2012-05-13 19:37:37,758 WARN org.apache.zookeeper.ClientCnxn: Session > 0x13745fc2c8d0001 for server dw51.kgb.sqa.cm4/10.232.98.51:2180, unexpected > error, clos > ing socket connection and attempting reconnect > java.io.IOException: Packet len4320092 is out of range! > at > org.apache.zookeeper.ClientCnxn$SendThread.readLength(ClientCnxn.java:710) > at > org.apache.zookeeper.ClientCnxn$SendThread.doIO(ClientCnxn.java:869) > at > org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1130) > 2012-05-13 19:37:37,860 WARN org.apache.hadoop.hbase.zookeeper.ZKUtil: > master:60000-0x13745fc2c8d0001 Unable to list children of znode > /hbase-new4/unassigned > org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode > = ConnectionLoss for /hbase-new4/unassigned > at > org.apache.zookeeper.KeeperException.create(KeeperException.java:90) > at > org.apache.zookeeper.KeeperException.create(KeeperException.java:42) > at org.apache.zookeeper.ZooKeeper.getChildren(ZooKeeper.java:1243) > at > org.apache.hadoop.hbase.zookeeper.ZKUtil.listChildrenAndWatchForNewChildren(ZKUtil.java:302) > > at > org.apache.hadoop.hbase.zookeeper.ZKUtil.watchAndGetNewChildren(ZKUtil.java:413) > > at > org.apache.hadoop.hbase.master.AssignmentManager.nodeChildrenChanged(AssignmentManager.java:759) > > at > org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.process(ZooKeeperWatcher.java:314) > > at > org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:530) > at > org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:506) > 2012-05-13 19:37:37,861 ERROR > org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher: > master:60000-0x13745fc2c8d0001 Received unexpected KeeperException, re-thro > wing exception > org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode > = ConnectionLoss for /hbase-new4/unassigned > at > org.apache.zookeeper.KeeperException.create(KeeperException.java:90) > at > org.apache.zookeeper.KeeperException.create(KeeperException.java:42) > at org.apache.zookeeper.ZooKeeper.getChildren(ZooKeeper.java:1243) > at > org.apache.hadoop.hbase.zookeeper.ZKUtil.listChildrenAndWatchForNewChildren(ZKUtil.java:302) > > at > org.apache.hadoop.hbase.zookeeper.ZKUtil.watchAndGetNewChildren(ZKUtil.java:413) > > at > org.apache.hadoop.hbase.master.AssignmentManager.nodeChildrenChanged(AssignmentManager.java:759) > > at > org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.process(ZooKeeperWatcher.java:314) > > at > org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:530) > at > org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:506) > 2012-05-13 19:37:37,861 FATAL org.apache.hadoop.hbase.master.HMaster: > Unexpected ZK exception reading unassigned children > org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode > = ConnectionLoss for /hbase-new4/unassigned > at > org.apache.zookeeper.KeeperException.create(KeeperException.java:90) > at > org.apache.zookeeper.KeeperException.create(KeeperException.java:42) > at org.apache.zookeeper.ZooKeeper.getChildren(ZooKeeper.java:1243) > at > org.apache.hadoop.hbase.zookeeper.ZKUtil.listChildrenAndWatchForNewChildren(ZKUtil.java:302) > > at > org.apache.hadoop.hbase.zookeeper.ZKUtil.watchAndGetNewChildren(ZKUtil.java:413) > > at > org.apache.hadoop.hbase.master.AssignmentManager.nodeChildrenChanged(AssignmentManager.java:759) > > at > org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.process(ZooKeeperWatcher.java:314) > > at > org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:530) > at > org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:506) > 2012-05-13 19:37:37,861 INFO org.apache.hadoop.hbase.master.HMaster: Aborting -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira