[ https://issues.apache.org/jira/browse/HBASE-4690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13139543#comment-13139543 ]
Ted Yu commented on HBASE-4690: ------------------------------- It is pretty clear what happened in build 2384. The failure was because regions brought online wasn't in the same order as start keys are defined: {code} 2011-10-29 21:45:43,789 INFO [RS_OPEN_REGION-hemera.apache.org,37045,1319924731557-0] regionserver.HRegion(502): Onlined observed_table,kkk,1319924743536.d2bb03652b0e69a4a192be3b60f6cd78.; next sequenceid=1 ... 2011-10-29 21:45:43,883 DEBUG [Thread-183] hbase.HBaseTestingUtility(1129): Found 25 rows for table observed_table 2011-10-29 21:45:43,883 DEBUG [Thread-183] hbase.HBaseTestingUtility(1132): FirstRow=observed_table,,1319924743504.ed6d9b9f5122809fad16e61835367b48. 2011-10-29 21:45:43,887 INFO [RS_OPEN_REGION-hemera.apache.org,37045,1319924731557-1] regionserver.HRegion(502): Onlined observed_table,lll,1319924743540.ac163536355dbe1ab71ab1a9ee7a22d4.; next sequenceid=1 ... 2011-10-29 21:45:43,950 DEBUG [main-EventThread] zookeeper.ZKUtil(228): master:34047-0x13351a50a270000 Set watcher on existing znode /hbase/unassigned/ed6d9b9f5122809fad16e61835367b48 ... 2011-10-29 21:45:44,050 INFO [RS_OPEN_REGION-hemera.apache.org,45759,1319924731527-0] regionserver.HRegion(502): Onlined observed_table,,1319924743504.ed6d9b9f5122809fad16e61835367b48.; next sequenceid=1 {code} We can see the ~170ms delay between the discovery of region 1319924743504.ed6d9b9f5122809fad16e61835367b48. and its actual online. A simple patch would be to give getRSForFirstRegionInTable() some time if index returned by hbaseCluster.getServerWith() was -1. > Intermittent > TestRegionServerCoprocessorExceptionWithAbort#testExceptionFromCoprocessorDuringPut > failure > -------------------------------------------------------------------------------------------------------- > > Key: HBASE-4690 > URL: https://issues.apache.org/jira/browse/HBASE-4690 > Project: HBase > Issue Type: Test > Affects Versions: 0.92.0 > Reporter: Ted Yu > Assignee: Eugene Koontz > Fix For: 0.92.0 > > > See > https://builds.apache.org/view/G-L/view/HBase/job/HBase-0.92/83/testReport/junit/org.apache.hadoop.hbase.coprocessor/TestRegionServerCoprocessorExceptionWithAbort/testExceptionFromCoprocessorDuringPut/ > Somehow getRSForFirstRegionInTable() wasn't able to retrieve the region > server. > One fix for this issue is to spin up MiniCluster with 1 region server so that > we don't need to search for the region server where first region is hosted. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira