[ https://issues.apache.org/jira/browse/HBASE-4610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13164835#comment-13164835 ]
Jonathan Hsieh commented on HBASE-4610: --------------------------------------- I had started doing this also -- are you sure you want to keep the 'if (count == oldcount && count > 0) break' line? It was removed on the 0.90 version. {code} + long slept = 0; for (int oldcount = countOfRegionServers(); !this.master.isStopped();) { Thread.sleep(interval); + slept += interval; count = countOfRegionServers(); if (count == oldcount && count > 0) break; String msg; + if (count == oldcount && count >= minToStart && slept >= timeout) { + LOG.info("Finished waiting for regionserver count to settle; " + + "count=" + count + ", sleptFor=" + slept); + break; {code} Before and after test, TestMasterFailover seemed flaky for me on the 0.92 branch. Is the plan for this 0.92.0 or 0.92.1? > Port HBASE-3380 (Master failover can split logs of live servers) to 92/trunk > (definitely bring in config params, decide if we need to do more to fix the > bug) > ------------------------------------------------------------------------------------------------------------------------------------------------------------- > > Key: HBASE-4610 > URL: https://issues.apache.org/jira/browse/HBASE-4610 > Project: HBase > Issue Type: Bug > Components: master > Affects Versions: 0.92.0, 0.94.0 > Reporter: Jonathan Gray > Assignee: Jonathan Gray > Fix For: 0.92.1 > > Attachments: 4610.txt > > > Over in HBASE-3380 we were having some TestMasterFailover flakiness. We > added some more config parameters to better control the master startup loop > where it waits for RS to heartbeat in. We had thought at the time that 92 > would have a different solution but it is still relying on heartbeats to > learn about RSs. > For now, we should definitely bring these config params into 92/trunk. > Otherwise this is an incompatible regression and adding these will also make > things like what was just reported over in HBASE-4603 trivial to fix in an > optimal way. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira