[ https://issues.apache.org/jira/browse/HBASE-24745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17160883#comment-17160883 ]
wenfeiyi666 commented on HBASE-24745:
-------------------------------------

I ran a simple mock test on branch-2.3 and master; everything looks normal.
{code:java}
final long initPauseTime = 1000;
int tries = 0;
long pauseTime;
while (tries < 10000) {
  try {
    throw new IllegalArgumentException();
  } catch (Exception e) {
    pauseTime = ConnectionUtils.getPauseTime(initPauseTime, tries);
    LOG.info("trie=" + tries + ", pauseTime=" + pauseTime);
    Threads.sleep(pauseTime);
    tries++;
  }
}
{code}
output
{code:java}
2020-07-20 12:18:22,331 INFO regionserver.Test(22): trie=0, pauseTime=1001
2020-07-20 12:18:23,342 INFO regionserver.Test(22): trie=1, pauseTime=2004
2020-07-20 12:18:25,348 INFO regionserver.Test(22): trie=2, pauseTime=3011
2020-07-20 12:18:28,363 INFO regionserver.Test(22): trie=3, pauseTime=5002
2020-07-20 12:18:33,368 INFO regionserver.Test(22): trie=4, pauseTime=10019
2020-07-20 12:18:43,389 INFO regionserver.Test(22): trie=5, pauseTime=20051
2020-07-20 12:19:03,444 INFO regionserver.Test(22): trie=6, pauseTime=40300
2020-07-20 12:19:43,751 INFO regionserver.Test(22): trie=7, pauseTime=100958
2020-07-20 12:21:24,717 INFO regionserver.Test(22): trie=8, pauseTime=100961
...{code}

> 'Failed report transition' logs too often
> -----------------------------------------
>
>                 Key: HBASE-24745
>                 URL: https://issues.apache.org/jira/browse/HBASE-24745
>             Project: HBase
>          Issue Type: Sub-task
>    Affects Versions: 2.3.0
>            Reporter: Michael Stack
>            Assignee: wenfeiyi666
>            Priority: Minor
>
> The parent issue fixed a backoff that was too aggressive. Now I notice we try
> too much. Saw 9k logs in 17 seconds of the below type...
> {code:java}
> 2020-07-15 14:36:23,104 INFO
> org.apache.hadoop.hbase.regionserver.HRegionServer: Failed report transition
> server { host_name: "X.example.org" port: 16020 start_code: 1594823099666 }
> transition { transition_code: CLOSED region_info { region_id:
> 1594814749475 table_name { namespace: "default" qualifier:
> "IntegrationTestBigLinkedList" } start_key: "\"\"\"\"\"\"\" " end_key:
> "#Q\352\f\003" offline: false split: false replica_id: 0 } proc_id:
> 81545 }; retry (#8888) after 200805ms delay (Master is coming online...).
> {code}
> The delay doesn't seem correct or respected.
>
>

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
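
For reference, the pauseTime values in the mock test output above (1001, 2004, 3011, 5002, 10019, 20051, 40300, 100958, 100961) look like a base pause of 1000 ms scaled by a per-attempt multiplier plus a small random jitter. The sketch below reproduces that pattern in a self-contained form. It is not the HBase implementation: the class name BackoffSketch, the multiplier table (read off the output for tries 0 through 8), the 1% jitter bound, and the clamping to the last observed multiplier for later attempts are all assumptions made for illustration.

```java
import java.util.concurrent.ThreadLocalRandom;

// Illustrative sketch only; hypothetical class, not part of HBase.
final class BackoffSketch {
  // Multipliers inferred from the observed output for tries 0..8 (assumption).
  private static final int[] OBSERVED_MULTIPLIERS = {1, 2, 3, 5, 10, 20, 40, 100, 100};

  // Assumption: at most 1% random jitter is added on top of the scaled pause.
  private static final double MAX_JITTER_FRACTION = 0.01;

  static long pauseTime(long basePauseMs, int tries) {
    // Clamp the attempt index into the table. Staying on the last observed
    // multiplier for later attempts is an assumption, not taken from the output.
    int idx = Math.max(0, Math.min(tries, OBSERVED_MULTIPLIERS.length - 1));
    long normalPause = basePauseMs * OBSERVED_MULTIPLIERS[idx];
    long jitter = (long) (normalPause
        * ThreadLocalRandom.current().nextDouble() * MAX_JITTER_FRACTION);
    return normalPause + jitter;
  }

  public static void main(String[] args) {
    // Print the pause for the first ten attempts without actually sleeping.
    for (int tries = 0; tries < 10; tries++) {
      System.out.println("tries=" + tries + ", pauseTime=" + pauseTime(1000, tries));
    }
  }
}
```

Under these assumptions, every value in the mock test output falls between the scaled pause and the scaled pause plus 1%, which is what one would expect if the backoff is being applied per attempt.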