Sergey Shelukhin created HBASE-21767:
----------------------------------------

             Summary: findRegionsToForceFlush could be improved
                 Key: HBASE-21767
                 URL: https://issues.apache.org/jira/browse/HBASE-21767
             Project: HBase
          Issue Type: Improvement
            Reporter: Sergey Shelukhin
         Attachments: image-2019-01-23-16-07-42-166.png

We see Too many WALs message logged a lot, and it seems like it usually fails 
to bring the WAL count back down for a long time, sometimes hours. This results 
in large steady state WAL volume on some region servers, and slower recovery.
Based on the functionality in HBASE-21626, it should be possible to add an 
option to be aggressive in this method, and to determine the minimum set of 
regions that will actually bring WAL count close to a limit in one operation.

An example of too-many-WALs log statements reporting WAL count over ~2.5 hours, 
with a limit of 79. You can see when the count is coming down; it often doesn't 
come anywhere close to the limit so another flush is requested, etc. for a long 
time.

 !image-2019-01-23-16-07-42-166.png! 




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to