Duo Zhang created HBASE-29206:
---------------------------------

             Summary: RollingBatchSuspendResumeRsAction can not actually 
'resume' a region server
                 Key: HBASE-29206
                 URL: https://issues.apache.org/jira/browse/HBASE-29206
             Project: HBase
          Issue Type: Improvement
          Components: integration tests
            Reporter: Duo Zhang


After HBASE-28023, we can successfully suspend and resume region servers. The 
problem is that, after resuming, a region server will usually receive a 
YouAreDeadException while calling reportForDuty, and its zk session will also 
expire, so the region server aborts soon afterwards.

A possible fix is to use autostart, so that the region server restarts 
automatically. However, some restart actions in our integration tests use 
'start' instead of 'autostart', so even if we use autostart, after a 
BatchRestartRsAction we will fall back to start, which causes problems when a 
RollingBatchSuspendResumeRsAction runs next.
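
The sequence above can be illustrated with a toy model. This is only a sketch 
and not HBase code: ToyRegionServer, LaunchMode and run_scenario are 
hypothetical names, and the model assumes that a resumed server always aborts 
and that only 'autostart' brings an aborted process back.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List


class LaunchMode(Enum):
    START = "start"          # assumed: process is not relaunched after it exits
    AUTOSTART = "autostart"  # assumed: process is relaunched after it exits


@dataclass
class ToyRegionServer:
    """Hypothetical stand-in for a region server process; not HBase code."""
    mode: LaunchMode
    running: bool = True

    def restart(self, mode: LaunchMode) -> None:
        # Models a restart action: the server comes back under the given mode,
        # replacing whatever mode it was launched with before.
        self.mode = mode
        self.running = True

    def suspend_then_resume(self) -> None:
        # Models the behaviour reported in this issue: after resuming, the
        # server aborts (YouAreDeadException, expired zk session).
        self.running = False
        # Assumption: only autostart brings the aborted process back.
        if self.mode is LaunchMode.AUTOSTART:
            self.running = True


def run_scenario() -> List[bool]:
    """Return whether the server is up after each suspend/resume round."""
    rs = ToyRegionServer(mode=LaunchMode.AUTOSTART)
    alive = []
    rs.suspend_then_resume()        # launched with autostart: recovers
    alive.append(rs.running)
    rs.restart(LaunchMode.START)    # a restart action that uses 'start'
    rs.suspend_then_resume()        # now launched with start: stays down
    alive.append(rs.running)
    return alive
```

In this model the first suspend/resume round recovers, while the round that 
follows a 'start'-based restart leaves the server down, which is the 
interaction described above.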



