[ 
https://issues.apache.org/jira/browse/HBASE-18131?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16029816#comment-16029816
 ] 

Enis Soztutar commented on HBASE-18131:
---------------------------------------

I think the root cause of this is not that servers stay in the dead-servers 
list indefinitely. They will be removed from this list; however, due to 
HBASE-14223, the master thinks that the server is dead-but-not-processed even 
after restart. 
 

> Add an hbase shell command to clear deadserver list in ServerManager
> --------------------------------------------------------------------
>
>                 Key: HBASE-18131
>                 URL: https://issues.apache.org/jira/browse/HBASE-18131
>             Project: HBase
>          Issue Type: New Feature
>          Components: Operability
>            Reporter: Yu Li
>            Assignee: Yu Li
>             Fix For: 2.0.0, 1.4.0
>
>
> Currently, if a regionserver is aborted due to a fatal error or stopped by 
> an operator on purpose, it is added to the {{ServerManager#deadservers}} list 
> and shown under "Dead Servers" in the master UI. This is a valid warning that 
> lets operators notice self-aborted servers and run a sanity check to 
> avoid further issues. However, after the necessary checks, even if the 
> operator is sure that the node is decommissioned (such as for repair), there 
> is no way to clear the dead server list except restarting the master. See 
> more details in 
> [this 
> discussion|http://mail-archives.apache.org/mod_mbox/hbase-user/201705.mbox/%3CCAM7-19%2BD4MLu2b1R94%2BtWQDspjfny2sCy4Qit8JtCgjvTOZzzg%40mail.gmail.com%3E]
>  on the mailing list.
> Here we propose adding an hbase shell command that lets advanced users clear 
> the dead server list in {{ServerManager}}; the command should be 
> executed with caution.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
