[ https://issues.apache.org/jira/browse/HBASE-19925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16351229#comment-16351229 ]
Ted Yu commented on HBASE-19925:
--------------------------------

It seems isPeerEnabled() can call getPeer() first.
If the return value is null, it should not proceed to calling getStatusOfPeer().

Do you want to provide a patch?

Thanks

> Delete an unreachable peer will triggers all regionservers abort
> ----------------------------------------------------------------
>
>                 Key: HBASE-19925
>                 URL: https://issues.apache.org/jira/browse/HBASE-19925
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Yun Zhao
>            Priority: Critical
>
> Add an unreachable peer:
> {code:java}
> add_peer '4', CLUSTER_KEY => "server1.cie.com:2181:/hbase"{code}
> Delete it after a while. The regionserver then logs the following and stops:
> {code:java}
> 2018-02-02 20:04:25,959 INFO [main-EventThread.replicationSource,4] regionserver.ReplicationSource: Replicating 5467de52-dc46-45be-902c-110dd7a83e06 -> null
> 2018-02-02 20:04:25,960 ERROR [main-EventThread.replicationSource,4.replicationSource.xxxx.com%2C16020%2C1515498473547.default,4] regionserver.ReplicationSource: Unexpected exception in ReplicationSourceWorkerThread, currentPath=null
> java.lang.IllegalArgumentException: Peer with id= 4 is not connected
>   at org.apache.hadoop.hbase.replication.ReplicationPeersZKImpl.getStatusOfPeer(ReplicationPeersZKImpl.java:207)
>   at org.apache.hadoop.hbase.replication.regionserver.ReplicationSource.isPeerEnabled(ReplicationSource.java:327)
>   at org.apache.hadoop.hbase.replication.regionserver.ReplicationSource$ReplicationSourceWorkerThread.run(ReplicationSource.java:512)
> 2018-02-02 20:04:25,960 INFO [main-EventThread.replicationSource,4.replicationSource.xxxx.com%2C16020%2C1515498473547.default,4] regionserver.HRegionServer: STOPPED: Unexpected exception in ReplicationSourceWorkerThread{code}
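
For illustration, the guard suggested in the comment (check getPeer() for null before calling getStatusOfPeer()) might look roughly like the sketch below. This is not the HBase source: PeerGuardSketch, Peer, Peers, connect() and remove() are hypothetical stand-ins, and only the method names getPeer(), getStatusOfPeer() and isPeerEnabled() and the exception message are taken from the stack trace above.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical stand-in types; they only mimic the behaviour seen in the log.
final class PeerGuardSketch {

    interface Peer {
        boolean isEnabled();
    }

    static final class Peers {
        private final Map<String, Peer> connected = new ConcurrentHashMap<>();

        void connect(String id, boolean enabled) {
            connected.put(id, () -> enabled);
        }

        void remove(String id) {
            connected.remove(id);
        }

        // Returns null when the peer is unknown or already deleted.
        Peer getPeer(String id) {
            return connected.get(id);
        }

        // Throws for a missing peer, like the call in the stack trace.
        boolean getStatusOfPeer(String id) {
            Peer peer = connected.get(id);
            if (peer == null) {
                throw new IllegalArgumentException("Peer with id= " + id + " is not connected");
            }
            return peer.isEnabled();
        }
    }

    // Guarded check: a missing peer is reported as "not enabled"
    // instead of letting the exception reach the worker thread.
    static boolean isPeerEnabled(Peers peers, String peerId) {
        if (peers.getPeer(peerId) == null) {
            return false;
        }
        return peers.getStatusOfPeer(peerId);
    }
}
```

In this sketch a peer could still be removed between the two calls, so an actual patch may need to handle that window as well.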