dlmarion commented on issue #5162: URL: https://github.com/apache/accumulo/issues/5162#issuecomment-2539389401
From what I can tell, the logic in `LiveTServerSet.checkServer` only adds the tablet server to the dead list if the tablet server path in ZooKeeper exists but there is no lock data, or if the instance does not match (like the tserver was restarted at the same host/port location). The ZooKeeper node for the tserver is deleted after 10 minutes if there is no lock data. This information is only shown on the Monitor. We could add a Monitor property that is a list of resource groups to ignore, then use that when calling `DeadServerList.getList` to skip dead servers with a matching resource group in the ServiceLockPath. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
