Github user ShawnWalker commented on the issue:
https://github.com/apache/accumulo/pull/121
Upon further thought, I haven't accounted for the possibility that a tablet
server might return before the master notices it had died. Such a situation
would likely happen during a rolling restart, and might also happen in the
other types of transient failures that have been discussed.
One solution which suggests itself to me is to block balancing until after
the next assignment run whenever the `LiveTServerSet` contains two distinct
tservers at the same location (host+port). But that would still leave a minor
race condition.
Thoughts?
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---