Github user ShawnWalker commented on the issue:

    https://github.com/apache/accumulo/pull/121
  
    Upon further thought, I haven't accounted for the possibility that a tablet 
server might return before the master notices it had died.  Such a situation 
would likely happen during a rolling restart, and might also happen in the 
other types of transient failures that have been discussed.
    
    One solution which suggests itself to me is to block balancing until after 
the next assignment run whenever the `LiveTServerSet` contains two distinct 
tservers at the same location (host+port).  But that would still leave a minor 
race condition.
    
    Thoughts?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

Reply via email to