Mike Percy created KUDU-1724:
--------------------------------

             Summary: Add ability to elect a leader with an "unsafe" number of 
votes
                 Key: KUDU-1724
                 URL: https://issues.apache.org/jira/browse/KUDU-1724
             Project: Kudu
          Issue Type: Bug
          Components: consensus, ops-tooling, supportability
            Reporter: Mike Percy


As a generalization of KUDU-1720, it would be useful to be able to start a 
leader election that does not require a "safe" number of votes for the leader 
to consider it successful.

Imagine that we have 5 replicas and 3 replicas are permanently offline. We know 
that the remaining 3 replicas will never come back online and there is 
currently no leader. As part of an emergency maintenance procedure, and 
administrator needs to get a leader back online so that an emergency 
configuration change can be forced (see KUDU-1721).

The administrator potentially has a couple of options:
1. Choose a leader based on how full the replica's log is, based on manual or 
automatic inspection
2. Run a leader election that only considers the remaining nodes as potential 
voters: for example, even though the full config is 5 nodes, if there are only 
2 nodes remaining then this particular election can consider there to be only 2 
valid voters.

Requiring some form of election to take place, even if it is not a "safe" 
election, is more likely to lose the minimum possible amount of data since that 
is what the election protocol is designed to do. Only the most up-to-date 
remaining replica should get elected.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to