[ 
https://issues.apache.org/jira/browse/KUDU-2257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Henke updated KUDU-2257:
------------------------------
    Labels: supportability  (was: )

> Add tool to recover a tablet that has lost a majority of replicas
> -----------------------------------------------------------------
>
>                 Key: KUDU-2257
>                 URL: https://issues.apache.org/jira/browse/KUDU-2257
>             Project: Kudu
>          Issue Type: Improvement
>            Reporter: William Berkeley
>            Priority: Major
>              Labels: supportability
>
> In the unfortunate case where a tablet has lost a majority of its replicas, 
> the way to recover is to rewrite the config of the remaining healthy members 
> so the unhealthy members are excluded, then wait for normal re-replication. 
> This involves k similar commands on the k remaining healthy servers. We could 
> simplify the process by adding a simple tool "kudu tablet 
> unsafe_promote_minority" that does the rewrite in one step.
> The tool could use ksck to try and identify the healthy minority, or it could 
> rely on the user to do that.
> The tool should also make it very clear that it may cause data loss and that 
> its use is a last resort.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to