[
https://issues.apache.org/jira/browse/HDDS-11207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17867385#comment-17867385
]
Ethan Rose commented on HDDS-11207:
-----------------------------------
There are some challenges with this:
* The replica should probably move to the quasi-closed state first: its
checksums may match locally, but it may still be missing data from other
replicas it was unable to reconcile with.
* Datanodes cannot easily distinguish between EC and Ratis containers, and
EC replicas currently cannot move to the quasi-closed state.
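
As a rough illustration of the two points above, here is a minimal sketch of the decision a datanode might make after verification. All names here (ReplicaStateSketch, ReplicationType, State, nextStateAfterVerification) are hypothetical and are not Ozone's actual classes or API. The sketch also assumes the datanode already knows the replication type, which is exactly the part that is not easy today.

```java
import java.util.Optional;

// Hypothetical sketch only; none of these types exist in Ozone under these names.
final class ReplicaStateSketch {
    enum ReplicationType { RATIS, EC }
    enum State { UNHEALTHY, QUASI_CLOSED }

    // Returns the state an UNHEALTHY replica could move to after reconciliation
    // or scanning, or empty if it should stay UNHEALTHY.
    static Optional<State> nextStateAfterVerification(ReplicationType type,
                                                      boolean allChecksumsMatch) {
        if (!allChecksumsMatch) {
            // Checksums did not verify, so the replica stays UNHEALTHY.
            return Optional.empty();
        }
        if (type == ReplicationType.EC) {
            // EC replicas currently cannot be quasi-closed, so there is
            // no safe target state for them in this sketch.
            return Optional.empty();
        }
        // Go to QUASI_CLOSED rather than straight to CLOSED: local checksums
        // match, but data from unreachable peers may still be missing.
        return Optional.of(State.QUASI_CLOSED);
    }

    public static void main(String[] args) {
        System.out.println(nextStateAfterVerification(ReplicationType.RATIS, true));
        System.out.println(nextStateAfterVerification(ReplicationType.EC, true));
    }
}
```

In this sketch the EC case simply stays UNHEALTHY; what EC replicas should actually do is still an open question.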
> Allow reconciliation and scanner to move replicas out of the UNHEALTHY state
> ----------------------------------------------------------------------------
>
> Key: HDDS-11207
> URL: https://issues.apache.org/jira/browse/HDDS-11207
> Project: Apache Ozone
> Issue Type: Sub-task
> Reporter: Ethan Rose
> Priority: Major
>
> If reconciliation completes and all checksums are verified as correct during
> the repair, or the scanner finds that all checksums are correct, there
> should be a way to move the container out of the unhealthy state. This will
> allow SCM's current replication manager implementation to fix cases such as
> containers stuck in the quasi-closed state or containers whose replicas are
> all unhealthy.