[ 
https://issues.apache.org/jira/browse/HDDS-11207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17867385#comment-17867385
 ] 

Ethan Rose commented on HDDS-11207:
-----------------------------------

There are some challenges with this:
* The replica should probably move to quasi-closed state first, since its 
checksums may match locally but it still may be missing data from other 
replicas it was unable to reconcile with.
* Datanodes cannot easily distinguish between EC and Ratis containers. 
Currently EC replicas cannot move to quasi-closed state.

> Allow reconciliation and scanner to move replicas out of the UNHEALTHY state
> ----------------------------------------------------------------------------
>
>                 Key: HDDS-11207
>                 URL: https://issues.apache.org/jira/browse/HDDS-11207
>             Project: Apache Ozone
>          Issue Type: Sub-task
>            Reporter: Ethan Rose
>            Priority: Major
>
> If reconciliation completes and all checksums are verified correct during the 
> repair, or the scanner identifies that all the checksums are correct, it 
> should have a way to move the container out of the unhealthy state. This will 
> allow SCM's current replication manager implementation to fix cases like 
> quasi-closed stuck containers or all unhealthy replicas.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to