Devesh Kumar Singh created HDDS-15308:
-----------------------------------------

             Summary: Improve ICR/FCR-driven container state recovery using SCM-authoritative state
                 Key: HDDS-15308
                 URL: https://issues.apache.org/jira/browse/HDDS-15308
             Project: Apache Ozone
          Issue Type: Task
          Components: Ozone Recon
    Affects Versions: 2.1.0
            Reporter: Devesh Kumar Singh
            Assignee: Devesh Kumar Singh


**Description**

Recon currently advances an existing container from `OPEN` to `CLOSING` when it
receives a healthy non-`OPEN` replica report from a DataNode (DN). The DN-report
path does not, however, converge several other stale lifecycle states on its
own; those are corrected only by the periodic targeted SCM container sync.

Enhance Recon's handling of incremental and full container reports (ICR/FCR)
for existing containers so that:

- when Recon's recorded state is `CLOSING`, Recon checks SCM's authoritative
  state and advances to `QUASI_CLOSED`, `CLOSED`, `DELETING`, or `DELETED` when
  appropriate
- when Recon's recorded state is `DELETED` but a DN reports a live replica and
  SCM still reports `QUASI_CLOSED` or `CLOSED`, Recon rebuilds the local
  container entry from SCM metadata instead of remaining permanently stale
- open-container pipeline accounting remains correct when moving
  `OPEN -> CLOSING`
- focused unit/integration coverage is added for the above ICR/FCR recovery
  cases
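
The transitions listed above can be sketched as a small decision function. This
is only an illustration of the intended behaviour, not Recon's implementation:
the class `ReconStateRecoverySketch`, the `Action` enum, and the `decide`
method are hypothetical names, the `State` enum lists only the lifecycle states
named in this issue, and a "live" replica is assumed here to mean a healthy
one. Pipeline accounting for `OPEN -> CLOSING` is left out.

```java
import java.util.EnumSet;
import java.util.Set;

// Hypothetical sketch: the class, enums, and method below are illustrative
// and are not taken from the Ozone code base.
final class ReconStateRecoverySketch {

  // Lifecycle states named in this issue.
  enum State { OPEN, CLOSING, QUASI_CLOSED, CLOSED, DELETING, DELETED }

  // What the ICR/FCR handler would do for an existing container.
  enum Action { NONE, MOVE_TO_CLOSING, ADVANCE_TO_SCM_STATE, REBUILD_FROM_SCM }

  // SCM states that a CLOSING container in Recon may be advanced to.
  private static final Set<State> CLOSING_TARGETS = EnumSet.of(
      State.QUASI_CLOSED, State.CLOSED, State.DELETING, State.DELETED);

  // SCM states that justify rebuilding a container Recon holds as DELETED.
  private static final Set<State> RECOVERABLE_SCM_STATES = EnumSet.of(
      State.QUASI_CLOSED, State.CLOSED);

  private ReconStateRecoverySketch() { }

  // Decide the action for one replica report.
  // Assumption: replicaHealthy stands in for "live" in the DELETED case.
  static Action decide(State reconState, State scmState,
      boolean replicaOpen, boolean replicaHealthy) {
    switch (reconState) {
    case OPEN:
      // Existing behaviour: a healthy non-OPEN replica moves OPEN to CLOSING.
      return (replicaHealthy && !replicaOpen)
          ? Action.MOVE_TO_CLOSING : Action.NONE;
    case CLOSING:
      // Follow SCM only when it is already past CLOSING.
      return CLOSING_TARGETS.contains(scmState)
          ? Action.ADVANCE_TO_SCM_STATE : Action.NONE;
    case DELETED:
      // Recover only when SCM still considers the container closed or
      // quasi-closed and a DN reports a live replica.
      return (replicaHealthy && RECOVERABLE_SCM_STATES.contains(scmState))
          ? Action.REBUILD_FROM_SCM : Action.NONE;
    default:
      return Action.NONE;
    }
  }

  public static void main(String[] args) {
    System.out.println(decide(State.CLOSING, State.CLOSED, false, true));
    System.out.println(decide(State.DELETED, State.QUASI_CLOSED, false, true));
  }
}
```

Keeping the decision separate from the SCM lookup and the state mutation would
let each case in the list be covered by a focused unit test without a running
SCM.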

This follow-up should stay separate from HDDS-14758, whose scope is periodic
targeted SCM-to-Recon reconciliation.


