Devesh Kumar Singh created HDDS-13891:
-----------------------------------------
Summary: Recon - Introduce ContainerHealthTaskV2 with SCM-based
health monitoring and batch processing
Key: HDDS-13891
URL: https://issues.apache.org/jira/browse/HDDS-13891
Project: Apache Ozone
Issue Type: Task
Components: Ozone Recon
Affects Versions: 2.0.0
Reporter: Devesh Kumar Singh
Assignee: Devesh Kumar Singh
Implement ContainerHealthTaskV2, which uses SCM's ReplicationManager as the
single source of truth for container health status. This replaces the legacy
approach, in which Recon maintains its own separate health calculations. The
new task also batches its processing to improve performance on large container
sets.
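As a rough illustration of the "single source of truth" idea, the sketch below
records whatever health state an SCM-side lookup reports and does no health
calculation of its own. Everything here is hypothetical: the Health enum, the
class name, and the scmLookup function are stand-ins for a call into SCM's
ReplicationManager, not the actual Ozone API.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.LongFunction;

public class ScmSourcedHealthSketch {
  /** Hypothetical health states; the real states in Ozone may differ. */
  public enum Health { HEALTHY, UNDER_REPLICATED, OVER_REPLICATED, MISSING }

  /**
   * Collects the containers that the SCM-side lookup reports as unhealthy.
   * scmLookup is an assumed stand-in for asking SCM's ReplicationManager;
   * no health state is derived locally.
   */
  public static Map<Long, Health> collectUnhealthy(
      List<Long> containerIds, LongFunction<Health> scmLookup) {
    Map<Long, Health> unhealthy = new LinkedHashMap<>();
    for (long id : containerIds) {
      Health state = scmLookup.apply(id);
      if (state != Health.HEALTHY) {
        unhealthy.put(id, state);
      }
    }
    return unhealthy;
  }
}
```

Because the stored state is copied from the lookup result, the two views cannot
disagree about a container that was checked in the same run.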
The current ContainerHealthTask implementation has several limitations:
1. Inconsistent Health Status: Recon computes container health independently
from SCM, leading to discrepancies between Recon's view and SCM's authoritative
state
2. Missing Containers: Containers known to SCM but not yet synced to Recon
are not tracked for health issues
3. Performance Issues: Individual database operations for each container
cause excessive round-trips in large deployments
4. Data Accuracy: No guarantee that Recon's health calculations match SCM's
ReplicationManager decisions
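Limitation 3 can be illustrated with a minimal batching sketch: instead of one
database call per container, records are handed to the storage layer in
fixed-size chunks. The class name, the writeInBatches helper, and the sink
parameter are assumptions made for illustration and are not taken from the
Recon code base.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

public class BatchWriteSketch {
  /**
   * Passes records to the sink in chunks of at most batchSize elements and
   * returns how many sink calls were made. The sink is a hypothetical
   * stand-in for a bulk database insert or update.
   */
  public static <T> int writeInBatches(
      List<T> records, int batchSize, Consumer<List<T>> sink) {
    if (batchSize <= 0) {
      throw new IllegalArgumentException("batchSize must be positive");
    }
    int calls = 0;
    for (int start = 0; start < records.size(); start += batchSize) {
      int end = Math.min(start + batchSize, records.size());
      // Copy the slice so the sink never sees a view of the source list.
      sink.accept(new ArrayList<>(records.subList(start, end)));
      calls++;
    }
    return calls;
  }
}
```

With 2,500 records and a batch size of 1,000, this makes 3 sink calls instead
of 2,500 individual ones.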