Hao Zhang created HELIX-683: ------------------------------- Summary: Clean monitoring cache upon helix controller enable monitoring Key: HELIX-683 URL: https://issues.apache.org/jira/browse/HELIX-683 Project: Apache Helix Issue Type: Bug Reporter: Hao Zhang
We found a bug in reporting cluster status, partition masterless duration. The root cause is that the duration is calculated based on controller cache. And currently, this cache is not cleaned when leadership is changed. As a result, if controller A start a mastership handoff but was interrupted once, the start time will be kept in cache until next mastership handoff on the same partition happens. Then the later handoff duration will be calculated based on the stale start time. This could be super large. To fix it, we might consider clean cache when leadership changed. -- This message was sent by Atlassian JIRA (v7.6.3#76005)