Hao Zhang created HELIX-683:
-------------------------------

             Summary: Clean monitoring cache upon helix controller enable 
monitoring
                 Key: HELIX-683
                 URL: https://issues.apache.org/jira/browse/HELIX-683
             Project: Apache Helix
          Issue Type: Bug
            Reporter: Hao Zhang


We found a bug in cluster status reporting: the partition masterless duration 
is reported incorrectly.


The root cause is that the duration is calculated from the controller cache, 
and this cache is currently not cleared when leadership changes. As a result, 
if controller A starts a mastership handoff but is interrupted, the start time 
stays in the cache until the next mastership handoff on the same partition. 
The duration of that later handoff is then calculated from the stale start 
time, so the reported value can be extremely large.

To fix it, we could clear the cache when leadership changes.
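
A minimal sketch of the failure mode and the proposed fix, for illustration 
only. The class and method names (MasterlessDurationTracker, onHandoffStart, 
onHandoffEnd, onLeadershipChange) are hypothetical and are not Helix's actual 
monitoring classes. The sketch also assumes the cache keeps an existing start 
time rather than overwriting it, which is one way the stale value could 
survive.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration; not Helix's actual monitoring code.
class MasterlessDurationTracker {
    // Partition name -> timestamp (ms) at which its handoff started.
    private final Map<String, Long> handoffStartMs = new HashMap<>();

    /** Records the start of a mastership handoff for a partition. */
    void onHandoffStart(String partition, long nowMs) {
        // Assumption: an already cached start time is kept, not overwritten.
        handoffStartMs.putIfAbsent(partition, nowMs);
    }

    /** Returns the masterless duration in ms, or -1 if no start is cached. */
    long onHandoffEnd(String partition, long nowMs) {
        Long start = handoffStartMs.remove(partition);
        return start == null ? -1L : nowMs - start;
    }

    /** Proposed fix: drop every cached start time on a leadership change. */
    void onLeadershipChange() {
        handoffStartMs.clear();
    }
}
```

If an interrupted handoff leaves a start time of 1000 ms in the cache and the 
next handoff on the same partition runs from 900000 ms to 900500 ms, the 
sketch reports 899500 ms. Calling onLeadershipChange() between the two 
handoffs makes it report 500 ms instead.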



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)