Dale LaBossiere created EDGENT-396: -------------------------------------- Summary: JobMonitorApp restarts job 3 times more than it should Key: EDGENT-396 URL: https://issues.apache.org/jira/browse/EDGENT-396 Project: Edgent Issue Type: Bug Components: Runtime Reporter: Dale LaBossiere Assignee: Dale LaBossiere
N.B. I don't think this bug affects IotProvider since I'm pretty sure that while IotProvider includes JobMonitorApp, it doesn't register a JobRegistryService so the monitor does nothing. JIRA forthcoming for that. JobMonitorAppTest exercises the app but it doesn't perform any validation that restarts were actually happening. Adding instrumentation / validation highlights that 3x the number of rebuilds/restarts are happening. {code} appOne: buildCnt: 7 injectedFailureCnt: 2 appTwo: buildCnt: 10 injectedFailureCnt: 3 {code} Further investigation identifies the JobMonitorApp's job event filtering as the problem. Each "failed" job ends up with 3 events that pass through the filter {code} RUNNING, RUNNING, UNHEALTHY RUNNING, CLOSED, UNHEALTHY CLOSED, CLOSED, UNHEALTHY {code} ... or something like that -- This message was sent by Atlassian JIRA (v6.3.15#6346)