Hello, I have this monit job:
check program bak_sanoid with path "/usr/local/bin/sanoid
--monitor-snapshots" every 10 cycles
if content = "CRIT:" then alert
if content = "WARN:" then alert
if content != "OK:" for 2 cycles then alert
[...]
Several more of these, matching for concrete file paths.
Last night sanoid met the WARN: condition because I had manually deleted
an old snapshot, and it took another 24h until this re-appeared via the
next backup cron job:
---
Description: content match on program output:
WARN: tank/syncoid/[long path] newest daily snapshot is
3d 2h 27m 6s old (should be < 2d 12h 0m 0s)
---
In my experience, I get just *one* alert message from monit in such a case
(when the condition is first met), and another one when the condition is
ok again (see manual excerpt below).
However, in this case I got alert messages every 10 minutes (10 cycles).
Is that expected? The manual clearly says
---
Monit by default sends just one notification if a service failed and
another when/if it recovers.
---
I am not using any reminder statements. The monit version is 5.35.2. Any
idea why this happens?
cu
Gerrit
smime.p7s
Description: S/MIME cryptographic signature
