All,
I was wondering what's the best way to get Icinga to mark a host down after some (to be defined) number of services are down.
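To make the rule concrete, here is a rough sketch in Python of the logic I have in mind. It is not Icinga configuration, and the function name, the dictionary of check results, and the threshold parameter are all made up for illustration.

```python
# Hypothetical sketch of the desired rule; none of these names come from Icinga.
def host_is_down(check_results, threshold):
    """Return True when at least `threshold` checks are failing.

    check_results maps a check name to True (check OK) or False (check failed).
    """
    failed = [name for name, ok in check_results.items() if not ok]
    return len(failed) >= threshold
```

In other words, the host state would be derived from how many of a chosen set of checks are failing, rather than from a single ping check.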
Here is my setup as an example. I monitor a group of about 40 servers, each with 8 to 10 checks that are probed via NRPE. The NRPE checks probe whether local services are running, whether a log file is being updated, and so on. In the case of the log file update check, when the process(es) that update the log file stop responding we get an alert, which is what I want; then we can go in and fix the issue. The same goes for the local service checks: if a process hangs and does not respond to the probe, we get an alert, log in, and fix the issue as needed.

What happens when a server crashes or is rebooted is unwanted: we get lots of service-down alerts and no host-down alerts. What I want to set up is this: once the ICMP, SSH, and NTP checks all stop responding, mark the host down, send out an alert, and suspend the service checks that are run via NRPE. Is this possible?

TIA/HAND
--
mark saad | [email protected]
_______________________________________________ icinga-users mailing list [email protected] https://lists.icinga.org/mailman/listinfo/icinga-users
