Hi Tom, I've tried to answer your questions to the best of my own personal knowledge -I have replaced any of your original "*" symbols with my own on all my comments/thoughts below, since my MS outlook client apparently just sucks, so this appears more readable.
Regards, jamie -----Original Message----- From: Kustner, Tom [mailto:tom.kust...@retirementpartner.com] Sent: Friday, June 19, 2009 5:35 PM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] How often does Nagios need restarting? (Quiscustodiet ipsos custodes?) I am a Nagios user, not the administrator. We are running Nagios 2.9 on RHEL 4 or 5. Overall, 200+ hosts with 3000 services being monitored. I have access for monitoring a smaller number of hosts. * ok, understood... In another posting, I alluded to an issue where a host had gone down but no alert was sent out. The issue surfaced again today and as was done the other time, Nagios was restarted to "fix" the problem. I am naturally concerned about the unreliability. * did you get any on-list or off-list replies at all? You have not mentioned if you had it resolved or not, but it sound like the answer is no to possibly both(?) Any thoughts on this problem? Specifically: What are best practices for making sure Nagios does not fall down on the job? Is there something not set right? * Understanding your setup and the way nagios works is how you ensure it stands up... a mis-config sounds likely, but who knows... Are other Nagios administrators restarting Nagios on a weekly or nightly basis to keep it on the job? * Heck no! That's why we run it on Linux or Solaris! :) Is this an issue specific to Nagios 2.9? Was 2.9 a spotty version? *Not to my knowledge - all stable releases have worked very reliably here, especially 2.9 now that I look back... For a given host, why would "active checks" be enabled, yet "N/A" appears in the "Next Active Check" field? * RTM - host checks are not always performed unless service checks fail, and since I've been a manual-slacker myself, that may not even be the true correct answer (Marc? :) Thanks for any help. -Tom Kustner- * Not to sound negative/condescending or anything like that, but your install will truly only work as well as you have maintained it/understand it. You should really look at your current config files and read the manual on 2.9, or upgrade to 3.x and again rtm... Also, you have not sent anything specific related to your problematic config(s) for anyone on this list to even guess either way whether or not something is mis-configured. If you are concerned about posting your configs/setup, change stuff properly to hide what you need to on-list. (I apologize if I have missed your earlier posting. Many here try our best to help people here when possible, but sometimes we are all busy at the same time, who knows!?). Cheers, Jamie ------------------------------------------------------------------------------ Are you an open source citizen? Join us for the Open Source Bridge conference! Portland, OR, June 17-19. Two days of sessions, one day of unconference: $250. Need another reason to go? 24-hour hacker lounge. Register today! http://ad.doubleclick.net/clk;215844324;13503038;v?http://opensourcebridge.org _______________________________________________ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null