On Jan 31, 2013, at 11:32 AM, Assaf Flatto <nag...@flatto.net> wrote:
> On 31/01/13 15:56, Alaric wrote:
>> Hi,
>>
>> I was hoping that someone on this list might have some insight to an issue
>> that I recently ran into after upgrading my Nagios core installation to
>> 3.4.4 (out of the EPEL Repo). After upgrading, log rotation stopped on one of
>> my two servers, and perfdata_file_processing_commands stopped working, and
>> whatever job sets 'CURRENT HOST STATE' in the main nagios logs stopped
>> working.
>>
>> I upgraded both my Dev server and my Production server, and only my Prod
>> server seems to have the issue. Both run the same code, and both have the
>> same configs.
>> I'm confident the configs are the same, as both get the configs deployed via
>> puppet. I've googled around quite a bit, and haven't had any luck figuring
>> it out. Has anyone seen anything similar?
>> My feeling, based on the behavior of my Dev server, is that it's not a
>> problem with the code, but that something got "stuck", but I'm darned if I
>> can figure out what. I've cleared out the logs, restarted nagios, rebooted
>> the server, audited configs and checked the debug output.
>>
>> Any help is much appreciated!
>>
>> -a
>>
> What is the difference in the volume of activity on those servers? It
> could be that you found an issue related to the amount of checks or
> traffic generated to the log.
>
> If the internal log rotation is faulty, have you considered using
> logrotate as a dirty hack to fix your issue?
>

While logrotate or a cronjob will clean up the actual files, part of what I'm trying to troubleshoot is what looks like the failure of some internal nagios processes. For example, some process normally adds entries like this to the top of the log every night:

[1359608400] CURRENT HOST STATE: example1;UP;HARD;1;FPING OK - 10.1.2.3 (loss=0%, rta=1.210000 ms)

Those entries go missing, even if I manually rotate the logs...
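For anyone who wants to check their own logs for the same symptom, here is a rough Python sketch that counts the "CURRENT HOST STATE" / "CURRENT SERVICE STATE" lines in a set of log lines. It assumes the entries look like the example above (epoch timestamp in square brackets, then the label); the function name and the second sample line are made up for illustration, so treat it as a starting point rather than anything Nagios ships.

```python
import re

# Matches lines shaped like the example in this thread:
# [1359608400] CURRENT HOST STATE: example1;UP;HARD;1;FPING OK - ...
# The SERVICE variant is assumed to follow the same layout.
STATE_LINE = re.compile(
    r"^\[(?P<ts>\d+)\] CURRENT (?P<kind>HOST|SERVICE) STATE: "
)


def count_state_entries(lines):
    """Return (host_count, service_count, first_timestamp) for the given lines.

    first_timestamp is the epoch value of the first matching entry,
    or None when no state entries are present at all.
    """
    counts = {"HOST": 0, "SERVICE": 0}
    first_ts = None
    for line in lines:
        match = STATE_LINE.match(line.strip())
        if match is None:
            continue
        counts[match.group("kind")] += 1
        if first_ts is None:
            first_ts = int(match.group("ts"))
    return counts["HOST"], counts["SERVICE"], first_ts


# Sample input: the first line is copied from this thread, the second is a
# hypothetical non-matching entry used only to show that it gets skipped.
sample = [
    "[1359608400] CURRENT HOST STATE: example1;UP;HARD;1;"
    "FPING OK - 10.1.2.3 (loss=0%, rta=1.210000 ms)",
    "[1359608401] some unrelated log entry",
]
result = count_state_entries(sample)
```

To run it against a real file, pass the open file object instead of the sample list, e.g. `count_state_entries(open(path))`, where `path` is wherever your nagios.log lives. A log that comes back with zero host entries would match what I'm seeing on Prod.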
The difference in volume seems pretty low; I've been trying to keep dev and prod as similar as possible:

Host Service Checks from Dev:
# Active Host / Service Checks: 1486 / 7219
# Passive Host / Service Checks: 0 / 0

Host Service Checks from Prod:
# Active Host / Service Checks: 1564 / 8264
# Passive Host / Service Checks: 0 / 84

_______________________________________________
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null