[Nagios-users] [SPAM] Large installation

2012-06-11 Thread Brandino Andreas
Hi all, my nagios installation has currently 400+ hosts and around 1400 checks. As the server load grows, delays are appearing. Is any way to move a part of active checks to a second nagios server? And in that case how will these two nagios servers exchange data? If this is feasible can you poin

[Nagios-users] Large scale installation

2012-06-11 Thread Andreas Brandino
Hi all, my nagios installation has currently 400+ hosts and around 1400 checks. As the server load grows, delays are appearing. Is any way to move a part of active checks to a second nagios server? And in that case how will these two nagios servers exchange data? If this is feasible can you poin

[Nagios-users] Passive freshness checks go wrong

2012-06-11 Thread MAD
Hi, I have a Nagios 3.2.3 box on an Ubuntu server 11.04 with about 930 hosts and 28500 services. For each host, I have 5 passive checks mapped on SNMP Traps. In order to secure those checks, I set up a freshness checks using check_dummy and a freshness threshold of 15min, as I normally receive

Re: [Nagios-users] check_snmp_ibm_imm.sh Plugin Help

2012-06-11 Thread Peter . Shankland
When I run the following command, to check running it against the nagios user: su nagios -s /bin/bash -c "./check_snmp_ibm_imm.sh -H 172.29.13.16 -C 4H2KZNpX -T voltage" I do get correct output: Planar 3.3V = 3250 Planar 5V = 4900 Planar 12V = 11880 Planar VBAT = 2920 |Voltage1=3250 Voltag

Re: [Nagios-users] Large scale installation

2012-06-11 Thread Daniel Wittenberg
Take a look at mod_gearman for distributing the checks: http://labs.consol.de/lang/de/nagios/mod-gearman/ Dan On Jun 10, 2012, at 7:38 AM, Andreas Brandino wrote: Hi all, my nagios installation has currently 400+ hosts and around 1400 checks. As the server load grows, delays are appearing.

Re: [Nagios-users] Large scale installation

2012-06-11 Thread Assaf Flatto
See the documentation on distributed monitoring . Also check out check_mk and mod_gearman. Assaf On 10/06/12 13:38, Andreas Brandino wrote: Hi all, my nagios installation has currently 400+ hosts and around 1400 checks. As the server load grows, delays are appearing. Is any way to move a p

Re: [Nagios-users] Large scale installation

2012-06-11 Thread Randal, Phil
What's the spec of your nagios server? We're checking (around) 500 hosts, 4500 active, 5000 passive service with Nagios 3.4.1 in a CentOS 5.8 VM with 2GB RAM, 4 vCPUs, without problems with the help of check_mk /mk_livestatus (http://mathias-kettner.de/check_mk.html) Also using pnp4nagios and r

Re: [Nagios-users] Large scale installation

2012-06-11 Thread Paul Weaver
Doesn't it depend on how often you're performing the checks too? 1000 checks every 10 seconds is harder than 10,000 checks every hour. We have 589 hosts/3619 service on a 2 cpu 2.8GHz xeon with 1GB of ram, which does other things too. The machine is about 8-10 years old. The checks are schedule

Re: [Nagios-users] [SPAM] Large installation

2012-06-11 Thread Claudio Kuenzler
I think mod_gearman is what you're looking for: http://labs.consol.de/lang/en/nagios/mod-gearman/ Also 1400 checks is not that huge of a setup. You can also specify check_interval for certain checks which don't need to be executed every now and then (e.g. Disk Space Utilization, HDD Smart Status o

Re: [Nagios-users] Large scale installation

2012-06-11 Thread Giorgio Zarrelli
Hi, there are some key factors involved in high delays: iowait Ciao, Giorgio Il giorno 10/giu/2012, alle ore 14:38, Andreas Brandino ha scritto: > Hi all, > > my nagios installation has currently 400+ hosts and around 1400 checks. > As the server load grows, delays are appearing. > > Is

Re: [Nagios-users] Large scale installation

2012-06-11 Thread Giorgio Zarrelli
Hi, I suggest to review your installation. Try with the large installation tweaks http://nagios.sourceforge.net/docs/3_0/largeinstalltweaks.html. Then, check whether you need all your checks at 5 mins or you can move some of them to 10 mins pace. Then, review your check plugins: Perl plugins eat

Re: [Nagios-users] Large scale installation

2012-06-11 Thread Jake Xu
Also, you might want to find out the performance of your service checks. The nagios profiler is a very good tool to find execution time of individual services. http://exchange.nagios.org/directory/Plugins/Network-and-Systems-Management/Nagios/Profiler-to-check-plugin-execution-time/details On Mo

Re: [Nagios-users] Large scale installation

2012-06-11 Thread Ian Orszaczki
Great advice. Funny you should mention status.dat in ramdisk as we have hit a hiccup this morning which has meant we have lost comments and downtimes. We had moved status.dat to a ramdisk as recommended for large installations (we monitoring 3390 hosts with 18748 services from one server, latenci

Re: [Nagios-users] Large scale installation

2012-06-11 Thread Giorgio Zarrelli
Hi, You are right., open files IS a major concern I forgot to mention. A quick and dirty method to solve it is to raise the number of open files putting ulimit command folllowed by a high value in The Nagios startup script. ulimit -a will tell The current system wirde ulimit value. Lucky you,