Re: [Nagios-users] Alleviating Nagios i/o contention problem
2010/9/25 Frost, Mark {PBC} : > Greetings, listers, > > We've got an on-going issue with i/o contention. There's the obvious > problem that we've got a whole lot of things all writing to the same > partition. In this case, there's just one big chunk of RAID 5 disk on a > single controller so I don't believe that making more partitions is going to > help. > > On this same partition we have: > > 1) Nagios 3.2.1 running as the central/reporting server for a couple of > other Nagios nodes that are sending check results via NSCA. Approximately > 6-7K checks. > > 2) pnp4nagios 0.6.2 (with rrd 1.4.2) writing graph data. > > There's a 2nd server configured identically to the first that's acting as a > "hot spare" so it also receives check data from the 2 distributed nodes and > writes its own copy of the graph data locally as well. > > At the moment I'm concerned about the graphdata, but because I can only see > i/o utilization as an aggregate, I can't tell what is the worst component on > that filesystem -- status.dat updates? graph data? writes to the var/spool > directory? We also look at continued growth so this is only going to get > worse. > > These systems are quite lightly loaded from a CPU (2 dual-core CPUs) and > memory (4GB) perspective, but the i/o to the nagios filesystem is queuing > now. > > We're about to order new hardware for these servers and I want to make a > reasonable choice. I'd like to make some reasonable changes without > requiring too exotic of a setup. I believe these servers are currently Dell > 2950s and they're all running Suse Linux 10.3 SP2. > > My first thought was to potentially move the graphs to a NAS share which > would shift that i/o to the network. I don't know how that would work > though and it would ultimately be an experiment. > > What experiences do people out there have handling this kind of i/o and what > have you done to ease it? > > > Thanks very much! > > Mark > Depending on the kernel version used, you could use iotop to check what processes are the top I/O consumers... Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- -- Start uncovering the many advantages of virtual appliances and start using them to simplify application deployment and accelerate your shift to cloud computing. http://p.sf.net/sfu/novell-sfdev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NRPE compilation on HPUX
2010/2/11 Frost, Mark {PBG} : > > > It's kind of strange, but I know I had the problem you're having at one > point, but I no longer seem to have it on HPUX 11.31 (IA64). I wonder if > there was some OS patch that was applied that took care of that. The OS > level on our servers is shown as "HP-UX B.11.31 U ia64 1200835863" if that > helps. swlist does show that we have an installed tcp wrapper package, > however: > > > > HPUX-TCPWRAP B.11.31 HPUX TCPWrapper > daemon > > > > 'nm' shows that there is a fromhost() entity in /usr/lib/libwrap.sl > (libwrap.1) > > > > Unless I'm mistaken, your issue is with the libwrap library which I think > exists on HPUX, but isn't complete as far as NRPE is concerned. I never > spent a lot of time trying to find a generalized solution to this, but > instead kind of hacked something into nrpe.c where I think I just commented > out some of the #ifdef HAVE_LIBWRAP code segments. Note that the fromhost() > call you're missing is in one of the HAVE_LIBRWRAP sections. I believe that > NRPE expects that . We don’t use TCP wrappers anyway, so it's not a loss. > > > > I first encountered this issue with PA-RISC HPUX and I believe the way I got > around was to download the latest tcp wrapper code from the internet, then > pulled out the function that I was missing (I don't recall if it was from > host > > > > My current issue with NRPE on HPUX is twofold -- the lack of some syslog > capabilities, and that the configure script doesn't correctly find our SSL > libraries. I have made the following 2 hacky changes to make ours work: > > > > > > *** configure.old Mon Mar 31 13:49:11 2008 > > --- configure Thu Feb 11 11:07:10 2010 > > *** > > *** 6676,6686 > > fi > > done > > > > ! if test x_$found_ssl != x_yes; then > > ! { { echo "$as_me:$LINENO: error: Cannot find ssl > libraries" >&5 > > ! echo "$as_me: error: Cannot find ssl libraries" >&2;} > > ! { (exit 1); exit 1; }; } > > ! else > > printf "SSL libraries found in $ssllibdir\n"; > > > > LDFLAGS="$LDFLAGS -L$ssllibdir"; > > --- 6676,6686 > > fi > > done > > > > ! # if test x_$found_ssl != x_yes; then > > ! # { { echo "$as_me:$LINENO: error: Cannot find ssl > libraries" >&5 > > ! #echo "$as_me: error: Cannot find ssl libraries" >&2;} > > ! # { (exit 1); exit 1; }; } > > ! # else > > printf "SSL libraries found in $ssllibdir\n"; > > > > LDFLAGS="$LDFLAGS -L$ssllibdir"; > > *** > > *** 6701,6707 > > fi > > # awk to strip off meta data at bottom of dhparam > output > > $sslbin dhparam -C 512 | awk '/^-/ {exit} > {print}' > include/dh.h > > ! fi > > fi > > > > echo "$as_me:$LINENO: checking for Kerberos include files" >>&5 > > --- 6701,6707 > > fi > > # awk to strip off meta data at bottom of dhparam > output > > $sslbin dhparam -C 512 | awk '/^-/ {exit} > {print}' > include/dh.h > > ! # fi > > fi > > > > echo "$as_me:$LINENO: checking for Kerberos include files" >>&5 > > > > > > > > > > > > *** src/nrpe.c.old Mon Mar 31 14:02:59 2008 > > --- src/nrpe.c Thu Feb 11 11:18:34 2010 > > *** > > *** 613,622 > > log_facility=LOG_UUCP; > > else if(!strcmp(varvalue,"cron")) > > log_facility=LOG_CRON; > > ! else if(!strcmp(varvalue,"authpriv")) > > log_facility=LOG_AUTHPRIV; > > else if(!strcmp(varvalue,"ftp")) > > ! log_facility=LOG_FTP; > > else if(!strcmp(varvalue,"local0")) > > log_facility=LOG_LOCAL0; > > else if(!str
Re: [Nagios-users] nagios web configurator
2010/3/4 wale Ajayi : > Hello every one, i need a very good web interface configurator for my > nagios3.0.6 can anybody help me to get one , thanks very much. > NagiosQL? -- Ciro Iriarte http://cyruspy.wordpress.com -- -- Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NRPE compilation on HPUX
2010/2/11 Max : > On Wed, Feb 10, 2010 at 5:30 PM, Ciro Iriarte wrote: >> Hi, i'm getting issues compiling NRPE on HPUX 11.31, do you now what >> missing library can cause this? > > Colleague of mine used the NRPE package here: > > http://mayoxide.com/naghpux/ > > and said it works well. > > - max > I used his build notes, and got that errors... Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- -- SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NRPE compilation on HPUX
Hi, i'm getting issues compiling NRPE on HPUX 11.31, do you now what missing library can cause this? - utils.c: ld: Unsatisfied symbol "fromhost" in file nrpe.o 1 errors. *** Error exit code 1 Stop. *** Error exit code 1 Stop. - Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- -- SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios 3.2 awareness latency?
2009/11/19 Ciro Iriarte : > Hi, i'm having a weird issue with nagios 3.2. Sometimes it takes up to > an hour to make it aware of a server being up again after maintenance > or a hardware issue. Our checks are scheduled to run every 1 minute or > 5 minutes at most. Even if you force the check it's not done! > > We have around 300 hosts and 1800 services and the nagios server > resources are fine (near 800MB free RAM and 90% idle CPU). Any ideas? > > Regards, > CI.- > Bump I think I should have called the thread "long gap between checks". Even if you force it, the check is not done when this delay occur... -- Ciro Iriarte http://cyruspy.wordpress.com -- -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Nagios 3.2 awareness latency?
Hi, i'm having a weird issue with nagios 3.2. Sometimes it takes up to an hour to make it aware of a server being up again after maintenance or a hardware issue. Our checks are scheduled to run every 1 minute or 5 minutes at most. Even if you force the check it's not done! We have around 300 hosts and 1800 services and the nagios server resources are fine (near 800MB free RAM and 90% idle CPU). Any ideas? Regards, CI.- -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NRPE gives wrong exit codes
2009/9/4 Marc Powell : > > On Sep 4, 2009, at 2:30 PM, Ciro Iriarte wrote: > >>> It's weird, if I restart the daemon it works, but just for the >>> first execution. >>> >>> - >>> spmon:/etc/nagios/objects/services # >>> /usr/lib/nagios/plugins/check_nrpe -H billbd2 -c check_disk -a 90% >>> 80% >>> /kml_inst2 >>> DISK CRITICAL - free space: /kml_inst2 76172 MB (21% inode=100%);| >>> /kml_inst2=273552MB;34972;69944;0;349725 >>> spmon:/etc/nagios/objects/services # echo $? >>> 2 >>> spmon:/etc/nagios/objects/services # >>> /usr/lib/nagios/plugins/check_nrpe -H billbd2 -c check_disk -a 90% >>> 80% >>> /kml_inst2 >>> DISK CRITICAL - free space: /kml_inst2 76172 MB (21% inode=100%);| >>> /kml_inst2=273552MB;34972;69944;0;349725 >>> spmon:/etc/nagios/objects/services # echo $? >>> 0 >>> -- >>> >>> Regards, >>> >> >> Any ideas?. > > Never seen it before. > > Don't test as root. > > Try using truss on check_nrpe to verify that you're getting the right > exit code back from the remote host. > > -- > Marc > Apparently is a Solaris 10 service management bug, if I run it manually, it works in each execution. Also there's another (less critical) server with more current patches that works just fine. Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NRPE gives wrong exit codes
2009/9/4 Marc Powell : > > On Sep 4, 2009, at 2:30 PM, Ciro Iriarte wrote: > >>> It's weird, if I restart the daemon it works, but just for the >>> first execution. >>> >>> - >>> spmon:/etc/nagios/objects/services # >>> /usr/lib/nagios/plugins/check_nrpe -H billbd2 -c check_disk -a 90% >>> 80% >>> /kml_inst2 >>> DISK CRITICAL - free space: /kml_inst2 76172 MB (21% inode=100%);| >>> /kml_inst2=273552MB;34972;69944;0;349725 >>> spmon:/etc/nagios/objects/services # echo $? >>> 2 >>> spmon:/etc/nagios/objects/services # >>> /usr/lib/nagios/plugins/check_nrpe -H billbd2 -c check_disk -a 90% >>> 80% >>> /kml_inst2 >>> DISK CRITICAL - free space: /kml_inst2 76172 MB (21% inode=100%);| >>> /kml_inst2=273552MB;34972;69944;0;349725 >>> spmon:/etc/nagios/objects/services # echo $? >>> 0 >>> -- >>> >>> Regards, >>> >> >> Any ideas?. > > Never seen it before. > > Don't test as root. > > Try using truss on check_nrpe to verify that you're getting the right > exit code back from the remote host. > > -- > Marc I'm running linux on the nagios end I ran truss attached to the NRPE service. This are two executions, the first one from the check that works and the second one with the critical message + exit code=0. Exit code = 2 - Fine - 25565: stat("/kml_inst2", 0x0003CB28) = 0 25565: stat("/kml_inst2", 0x0003CB28) = 0 25565: statvfs("/kml_inst2", 0xFFBFFAA8) = 0 25565: ioctl(1, TCGETA, 0xFFBFED5C)Err#22 EINVAL 25565: fstat64(1, 0xFFBFEDD0) = 0 25565: fstat64(1, 0xFFBFEC78) = 0 25565: write(1, " D I S K C R I T I C A".., 108) = 108 25565: _exit(2) 25561: read(6, " D I S K C R I T I C A".., 5120) = 108 25563: waitid(P_PID, 25565, 0xFFBFFA28, WEXITED|WTRAPPED|WNOWAIT) = 0 25563: ioctl(0, TIOCGPGRP, 0xFFBFFA44) Err#6 ENXIO 25563: getpgid(25565) = 25561 25563: ioctl(0, TIOCGPGRP, 0xFFBFFA44) Err#6 ENXIO 25563: waitid(P_PID, 25565, 0xFFBFFA28, WEXITED|WTRAPPED) = 0 25563: brk(0x0003AA20) = 0 25563: _exit(2) 25561: read(6, 0x000B2DBC, 5120) = 0 25561: write(8, " D I S K C R I T I C A".., 108) = 108 25561: llseek(6, 0, SEEK_CUR) Err#29 ESPIPE 25561: close(6)= 0 25561: waitid(P_PID, 25563, 0xFFBFC560, WEXITED|WTRAPPED) = 0 25561: close(8)= 0 25561: alarm(0)= 60 25561: lseek(1, 0, SEEK_CUR) = 0 25561: lseek(2, 0, SEEK_CUR) = 0 25561: lseek(2, 0, SEEK_CUR) = 0 25561: lseek(1, 0, SEEK_CUR) = 0 25561: lseek(2, 0, SEEK_CUR) = 0 25561: lseek(2, 0, SEEK_CUR) = 0 25561: write(5, " 005020102 B\0", 7) = 7 25561: close(5)= 0 25561: close(3)= 0 25561: _exit(2) -- Exit code 0 - Wrong! -- 1383: stat("/kml_inst2", 0x0003CB28) = 0 1383: stat("/kml_inst2", 0x0003CB28) = 0 1383: statvfs("/kml_inst2", 0xFFBFFAA8) = 0 1383: ioctl(1, TCGETA, 0xFFBFED5C)Err#22 EINVAL 1383: fstat64(1, 0xFFBFEDD0) = 0 1383: fstat64(1, 0xFFBFEC78) = 0 1383: write(1, " D I S K C R I T I C A".., 108) = 108 1383: _exit(2) 1379: read(6, " D I S K C R I T I C A".., 5120) = 108 1381: waitid(P_PID, 1383, 0xFFBFFA28, WEXITED|WTRAPPED|WNOWAIT) = 0 1381: ioctl(0, TIOCGPGRP, 0xFFBFFA44) Err#6 ENXIO 1381: getpgid(1383) = 1379 1381: ioctl(0, TIOCGPGRP, 0xFFBFFA44) Err#6 ENXIO 1381: waitid(P_PID, 1383, 0xFFBFFA28, WEXITED|WTRAPPED) = 0 1381: brk(0x0003AA20) = 0 1381: _exit(2) 1379: read(6, 0x000B2DBC, 5120) = 0 1379: write(8, " D I S K C R I T I C A".., 108) = 108 1379: llseek(6, 0, SEEK_CUR) Err#29 ESPIPE 1379: close(6)= 0 1379: waitid(P_PID, 1381, 0xFFBFC560, WEXITED|WTRAPPED) = 0 1379: close(8)= 0 1379: alarm(0)
Re: [Nagios-users] NRPE gives wrong exit codes
2009/9/3 Ciro Iriarte : > 2009/9/3 Ciro Iriarte : >> 2009/9/3 Morris, Patrick : >>> On Thu, 03 Sep 2009, Ciro Iriarte wrote: >>> >>>> Hi, i'm trying to monitor a Solaris box using NRPE. The thing is >>>> nagios sees always an EXIT_CODE=0. >>>> >>>> Running the check by hand on the Solaris host works as expected: >>>> >>>> -- >>>> [solaris ~]$ /usr/local/nagios/libexec/check_disk -w 50% -c 10% -p /kml_ >>>> DISK WARNING - free space: /kml_inst2 101125 MB (28% inode=100%);| >>>> /kml_inst2=248599MB;174862;314752;0;349725 >>>> [solaris ~]$ echo $? >>>> 1 >>>> -- >>>> >>>> But running it from the nagios host I get: >>>> - >>>> spmon:~ # /usr/lib/nagios/plugins/check_nrpe -H solaris -c check_disk >>>> -a 50% 10% /test >>>> DISK WARNING - free space: /test 101125 MB (28% inode=100%);| >>>> /test=248599MB;174862;314752;0;349725 >>>> spmon:~ # echo $? >>>> 0 >>>> - >>> >>> How is check_disk defined in your nrpe config and is it configured to >>> allow arguments? >>> >> >> I have: >> >> >> dont_blame_nrpe=1 >> command[check_disk]=/usr/local/nagios/libexec/check_disk -w $ARG1$ -c >> $ARG2$ -p $ARG3$ >> >> >> Regards, >> > > It's weird, if I restart the daemon it works, but just for the first > execution. > > - > spmon:/etc/nagios/objects/services # > /usr/lib/nagios/plugins/check_nrpe -H billbd2 -c check_disk -a 90% 80% > /kml_inst2 > DISK CRITICAL - free space: /kml_inst2 76172 MB (21% inode=100%);| > /kml_inst2=273552MB;34972;69944;0;349725 > spmon:/etc/nagios/objects/services # echo $? > 2 > spmon:/etc/nagios/objects/services # > /usr/lib/nagios/plugins/check_nrpe -H billbd2 -c check_disk -a 90% 80% > /kml_inst2 > DISK CRITICAL - free space: /kml_inst2 76172 MB (21% inode=100%);| > /kml_inst2=273552MB;34972;69944;0;349725 > spmon:/etc/nagios/objects/services # echo $? > 0 > -- > > Regards, > Any ideas?. -- Ciro Iriarte http://cyruspy.wordpress.com -- -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NRPE gives wrong exit codes
2009/9/3 Ciro Iriarte : > 2009/9/3 Morris, Patrick : >> On Thu, 03 Sep 2009, Ciro Iriarte wrote: >> >>> Hi, i'm trying to monitor a Solaris box using NRPE. The thing is >>> nagios sees always an EXIT_CODE=0. >>> >>> Running the check by hand on the Solaris host works as expected: >>> >>> -- >>> [solaris ~]$ /usr/local/nagios/libexec/check_disk -w 50% -c 10% -p /kml_ >>> DISK WARNING - free space: /kml_inst2 101125 MB (28% inode=100%);| >>> /kml_inst2=248599MB;174862;314752;0;349725 >>> [solaris ~]$ echo $? >>> 1 >>> -- >>> >>> But running it from the nagios host I get: >>> - >>> spmon:~ # /usr/lib/nagios/plugins/check_nrpe -H solaris -c check_disk >>> -a 50% 10% /test >>> DISK WARNING - free space: /test 101125 MB (28% inode=100%);| >>> /test=248599MB;174862;314752;0;349725 >>> spmon:~ # echo $? >>> 0 >>> - >> >> How is check_disk defined in your nrpe config and is it configured to >> allow arguments? >> > > I have: > > > dont_blame_nrpe=1 > command[check_disk]=/usr/local/nagios/libexec/check_disk -w $ARG1$ -c > $ARG2$ -p $ARG3$ > > > Regards, > It's weird, if I restart the daemon it works, but just for the first execution. - spmon:/etc/nagios/objects/services # /usr/lib/nagios/plugins/check_nrpe -H billbd2 -c check_disk -a 90% 80% /kml_inst2 DISK CRITICAL - free space: /kml_inst2 76172 MB (21% inode=100%);| /kml_inst2=273552MB;34972;69944;0;349725 spmon:/etc/nagios/objects/services # echo $? 2 spmon:/etc/nagios/objects/services # /usr/lib/nagios/plugins/check_nrpe -H billbd2 -c check_disk -a 90% 80% /kml_inst2 DISK CRITICAL - free space: /kml_inst2 76172 MB (21% inode=100%);| /kml_inst2=273552MB;34972;69944;0;349725 spmon:/etc/nagios/objects/services # echo $? 0 -- Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NRPE gives wrong exit codes
2009/9/3 Morris, Patrick : > On Thu, 03 Sep 2009, Ciro Iriarte wrote: > >> Hi, i'm trying to monitor a Solaris box using NRPE. The thing is >> nagios sees always an EXIT_CODE=0. >> >> Running the check by hand on the Solaris host works as expected: >> >> -- >> [solaris ~]$ /usr/local/nagios/libexec/check_disk -w 50% -c 10% -p /kml_ >> DISK WARNING - free space: /kml_inst2 101125 MB (28% inode=100%);| >> /kml_inst2=248599MB;174862;314752;0;349725 >> [solaris ~]$ echo $? >> 1 >> -- >> >> But running it from the nagios host I get: >> - >> spmon:~ # /usr/lib/nagios/plugins/check_nrpe -H solaris -c check_disk >> -a 50% 10% /test >> DISK WARNING - free space: /test 101125 MB (28% inode=100%);| >> /test=248599MB;174862;314752;0;349725 >> spmon:~ # echo $? >> 0 >> - > > How is check_disk defined in your nrpe config and is it configured to > allow arguments? > I have: dont_blame_nrpe=1 command[check_disk]=/usr/local/nagios/libexec/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$ Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NRPE gives wrong exit codes
Hi, i'm trying to monitor a Solaris box using NRPE. The thing is nagios sees always an EXIT_CODE=0. Running the check by hand on the Solaris host works as expected: -- [solaris ~]$ /usr/local/nagios/libexec/check_disk -w 50% -c 10% -p /kml_ DISK WARNING - free space: /kml_inst2 101125 MB (28% inode=100%);| /kml_inst2=248599MB;174862;314752;0;349725 [solaris ~]$ echo $? 1 -- But running it from the nagios host I get: - spmon:~ # /usr/lib/nagios/plugins/check_nrpe -H solaris -c check_disk -a 50% 10% /test DISK WARNING - free space: /test 101125 MB (28% inode=100%);| /test=248599MB;174862;314752;0;349725 spmon:~ # echo $? 0 - Versions: check_nrpe -- NRPE Plugin for Nagios Copyright (c) 1999-2008 Ethan Galstad (nag...@nagios.org) Version: 2.12 Last Modified: 03-10-2008 License: GPL v2 with exemptions (-l for more info) SSL/TLS Available: Anonymous DH Mode, OpenSSL 0.9.6 or higher required - remote NRPE agent - spmon:~ # /usr/lib/nagios/plugins/check_nrpe -H solaris NRPE v2.12 It's a bug in NRPE agent?, configuration error? Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Redundant Nagios setups...
2009/8/26 Janet Sullivan : > Other than the methods described in the official docs > (http://nagios.sourceforge.net/docs/3_0/redundancy.html) and Merlin > (http://www.op5.org/community/projects/merlin) are there any other suggested > solutions for setting up redundant Nagios servers? > For HA we use a Xen domU that can jump between 2 physical hosts... Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NRPE vs. check_by_ssh
2009/3/25 Idriss ARABBAJ : > Hi Kevin, > > I carefully read your speech about this subject and I found you a lot > of insist on security offering by ssh, but you can also configure > nrpe to work with ssl so I think we will have no difference at this > level, then what do you think? > best regards > > > > -- > Cordialement, > Idriss ARABBAJ > Don't top-post We have seen performance issues (in the key exchange phase mostly) using check_by_ssh with about 300 checks on a single host. NRPE using SSL has less overhead. Running a plugin using SSH took 7.5 to 10 seconds (various tests) and running the same with NRPE took 0.148 seconds at most. The target server is a big SPARC with 3 CPUs dual-core, dual-strand and 24GB, so is not a hardware issue Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- -- Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are powering Web 2.0 with engaging, cross-platform capabilities. Quickly and easily build your RIAs with Flex Builder, the Eclipse(TM)based development software that enables intelligent coding and step-through debugging. Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Ndoutils MySQL performance problem?
2009/3/3 Livio Zanol Puppim : > Hello everyone! > > I'm having huge troubles with Mysql using 100% of CPU at peculiar times. I > don't know when this problem starts and I don't know the reason, but when it > happends I kill nagios, ndoutils and Mysql, and try to restart Mysql, > ndoutils and Nagios at this order. When I start Nagios, the CPU usage of > Mysql goes to 100% and everything just freezes with the initials > INSERT/UPDATES made by ndoutils. > > Has anyone had the same problem? I've changed my.cnf to hold slow-queries > logging, an have seen at intervals of 1 minute a query runned by ndo that > tries to delete service_checks results. It takes about 4 seconds too run. Is > this expected? Can anyone help me? > > Running Nagios 3.0.3, Debian etch, NDO2DB 1.4b7 and mysql Ver 14.12 Distrib > 5.0.45. > > > Slow-Queries log: > # Time: 090303 14:29:25 > # u...@host: ndoutils[ndoutils] @ localhost [] > # Query_time: 4 Lock_time: 0 Rows_sent: 0 Rows_examined: 0 > DELETE FROM nagios_servicechecks WHERE instance_id='1' AND > start_time # Time: 090303 14:30:26 > # u...@host: ndoutils[ndoutils] @ localhost [] > # Query_time: 4 Lock_time: 0 Rows_sent: 0 Rows_examined: 0 > DELETE FROM nagios_servicechecks WHERE instance_id='1' AND > start_time > Thank you. > -- > []'s > > Lívio Zanol Puppim > Try creating a index for that table - create index nagios_servicechecks_idx1 on nagios_servicechecks (start_time, instance_id) - I'm not sure if the order would change the result though (start_time, instance_id) OR (instance_id, start_time) Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Greetings. Anyone using Oracle Grid Control to send alert data into Nagios?
2009/2/19 : > > I'm looking to have Oracle grid control send alert data into Nagios. Thus far > I've only been doing things via NRPE, as it's a fairly small monitoring > structure. > > I was wondering if anyone was interfacing to Oracle grid control, what > information you were getting from it and how you were getting that information > into Nagios. And if anyone is using an event handler in conjunction with > this, what sorts of things are you using event handlers for? > > James Melin > Sr. Systems Software Programmer > Hennepin County Mainframe Services > 612-348-7147 > Never used that application, but if it has support for SNMP (I believe it does), you can write a script to collect data with snmpget (snmpwalk will give you a list of all you can extract from the agent). Nagios can also work with SNMP traps. Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Using SLES10 and Nagios 3.x under VMWare
2009/2/19 Deborah Martin : > Hi Christian, > > The vmware environment (i.e Sles10) is built by IT services here so I don't > have any insight on this part. > > I shall then build nagios from source so that I have a more granular control > over where things go although it's mostly > to standard places i.e /usr/local/nagios etc > > > > regards, > deborah > > -Original Message- > From: Christian Schneemann [mailto:cschneem...@suse.de] > Sent: 19 February 2009 14:17 > To: nagios-users@lists.sourceforge.net > Subject: Re: [Nagios-users] Using SLES10 and Nagios 3.x under VMWare > > On Thursday February 19 2009 02:30:29 pm Deborah Martin wrote: >> Hi, >> >> I've never used SLES10 + Nagios 3.x as a VM Ware box and I wondered if >> any of you had come across any issues that I should be aware of. (A >> new environment is being built in this way to replace an old PC >> running nagios) >> >> Basically, the nagios 3.x setup is very simple. We use it to monitor >> database activity such as RAM or disk usage amongst others using >> plugins that I wrote. >> >> Nagios monitors a few other nodes using ping or ssh but other than >> that it's not a complicated setup. >> >> Any feedback / advice on running Nagios within VM Ware would be really >> useful. >> >> thanks, >> deborah > Hi, > we've used an vmware image for testing our config and new nagios packages > and everything went fine. We now changed to xen guests because I prefer xen > ;) > > I see no problems with running nagios in a virtual machine. > > Do you use the official packages from SLES or the ones from buildservice? Or > do you install nagios from sources? > > Greetings, >Christian > If you are running a 32 bits machine, you can use the kernel that has VMware's paravirtualization support and ask your infrastructure admin to enable that option in the VM. I'm running nagios v3 from BuildService on a Xen domU with SLES10SP2. Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring Shared Storage
2008/9/29 Jay Chandler <[EMAIL PROTECTED]>: > I've got a scenario that I'd imagine many of you have already dealt > with, so before I roll my own solution I figured I'd ask y'all what > you've done. > > I have roughly 50 servers or so that mount their home directories (as > well as a few other things, but let's talk about /home for simplicity) > from an NFS server (isilon at the moment, shortly to become a NetApp). > > Because space is expensive, periodically central storage fills up. This > results in /home sending out notifications from all 50 servers. > > Is there a good way to set things up so that if central storage fills up > I don't get paged by every system we've got? I don't want to give up > monitoring /home on these hosts since our configuration monkey has > screwed up before and NOT mounted the NFS share, leading to the > mountpoint on local disk filling up instead. > > Thanks-- my apologies if this wasn't clear. > > -- > Jay Chandler / KB1JWQ > Living Legend / Systems Exorcist > Today's Excuse: bit bucket overflow > Can you check the FS directly on the NFS server?, that would be better, and just check that the FS is mounted on your 50 clients... To check mount points: http://www.nagiosexchange.org/cgi-bin/page.cgi?g=2588.html;d=1 http://www.nagiosexchange.org/cgi-bin/page.cgi?g=1334.html;d=1 http://www.nagiosexchange.org/cgi-bin/page.cgi?g=1552.html;d=1 http://www.nagiosexchange.org/cgi-bin/page.cgi?g=2172.html;d=1 Regards, Ciro - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Check_SSH Plug-In
2008/8/18 John Moore <[EMAIL PROTECTED]>: > Hey group, > I am running a SLES server (SuSE) with a copy of Nagios 2.6 installed and > working. > I want the check_ssh plug-in to report with a Warning when ssh is running, > and Green when ssh is off. > It would be really great if I could use NRPE and perform this check locally. > > Any suggestions? > > Thanks, > John > There's a "negate" plugin you can combine with check_ssh to negate the status of the plugin. Regards, Ciro - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Environment variables
2008/8/12 <[EMAIL PROTECTED]>: > I usually have env variable files which I source depending on the > instance/db. When I write my plugins I either specify the env > variable file as a command line argument, or hardcode the sourcing in > my script. > That's what I did at last, even this should be global variables (no SID defined there), only ORACLE_HOME, TNS_ADMIN and LD_LIBRARY_PATH, needed by all the plugins regarding to which DB they connect Regards, Ciro - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Environment variables
2008/8/11 Hugo van der Kooij <[EMAIL PROTECTED]>: > -BEGIN PGP SIGNED MESSAGE- > Hash: SHA1 > > Ciro Iriarte wrote: > | Is there other way to set environment variables than modifying the > | nagios start script?, I need to set Oracle variables for several > | scripts/plugins but I would like it to survive a > | reinstallation/migration. > > Create your own wrapper for the plugins. > > If you stick to the usage parameters of the plugins that need the > wrapper script you should be fine. > > Hugo. > So, there's no option available from nagios to define global variables?. The idea is that I don't have to change multiple script every time I change oracle client version or reinstall/migrate the monitoring host Regards, Ciro - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Environment variables
Is there other way to set environment variables than modifying the nagios start script?, I need to set Oracle variables for several scripts/plugins but I would like it to survive a reinstallation/migration. Regards, Ciro - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] ndo2db - Error: mysql_query() failed
The delay seems to be due a slow delete process in the mysql backend... Any optimization advices? Regards, Ciro - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] ndo2db - Error: mysql_query() failed
Hi, i'm having issues with ndoutils 1.4b7 and nagios 3.0.3, after some time nagios stops working and I have many defunct processes with no movement on nagios.log. On /var/log/messages I found some complains from ndo2db, but i'm not sure they are related. -- Aug 5 14:03:30 spmon ndo2db: Error: mysql_query() failed for 'INSERT INTO nagios_servicestatus SET instance_id='1', service_object_id='672', status_update_time=FROM_UNIXTIME(1217959410), output='LOG CRITICAL - /tas/monAlarm/log/alarm\.log: \[2008-08-05 14:00:01\] Insert alarm \[\[ALARMA\] Recibi SIGNAL SIGUSR1\. Pid=\(10345\)\] for pid=10345 \(intfCallRater\[Interface_Tasador_de_Llamadas\]\): Ok', perfdata='', current_state='2', has_been_checked='1', should_be_scheduled='1', current_check_attempt='1', max_check_attempts='1', last_check=FROM_UNIXTIME(1217959401), next_check=FROM_UNIXTIME(1217959116), check_type='0', last_state_change=FROM_UNIXTIME(1217959401), last_hard_state_change=FROM_UNIXTIME(1217959401), last_hard_state='0', last_time_ok=FROM_UNIXTIME(1217958996), last_time_warning=FROM_UNIXTIME(0), last_time_unknown=FROM_UNIXTIME(0), last_time_critical=FROM_UNIXTIME(1217959401), state_type='1', last_notification=FROM_UNIXTIME(1217959404), next_notification=FROM_UNIXTIME(3110119404), no_more_notifications='0', notifications_enabled='1', problem_has_been_acknowledged='0', acknowledgement_type='0', current_notification_number='1', passive_checks_enabled='1', active_checks_enabled='1', event_handler_enabled='1', flap_detection_enabled='0', is_flapping='0', percent_state_change='15.197370', latency='285.269000', execution_time='1.033410', scheduled_downtime_depth='0', failure_prediction_enabled='1', process_performance_data='1', obsess_over_service='1', modified_service_attributes='0', event_handler='ins2tt', check_command='check_log!/tas/monAlarm/log/alarm\.log!\'Insert\'!\'no definida en tabla CELDAS\'', normal_check_interval='2.00', retry_check_interval='1.00', check_timeperiod_object_id='139' ON DUPLICATE KEY UPDATE instance_id='1', service_object_id='672', status_update_time=FROM_UNIXTIME(1217959410), output='LOG CRITICAL - /tas/monAlarm/log/alarm\.log: \[2008-08-05 14:00:01\] Insert alarm \[\[ALARMA\] Recibi SIGNAL SIGUSR1\. Pid=\(10345\)\] for pid=10345 \(intfCallRater\[Interface_Tasador_de_Llamadas Aug 5 14:03:32 spmon ndo2db: Error: mysql_query() failed for 'INSERT INTO nagios_servicestatus SET instance_id='1', service_object_id='672', status_update_time=FROM_UNIXTIME(1217959411), output='LOG CRITICAL - /tas/monAlarm/log/alarm\.log: \[2008-08-05 14:00:01\] Insert alarm \[\[ALARMA\] Recibi SIGNAL SIGUSR1\. Pid=\(10345\)\] for pid=10345 \(intfCallRater\[Interface_Tasador_de_Llamadas\]\): Ok', perfdata='', current_state='2', has_been_checked='1', should_be_scheduled='1', current_check_attempt='1', max_check_attempts='1', last_check=FROM_UNIXTIME(1217959401), next_check=FROM_UNIXTIME(1217959521), check_type='0', last_state_change=FROM_UNIXTIME(1217959401), last_hard_state_change=FROM_UNIXTIME(1217959401), last_hard_state='2', last_time_ok=FROM_UNIXTIME(1217958996), last_time_warning=FROM_UNIXTIME(0), last_time_unknown=FROM_UNIXTIME(0), last_time_critical=FROM_UNIXTIME(1217959401), state_type='1', last_notification=FROM_UNIXTIME(1217959404), next_notification=FROM_UNIXTIME(3110119404), no_more_notifications='0', notifications_enabled='1', problem_has_been_acknowledged='0', acknowledgement_type='0', current_notification_number='1', passive_checks_enabled='1', active_checks_enabled='1', event_handler_enabled='1', flap_detection_enabled='0', is_flapping='0', percent_state_change='15.197370', latency='285.269000', execution_time='1.033410', scheduled_downtime_depth='0', failure_prediction_enabled='1', process_performance_data='1', obsess_over_service='1', modified_service_attributes='0', event_handler='ins2tt', check_command='check_log!/tas/monAlarm/log/alarm\.log!\'Insert\'!\'no definida en tabla CELDAS\'', normal_check_interval='2.00', retry_check_interval='1.00', check_timeperiod_object_id='139' ON DUPLICATE KEY UPDATE instance_id='1', service_object_id='672', status_update_time=FROM_UNIXTIME(1217959411), output='LOG CRITICAL - /tas/monAlarm/log/alarm\.log: \[2008-08-05 14:00:01\] Insert alarm \[\[ALARMA\] Recibi SIGNAL SIGUSR1\. Pid=\(10345\)\] for pid=10345 \(intfCallRater\[Interface_Tasador_de_Llamadas Aug 5 14:03:32 spmon ndo2db: Error: mysql_query() failed for 'INSERT INTO nagios_servicestatus SET instance_id='1', service_object_id='672', status_update_time=FROM_UNIXTIME(1217959411), output='LOG CRITICAL - /tas/monAlarm/log/alarm\.log: \[2008-08-05 14:00:01\] Insert alarm \[\[ALARMA\] Recibi SIGNAL SIGUSR1\. Pid=\(10345\)\] for pid=10345 \(intfCallRater\[Interface_Tasador_de_Llamadas\]\): Ok', perfdata='', current_state='2', has_been_checked='1', should_be_scheduled='1', current_check_attempt='1', max_check_attempts='1', last_check=FROM_UNIXTIME(1217959401), next_check=FROM_UNIXTIME(1217959521), check_type='0'
Re: [Nagios-users] Help with nrpe and certificates
2008/7/24 Steve Searle <[EMAIL PROTECTED]>: > Hi, > > I have nagios monitoring a number of servers using the nrpe plugin. > Recently one of the checks has failed with a "could not complete SSL > handshake" error. > > I think this is because the certificate has either expired or been > re-created (self signed), but I can't find out what certificate I should > create to get round this (you will see I am a newbie in this area). > > Can anyone else help, or point me at some relevant documentation. I > realise I may have got this totally wrong, so please a) be patient, and > b) feel free to point out any blindingly obvious errors I have made. > > Thanks > > Steve > > -- > Check that your monitoring host is allowed to connect to that host (nrpe.cfg). Regards, Ciro - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Migrating to v3
2008/7/21 Ciro Iriarte <[EMAIL PROTECTED]>: > 2008/7/21 Andreas Ericsson <[EMAIL PROTECTED]>: >> Ciro Iriarte wrote: >>> >>> Hi, >>> >>> I'm migrating our v2.x installation of nagios to 3.0.3 and i'm stuck >>> currently, can sameone check if i'm missing something here?: >>> >>> Verification reports: >>> - >>> Error: Service description, host name, or check command is NULL >>> Error: Could not register service (config file >>> '/etc/nagios/objects/services/filesystems.win.cfg', starting on line >>> 1) >>> - >>> >>> First lines of filesystems.win.cfg: >>> >>> - >>> define service{ >>>use filesystems >>>hostgroup_name nrpe_nt >>>service_description Unidad c:\ >> >> Nagios 3 supports line-continuation in configuration files. Remove the >> backslash >> at the end of this line and you'll be fine. >> >> -- >> Andreas Ericsson [EMAIL PROTECTED] >> OP5 AB www.op5.se >> Tel: +46 8-230225 Fax: +46 8-230231 >> > > Thanks to both of you, that was the issue, I had to add "\\" instead > and it worked! > > Regards, > Ciro > Nope, that didn't help, the error desappeared and also this my services (not seen in the web interface), deletin the "\" did it (just "Unidad c:" works. Thanks Ciro - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Migrating to v3
2008/7/21 Andreas Ericsson <[EMAIL PROTECTED]>: > Ciro Iriarte wrote: >> >> Hi, >> >> I'm migrating our v2.x installation of nagios to 3.0.3 and i'm stuck >> currently, can sameone check if i'm missing something here?: >> >> Verification reports: >> - >> Error: Service description, host name, or check command is NULL >> Error: Could not register service (config file >> '/etc/nagios/objects/services/filesystems.win.cfg', starting on line >> 1) >> - >> >> First lines of filesystems.win.cfg: >> >> - >> define service{ >>use filesystems >>hostgroup_name nrpe_nt >>service_description Unidad c:\ > > Nagios 3 supports line-continuation in configuration files. Remove the > backslash > at the end of this line and you'll be fine. > > -- > Andreas Ericsson [EMAIL PROTECTED] > OP5 AB www.op5.se > Tel: +46 8-230225 Fax: +46 8-230231 > Thanks to both of you, that was the issue, I had to add "\\" instead and it worked! Regards, Ciro - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Migrating to v3
Hi, I'm migrating our v2.x installation of nagios to 3.0.3 and i'm stuck currently, can sameone check if i'm missing something here?: Verification reports: - Error: Service description, host name, or check command is NULL Error: Could not register service (config file '/etc/nagios/objects/services/filesystems.win.cfg', starting on line 1) - First lines of filesystems.win.cfg: - define service{ use filesystems hostgroup_name nrpe_nt service_description Unidad c:\ check_command check_nrpe_args!check_disk!c!2%!1%!\/p event_handler ins2tt event_handler_enabled 1 } define service{ use filesystems host_name asusis-dwhp2,asusis-dc,asusis-af service_description Unidad d:\ check_command check_nrpe_args!check_disk!d!8%!5%!\/p event_handler ins2tt event_handler_enabled 1 } define service{ use filesystems host_name asusis-dwhp service_description Unidad d:\ check_command check_nrpe_args!check_disk!d!25000!15000!\/p event_handler ins2tt event_handler_enabled 1 } - The old parameter was just "hostgroup" and according to the documentation "hostgroup_name" can still replace the "host_name" parameter The template: - define service{ use generic-service namefilesystems is_volatile 0 check_period24x7 max_check_attempts 3 normal_check_interval 2 retry_check_interval1 notification_interval 120 notification_period 24x7 notification_optionsw,u,c,r register0 event_handler ins2tt } - Regards, Ciro - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios alerts via Windows Messenger
2008/4/24, Goltz, Jim (NIH/NLM/LHC) [C] <[EMAIL PROTECTED]>: > This may be an oft-asked question, but I haven't found any mention in > recent archives. > > I'm experimenting with using the Windows Messenger service (NOT MSN > Messenger) to send alerts. Basically Nagios calls "smbclient -M > some_host", and a message pops up on some_host with whatever text I pipe > into smbclient. > > The problem is, using smbclient I need to specify which host to send the > message to. I'd much rather send the message to whichever host a > particular user is logged into. I know it's possible: running "net send > username text_of_message" in a Windows command shell will do this. But > I don't know what SMB domain magic happens to tell "net" what machine > "username" is logged into. > > Has anyone ever successfully done this, perchance? > > -- > Jim Goltz <[EMAIL PROTECTED]> > Contractor, Lockheed Martin > National Library of Medicine > > We use smbclient for notification and just specify the user instead of the machine, we have a Windows Domain, probably that affects the behaviour. Ciro - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios 3 GUI config tools
2008/4/21, Angel L. Mateo <[EMAIL PROTECTED]>: > > > > Centreon looks pretty good, but it's not fully functional for v3 I > > > guess it could be a great frontend if it gets more attention from > > > community... > > > > > > Ciro > > > > > > Link: http://www.centreon.com/Product/Screenshots.html > > > > The beta for 2.0 which supported 3.0 is supposed to be released this > > week, betting it will be sometime next week though. > > > > We are using centreon 1.4.2.3 with nagios 3 in a test environment and > we don't have any problem. The only problem could be that with this > configuration we can't use nagios 3 specific features. > > -- > Angel L. Mateo Martínez > Sección de Telemática > Área de Tecnologías de la Información _o) > y las Comunicaciones Aplicadas (ATICA) / \\ > http://www.um.es/atica_(___V > Tfo: 968367590 > Fax: 968398337 > Well, i tried it some time ago (when it was named oreon), but it didn't support all the configuration directives for Nagios v2, like $ADDRESSN$ (if I remember right) so it's useless for us, as we have nagios running already for many years with a currently long and complex configuration (the reason to try oreon/centreon) and need it to just work after migration to Centreon. Regards. CI.- - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios 3 GUI config tools
2008/4/11, Yost, Karl <[EMAIL PROTECTED]>: > > -Original Message- > > From: [EMAIL PROTECTED] [mailto:nagios-users- > > > [EMAIL PROTECTED] On Behalf Of Ciro Iriarte > > Sent: Thursday, April 10, 2008 5:01 PM > > To: Sekhar > > Cc: nagios-users@lists.sourceforge.net > > > Subject: Re: [Nagios-users] Nagios 3 GUI config tools > > > > 2008/4/9, Sekhar <[EMAIL PROTECTED]>: > > > Hi, > > > > > > I am looking for Nagios 3 GUI config tools for hosts,groups,contacts > > etc > > > > > > No one seems to be developing nice tool for nagios gui config > > > > > > fruity - this is only publicity stunt RC release nearly 2 years > > > > > > Is there any other tools which can be used for easy configuration of > > nagios > > > > > > thanks for your help > > > > > > Best Regards > > > > > > > > Centreon looks pretty good, but it's not fully functional for v3 I > > guess it could be a great frontend if it gets more attention from > > community... > > > > Ciro > > > > Link: http://www.centreon.com/Product/Screenshots.html > > The beta for 2.0 which supported 3.0 is supposed to be released this > week, betting it will be sometime next week though. > > -Karl > > > > > > --- > Yes, and It needs a lot of testing and feedback, so if there are people with spare time, please try it!, a distributed solution would be great and for graphs it uses the ndo database data. The old roadmap stated a PHP interface for nagios v3, apparently it didn't happen.. Ciro - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios 3 GUI config tools
2008/4/9, Sekhar <[EMAIL PROTECTED]>: > Hi, > > I am looking for Nagios 3 GUI config tools for hosts,groups,contacts etc > > No one seems to be developing nice tool for nagios gui config > > fruity - this is only publicity stunt RC release nearly 2 years > > Is there any other tools which can be used for easy configuration of nagios > > thanks for your help > > Best Regards > Centreon looks pretty good, but it's not fully functional for v3 I guess it could be a great frontend if it gets more attention from community... Ciro Link: http://www.centreon.com/Product/Screenshots.html - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] ePN error
"Patrick Proy" <[EMAIL PROTECTED]> escribió en el mensaje news:[EMAIL PROTECTED] Hi, Download latest version 1.4.3 of the script : http://www.manubulon.com/nagios/check_snmp_int.pl This should fix the problem. Patrick http://www.manubulon.com/nagios/ Thanks a lot.. Ciro - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] ePN error
"Patrick Proy" <[EMAIL PROTECTED]> escribió en el mensaje news:[EMAIL PROTECTED] Hi, I agree with S. Hopcroft, it must be a script error. I suspect that you use performance output (-k option) : is it the case ? Could you send the options you use with the script ? (command + service definition ?) and the return of the script with -v option using these options. Patrick http://www.manubulon.com/nagios/ Thanks for your answer, here's the info you requested: - Service definition define service{ use routers-template service_description Trafico Intf 11 ENC host_name rouasusis01 contact_groups operaciones check_command check_snmp_int!not_real_community!Serial3 event_handler ins2tt event_handler_enabled 1 } - Command definition define command{ command_namecheck_snmp_int command_line$USER1$/check_snmp_int.pl -H $HOSTADDRESS$ -C $ARG1$ -n $ARG2$ -r -f -e } - On commandline: -- Working case asusis-ope2:/usr/lib/nagios/plugins # ./check_snmp_int.pl -H rouasusis01 -C not_real_community -n Serial1 -r -f -e Serial1:UP:1 UP: OK | 'Serial1_in_octet'=312860304c 'Serial1_out_octet'=310979079c 'Serial1_in_error'=583c 'Serial1_in_discard'=0c 'Serial1_out_error'=0c 'Serial1_out_discard'=0c -- Giving critical asusis-ope2:/usr/lib/nagios/plugins # ./check_snmp_int.pl -H rouasusis01 -C not_real_community -n Serial3 -r -f -e Use of uninitialized value in print at ./check_snmp_int.pl line 610. Serial3:DOWN: 1 int NOK : CRITICAL | Ciro - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] ePN error
"Stanley Hopcroft" <[EMAIL PROTECTED]> escribió en el mensaje news:[EMAIL PROTECTED] > Dear Sir, > > I am writing to thank you for your letter and say, > > On 06/12/06, Ciro Iriarte <[EMAIL PROTECTED]> wrote: >> Hi, i'm running nagios 2.5 with various perl scripts, just added >> check_snmp_int.pl (http://www.manubulon.com/nagios/snmp_int.html) to the >> mix, but i'm getting some ePN error when it gives CRITICAL status (it >> could >> be a coincidence) >> >> Output: >> Serial2:DOWN: 1 int NOK : CRITICAL**ePN >> /usr/lib/nagios/plugins/check_snmp_int.pl: "Use of uninitialized value in >> join or string at /usr/sbin/p1.pl line 72,". >> >> According to the output, it's the TIEHANDLE sub >> >> 70: sub TIEHANDLE { >> 71:my ($class) = @_; >> 72:my $me = ''; >> 73:bless \$me, $class; >> 74:} >> >> Is this a problem with ePN (what i think, because of the output) or with >> the >> script? > > If it is ePN, it certainly isn't in the TIEHANDLE sub (which only runs > when > ePN loads - once when Nagios starts). > > It's unimpressive that the error is so poorly reported. > > The problem is almost certainly with plugin. What I think is happening > is that what the plugin writes (to STDOUT) when it reports a CRITICAL, > contains some undefined values (when the plugin calls print, the tied > file handle results in the p1.pl PRINT sub being called. All PRINT > does is concatenate all the args into a string and append it to the > tied object). > > (here's a demo > > [EMAIL PROTECTED] standardise_router_configs]$ cat x > #!/usr/bin/perl -w > > use strict ; > > my @x = (undef, 1, undef, 2) ; > > print join(" ", @x), "\n" ; > [EMAIL PROTECTED] standardise_router_configs]$ perl x > Use of uninitialized value in join or string at x line 7. > Use of uninitialized value in join or string at x line 7. > 1 2 > [EMAIL PROTECTED] standardise_router_configs]$ > ) > > If you can simulate or reproduce the event that causes the problem, > try to inspect the values of the variables that the plugin outputs > before it exits with the CRITICAL. > You should see the problem - but it will be reported differently - > with or without ePN. > > ePN could and maybe should be modified to not die when print is called > with undef elements. On the other hand, Perl with strict always > complains and since ePN alwats runs with strict, you see the > complaints (assuming that is really the cause of the problem). > > HTH, > > Yours sincerely. > Thanks a lot for you answer, i'll send more details to debug the plugin. Ciro - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] ePN error
Hi, i'm running nagios 2.5 with various perl scripts, just added check_snmp_int.pl (http://www.manubulon.com/nagios/snmp_int.html) to the mix, but i'm getting some ePN error when it gives CRITICAL status (it could be a coincidence) Output: Serial2:DOWN: 1 int NOK : CRITICAL**ePN /usr/lib/nagios/plugins/check_snmp_int.pl: "Use of uninitialized value in join or string at /usr/sbin/p1.pl line 72,". According to the output, it's the TIEHANDLE sub 70: sub TIEHANDLE { 71:my ($class) = @_; 72:my $me = ''; 73:bless \$me, $class; 74:} Is this a problem with ePN (what i think, because of the output) or with the script? Thanks in advanced Ciro - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Nagios Future
Hi, I've been using Nagios since version 1.1, right now we are using Nagios 2.5 version + Perfparse and i'm planning to implement Oreon on top of it but just found about ndoutils so i'm wondering how should i proceed, can the ndoutils replace perfparse?, would Oreon be pointless with the planned PHP version of the interfase?, it seems that the ndoutils would make posible running many nagios monitoring processes/hosts with the same data backend (great HA), at least that's what i understood. Is viable a migration of data from perfparse to ndoutils? Any comments? Thanks in advanced, Ciro - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys -- and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null