Re: [Nagios-users] Monitoring disk usage
jumping in a bit late on this one... so it sounds like the check itself works on the local box... so if you run from the nagios server ./libexec/check_nrpe -H server -P 5666 -t 10 -c check_root and then... ./libexec/check_nrpe -H server -P 5666 -t 10 -c check_ora could your Nagios server also be connecting to the wrong server... ? I've had DNS issues drive me berserk before. -- Mat W. - http://www.techadre.com Date: Wed, 15 Jul 2009 16:14:27 +0300 From: juki.e...@gmail.com To: guid...@gmail.com CC: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] Monitoring disk usage Hi Guy, My findings in-line... 2009/7/15 Guy Waugh They should only be defined in one place. From those errors, it sounds like NRPE on the monitored host doesn't have those commands defined, which suggests that it's not reading your nrpe.cfg config file. I would check out NRPE on the monitored host... do any checks on that host work at all? Yes, indeed the checks do work. These have been run on the monitored host. See below; bash$ /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p /var/opt/BGw/Server1 DISK OK - free space: /var/opt/BGw/Server1 35606 MB (68% inode=97%);| /var/opt/BGw/Server1=16589MB;37990;43417;0;54272 bash$ /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p /var/opt/BGw/Server1 DISK OK - free space: /var/opt/BGw/Server1 35614 MB (68% inode=97%);| /var/opt/BGw/Server1=16581MB;37990;43417;0;54272 Is the nrpe.cfg file in /usr/local/nagios/etc? Is either the NRPE daemon running or xinetd is handling the NRPE connections? Is the host listening on port 5666? Are the permissions on the nrpe.cfg file correct? What happens when you run an NRPE service check for that host manually, from the nagios server? Yes, the nrpe.cfg file is in /usr/local/nagios/etc with permissions; bash$ ls -l /usr/local/nagios/etc/ total 32 -rw-r--r-- 1 nagios nagios 7871 Jul 15 14:21 nrpe.cfg The NRPE daemon is running on the pid... bash$ ps -ef | grep nrpe root 25866 14582 0 16:01:25 pts/4 0:00 grep nrpe nagios 1640 1 0 Jan 09 ? 49:38 /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d nagios 9702 1 0 Jul 09 ? 1:27 /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d And listening on bash$ netstat -a | grep nrpe emm4.nrpe *.*0 0 49152 0 LISTEN *.nrpe *.*0 0 49152 0 LISTEN localhost.nrpe *.* 0 0 49152 0 LISTEN On running a telnet (on port 5666) to the monitored host from the nagios monitoring server, I get; relay$ telnet 10.0.0.5 5666 Trying 10.0.0.5... Connected to 10.0.0.5. Escape character is '^]'. When I run an NRPE service check for the monitored host manually from the nagios server, I get; relay$ /usr/local/nagios/libexec/check_nrpe -H 10.0.0.5 -p 5666 -c check_disk /var/opt/mediation/ora DISK OK - free space: / 22848 MB (37% inode=23%);| /=37500MB;54862;57910;0;60958 1491 This shows that the check_nrpe command is retrieving the details for the root (/) partition and NOT those of the /var/opt/mediation/ora partition If none of that give you any joy, post the nrpe.cfg file to this list if you like, suitably obfuscated of course. A copy of the nrpe.cfg file as follows; bash$ more nrpe.cfg # # Sample NRPE Config File # Written by: Ethan Galstad (nag...@nagios.org) # # Last Modified: 11-23-2007 # # NOTES: # This is a sample configuration file for the NRPE daemon. It needs to be # located on the remote host that is running the NRPE daemon, not the host # from which the check_nrpe client is being executed. # # LOG FACILITY # The syslog facility that should be used for logging purposes. log_facility=daemon # PID FILE # The name of the file in which the NRPE daemon should write it's process ID # number. The file is only written if the NRPE daemon is started by the root # user and is running in standalone mode. pid_file=/var/run/nrpe.pid # PORT NUMBER # Port number we should wait for connections on. # NOTE: This must be a non-priviledged port (i.e. > 1024). # NOTE: This option is ignored if NRPE is running under either inetd or xinetd server_port=5666 # SERVER ADDRESS # Address that nrpe should bind to in case there are more than one interface # and you do not want nrpe to bind on all interfaces. # NOTE: This option is ignored if NRPE is running under either inetd or xinetd #server_address=127.0.0.1 # NRPE USER # This determines the effective user that the NRPE daemon should run as. # You can either supply a username or a UID. # # NOTE: This option is ignored if NRPE is running under either inetd or xinetd #server_address=127.0.0.
Re: [Nagios-users] Monitoring disk usage
Hi Marc, 2009/7/15 Marc Powell > Are you sure it's listing port numbers and not symbolic names. Try > 'netstat -an | grep 5666'. > > * bash-3.00# netstat -an | grep 5666 *.5666 *.*0 0 49152 0 LISTEN* The NRPE daemon still doesn't seem to be running on the monitored host despite the above output... *bash-3.00# ps -afe | grep nrpe root 2110 14582 0 17:37:25 pts/4 0:00 grep nrpe * Regards -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring disk usage
On Jul 15, 2009, at 9:16 AM, Guy Waugh wrote: > bash-3.00# netstat -a | grep 5666 > > > The netstat command above didn't return any output. So this means > the NRPE daemon is either not running or not listening on port 5666. Are you sure it's listing port numbers and not symbolic names. Try 'netstat -an | grep 5666'. -- Marc -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring disk usage
Hi Juki, I'm not a Solaris 10 expert, but... do you have two versions of NRPE configured, one through 'lrc' and one through 'svc'? I don't know what the 'maintenance' entry means, but it doesn't look happy. Any Solaris 10 people out there? Cheers, Guy. On Wed, Jul 15, 2009 at 3:09 PM, Juki wrote: > Hi Guy, > > > 2009/7/15 Guy Waugh > >> Yes, indeed the checks do work. These have been run on the monitored host. >>> See below; >>> >>> *bash$ /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p >>> /var/opt/BGw/Server1 >>> DISK OK - free space: /var/opt/BGw/Server1 35606 MB (68% inode=97%);| >>> /var/opt/BGw/Server1=16589MB;37990;43417;0;54272 >>> >>> bash$ /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p >>> /var/opt/BGw/Server1 >>> DISK OK - free space: /var/opt/BGw/Server1 35614 MB (68% inode=97%);| >>> /var/opt/BGw/Server1=16581MB;37990;43417;0;54272* >>> >> >> OK, so they look like checks run from the monitored host itself, right? >> > > > Yes, indeed. > > > >> >> Eek! How come there are two??? There should only be one nrpe daemon >> running. You can see that the first one has been running since January. Kill >> them both, with extreme prejudice, restart it and verify that there's only >> one copy of the nrpe daemon running. This will probably solve your problem. >> > > > I have killed them both, however, I can't seem to get the nrpe daemon > running again on the monitored host (the OS is Soalris 10) . This is what I > executed; > > *bash$ svcadm enable nrpe > bash$/usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d* > > > Then to verify... > > *bash-3.00# svcs | grep nrpe > legacy_run Jan_09 lrc:/etc/rc3_d/S60nrpe > online 16:39:40 svc:/network/nrpe/tcp:default > maintenance16:41:16 svc:/application/management/nagios/nrpe:default* > > And more verification... > > *bash-3.00# netstat -a | grep 5666* > > > The netstat command above didn't return any output. So this means the NRPE > daemon is either not running or not listening on port 5666. > > Am I doing the NRPE start up the right way? Are there other ways to start > NRPE daemon on Solaris 10? > > > Regards, > Juki > > -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring disk usage
Hi Guy, 2009/7/15 Guy Waugh > Yes, indeed the checks do work. These have been run on the monitored host. >> See below; >> >> *bash$ /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p >> /var/opt/BGw/Server1 >> DISK OK - free space: /var/opt/BGw/Server1 35606 MB (68% inode=97%);| >> /var/opt/BGw/Server1=16589MB;37990;43417;0;54272 >> >> bash$ /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p >> /var/opt/BGw/Server1 >> DISK OK - free space: /var/opt/BGw/Server1 35614 MB (68% inode=97%);| >> /var/opt/BGw/Server1=16581MB;37990;43417;0;54272* >> > > OK, so they look like checks run from the monitored host itself, right? > Yes, indeed. > > Eek! How come there are two??? There should only be one nrpe daemon > running. You can see that the first one has been running since January. Kill > them both, with extreme prejudice, restart it and verify that there's only > one copy of the nrpe daemon running. This will probably solve your problem. > I have killed them both, however, I can't seem to get the nrpe daemon running again on the monitored host (the OS is Soalris 10) . This is what I executed; *bash$ svcadm enable nrpe bash$/usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d* Then to verify... *bash-3.00# svcs | grep nrpe legacy_run Jan_09 lrc:/etc/rc3_d/S60nrpe online 16:39:40 svc:/network/nrpe/tcp:default maintenance16:41:16 svc:/application/management/nagios/nrpe:default* And more verification... *bash-3.00# netstat -a | grep 5666* The netstat command above didn't return any output. So this means the NRPE daemon is either not running or not listening on port 5666. Am I doing the NRPE start up the right way? Are there other ways to start NRPE daemon on Solaris 10? Regards, Juki -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring disk usage
Hi Juki, On Wed, Jul 15, 2009 at 2:14 PM, Juki wrote: > Hi Guy, > > My findings in-line... > > > 2009/7/15 Guy Waugh > >> They should only be defined in one place. From those errors, it sounds >> like NRPE on the monitored host doesn't have those commands defined, which >> suggests that it's not reading your nrpe.cfg config file. I would check out >> NRPE on the monitored host... do any checks on that host work at all? >> > > > Yes, indeed the checks do work. These have been run on the monitored host. > See below; > > *bash$ /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p > /var/opt/BGw/Server1 > DISK OK - free space: /var/opt/BGw/Server1 35606 MB (68% inode=97%);| > /var/opt/BGw/Server1=16589MB;37990;43417;0;54272 > > bash$ /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p > /var/opt/BGw/Server1 > DISK OK - free space: /var/opt/BGw/Server1 35614 MB (68% inode=97%);| > /var/opt/BGw/Server1=16581MB;37990;43417;0;54272* > OK, so they look like checks run from the monitored host itself, right? > ** > > > >> Is the nrpe.cfg file in /usr/local/nagios/etc? Is either the NRPE daemon >> running or xinetd is handling the NRPE connections? Is the host listening on >> port 5666? Are the permissions on the nrpe.cfg file correct? What happens >> when you run an NRPE service check for that host manually, from the nagios >> server? >> > > Yes, the nrpe.cfg file is in /usr/local/nagios/etc with permissions; > > *bash$ ls -l /usr/local/nagios/etc/ > total 32 > -rw-r--r-- 1 nagios nagios 7871 Jul 15 14:21 nrpe.cfg* > > The NRPE daemon is running on the pid... > > *bash$ ps -ef | grep nrpe > root 25866 14582 0 16:01:25 pts/4 0:00 grep nrpe > nagios 1640 1 0 Jan 09 ? 49:38 > /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d > nagios 9702 1 0 Jul 09 ? 1:27 > /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d* > Eek! How come there are two??? There should only be one nrpe daemon running. You can see that the first one has been running since January. Kill them both, with extreme prejudice, restart it and verify that there's only one copy of the nrpe daemon running. This will probably solve your problem. Regards, Guy. -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring disk usage
Hi Guy, My findings in-line... 2009/7/15 Guy Waugh > They should only be defined in one place. From those errors, it sounds like > NRPE on the monitored host doesn't have those commands defined, which > suggests that it's not reading your nrpe.cfg config file. I would check out > NRPE on the monitored host... do any checks on that host work at all? > Yes, indeed the checks do work. These have been run on the monitored host. See below; *bash$ /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p /var/opt/BGw/Server1 DISK OK - free space: /var/opt/BGw/Server1 35606 MB (68% inode=97%);| /var/opt/BGw/Server1=16589MB;37990;43417;0;54272 bash$ /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p /var/opt/BGw/Server1 DISK OK - free space: /var/opt/BGw/Server1 35614 MB (68% inode=97%);| /var/opt/BGw/Server1=16581MB;37990;43417;0;54272* > Is the nrpe.cfg file in /usr/local/nagios/etc? Is either the NRPE daemon > running or xinetd is handling the NRPE connections? Is the host listening on > port 5666? Are the permissions on the nrpe.cfg file correct? What happens > when you run an NRPE service check for that host manually, from the nagios > server? > Yes, the nrpe.cfg file is in /usr/local/nagios/etc with permissions; *bash$ ls -l /usr/local/nagios/etc/ total 32 -rw-r--r-- 1 nagios nagios 7871 Jul 15 14:21 nrpe.cfg* The NRPE daemon is running on the pid... *bash$ ps -ef | grep nrpe root 25866 14582 0 16:01:25 pts/4 0:00 grep nrpe nagios 1640 1 0 Jan 09 ? 49:38 /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d nagios 9702 1 0 Jul 09 ? 1:27 /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d* And listening on *bash$ netstat -a | grep nrpe emm4.nrpe *.*0 0 49152 0 LISTEN *.nrpe *.*0 0 49152 0 LISTEN localhost.nrpe *.* 0 0 49152 0 LISTEN * On running a telnet (on port 5666) to the monitored host from the nagios monitoring server, I get; *relay$ telnet 10.0.0.5 5666 Trying 10.0.0.5... Connected to 10.0.0.5. Escape character is '^]'.* When I run an NRPE service check for the monitored host manually from the nagios server, I get; *relay$ /usr/local/nagios/libexec/check_nrpe -H 10.0.0.5 -p 5666 -c check_disk /var/opt/mediation/ora DISK OK - free space: / 22848 MB (37% inode=23%);| /=37500MB;54862;57910;0;60958 1491* This shows that the check_nrpe command is retrieving the details for the root (/) partition and NOT those of the /var/opt/mediation/ora partition > If none of that give you any joy, post the nrpe.cfg file to this list if > you like, suitably obfuscated of course. > A copy of the nrpe.cfg file as follows; *bash$ more nrpe.cfg # # Sample NRPE Config File # Written by: Ethan Galstad (nag...@nagios.org) # # Last Modified: 11-23-2007 # # NOTES: # This is a sample configuration file for the NRPE daemon. It needs to be # located on the remote host that is running the NRPE daemon, not the host # from which the check_nrpe client is being executed. # # LOG FACILITY # The syslog facility that should be used for logging purposes. log_facility=daemon # PID FILE # The name of the file in which the NRPE daemon should write it's process ID # number. The file is only written if the NRPE daemon is started by the root # user and is running in standalone mode. pid_file=/var/run/nrpe.pid # PORT NUMBER # Port number we should wait for connections on. # NOTE: This must be a non-priviledged port (i.e. > 1024). # NOTE: This option is ignored if NRPE is running under either inetd or xinetd server_port=5666 # SERVER ADDRESS # Address that nrpe should bind to in case there are more than one interface # and you do not want nrpe to bind on all interfaces. # NOTE: This option is ignored if NRPE is running under either inetd or xinetd #server_address=127.0.0.1 # NRPE USER # This determines the effective user that the NRPE daemon should run as. # You can either supply a username or a UID. # # NOTE: This option is ignored if NRPE is running under either inetd or xinetd #server_address=127.0.0.1 # NRPE USER # This determines the effective user that the NRPE daemon should run as. # You can either supply a username or a UID. # # NOTE: This option is ignored if NRPE is running under either inetd or xinetd nrpe_user=nagios # NRPE GROUP # This determines the effective group that the NRPE daemon should run as. # You can either supply a group name or a GID. # # NOTE: This option is ignored if NRPE is running under either inetd or xinetd nrpe_group=nagios # ALLOWED HOST ADDRESSES # This is an optional comma-delimited list of IP address or hostnames # that are allowed to talk to the NRPE daemon. # # Note: The daemon only does rudimenta
Re: [Nagios-users] Monitoring disk usage
Hi Juki, On Wed, Jul 15, 2009 at 12:46 PM, Juki wrote: > Hi Guy, > > I tried all the changes as per your suggestion, however, I still come up > with the same errors as before, that is; > > "NRPE: Command 'check_ora' not defined" > "NRPE: Command 'check_server1' not defined" > "NRPE: Command 'check_var' not defined" > > > Where else in the configuration files on the monitoring server are commands > defined? > They should only be defined in one place. From those errors, it sounds like NRPE on the monitored host doesn't have those commands defined, which suggests that it's not reading your nrpe.cfg config file. I would check out NRPE on the monitored host... do any checks on that host work at all? Is the nrpe.cfg file in /usr/local/nagios/etc? Is either the NRPE daemon running or xinetd is handling the NRPE connections? Is the host listening on port 5666? Are the permissions on the nrpe.cfg file correct? What happens when you run an NRPE service check for that host manually, from the nagios server? If none of that give you any joy, post the nrpe.cfg file to this list if you like, suitably obfuscated of course. Regards, Guy. > > > > Regards, > Juki > > > > > 2009/7/15 Guy Waugh > >> Hi Juki, >> >> On Wed, Jul 15, 2009 at 10:24 AM, Juki wrote: >> >>> Hi Guy, >>> >>> Many thanks for your post. >>> >>> I decided to go with Option (1) as per your description and did the >>> following in the nrpe.cfg file on the monitored host: >>> >>> command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p >>> / >>> command[check_var]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p >>> /var >>> command[check_server1]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% >>> -p /var/opt/BGw/Server1 >>> command[check_ora]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p >>> /var/opt/mediation/ora >> >> >> Looks good, except the first one... call it 'check_root' like this: >> >> command[check_root]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p >> / >> >> >>> >>> >>> >>> And then on the nagios monitoring server I have this: >>> >>> define service{ >>> use generic-service ; Name of >>> service template to use >>> host_name emm4 >>> service_description Root Partition >>> check_command check_nrpe!check_disk!15%!10%! >>> } >>> >>> define service{ >>> use generic-service ; Name of >>> service template to use >>> host_name emm4 >>> service_description var Partition >>> check_command check_nrpe!check_var!30%!20%! >>> } >>> >>> define service{ >>> use generic-service ; Name of >>> service template to use >>> host_name emm4 >>> service_description server1 Partition >>> check_command check_nrpe!check_server1!30%!20%! >>> } >>> >>> define service{ >>> use generic-service ; Name of >>> service template to use >>> host_name emm4 >>> service_description ora Partition >>> check_command check_nrpe!check_ora!30%!20%! >>> } >> >> >> You're still half way between the two approaches with your service >> definitions, because you're passing arguments :-) >> >> Try this instead: >> >> define service{ >> use generic-service ; Name of >> service template to use >> host_name emm4 >> service_description Root Partition >> check_command check_nrpe!check_root >> } >> >> define service{ >> use generic-service ; Name of >> service template to use >> host_name emm4 >> service_description var Partition >> check_command check_nrpe!check_var >> } >> >> define service{ >> use generic-service ; Name of >> service template to use >> host_name emm4 >> service_description server1 Partition >> check_command check_nrpe!check_server1 >> } >> >> define service{ >> use generic-service ; Name of >> service template to use >> host_name emm4 >> service_description ora Partition >> check_command check_nrpe!check_ora >> } >> >> >>> >>> >>> When I log into the nagios Web GUI, the services for the ora, server1 >>> and var partition have been successfully created. However, looking at the >>> "status information" column, I have the followin
Re: [Nagios-users] Monitoring disk usage
On Jul 15, 2009, at 6:46 AM, Juki wrote: > Hi Guy, > > I tried all the changes as per your suggestion, however, I still > come up with the same errors as before, that is; > > "NRPE: Command 'check_ora' not defined" > "NRPE: Command 'check_server1' not defined" > "NRPE: Command 'check_var' not defined" > > > Where else in the configuration files on the monitoring server are > commands defined? You're editing the correct nrpe.cfg, on the host emm4 and restarting NRPE on emm4 after doing so? It would appear that you are not. Has that been your problem the whole time? -- Marc -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring disk usage
Hi Guy, I tried all the changes as per your suggestion, however, I still come up with the same errors as before, that is; "NRPE: Command 'check_ora' not defined" "NRPE: Command 'check_server1' not defined" "NRPE: Command 'check_var' not defined" Where else in the configuration files on the monitoring server are commands defined? Regards, Juki 2009/7/15 Guy Waugh > Hi Juki, > > On Wed, Jul 15, 2009 at 10:24 AM, Juki wrote: > >> Hi Guy, >> >> Many thanks for your post. >> >> I decided to go with Option (1) as per your description and did the >> following in the nrpe.cfg file on the monitored host: >> >> command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p >> / >> command[check_var]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p >> /var >> command[check_server1]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% >> -p /var/opt/BGw/Server1 >> command[check_ora]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p >> /var/opt/mediation/ora > > > Looks good, except the first one... call it 'check_root' like this: > > command[check_root]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p / > > >> >> >> >> And then on the nagios monitoring server I have this: >> >> define service{ >> use generic-service ; Name of >> service template to use >> host_name emm4 >> service_description Root Partition >> check_command check_nrpe!check_disk!15%!10%! >> } >> >> define service{ >> use generic-service ; Name of >> service template to use >> host_name emm4 >> service_description var Partition >> check_command check_nrpe!check_var!30%!20%! >> } >> >> define service{ >> use generic-service ; Name of >> service template to use >> host_name emm4 >> service_description server1 Partition >> check_command check_nrpe!check_server1!30%!20%! >> } >> >> define service{ >> use generic-service ; Name of >> service template to use >> host_name emm4 >> service_description ora Partition >> check_command check_nrpe!check_ora!30%!20%! >> } > > > You're still half way between the two approaches with your service > definitions, because you're passing arguments :-) > > Try this instead: > > define service{ > use generic-service ; Name of > service template to use > host_name emm4 > service_description Root Partition > check_command check_nrpe!check_root > } > > define service{ > use generic-service ; Name of > service template to use > host_name emm4 > service_description var Partition > check_command check_nrpe!check_var > } > > define service{ > use generic-service ; Name of > service template to use > host_name emm4 > service_description server1 Partition > check_command check_nrpe!check_server1 > } > > define service{ > use generic-service ; Name of > service template to use > host_name emm4 > service_description ora Partition > check_command check_nrpe!check_ora > } > > >> >> >> When I log into the nagios Web GUI, the services for the ora, server1 and >> var partition have been successfully created. However, looking at the >> "status information" column, I have the following errors: >> >> "NRPE: Command 'check_ora' not defined" >> "NRPE: Command 'check_server1' not defined" >> "NRPE: Command 'check_var' not defined" >> > > Try my changes suggested above and see if these errors persist. They might > be happening because you're passing arguments to the remote NRPE daemon, I'm > not sure... > > Regards, > Guy. > -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plug
Re: [Nagios-users] Monitoring disk usage
Hi Juki, On Wed, Jul 15, 2009 at 10:24 AM, Juki wrote: > Hi Guy, > > Many thanks for your post. > > I decided to go with Option (1) as per your description and did the > following in the nrpe.cfg file on the monitored host: > > command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p / > command[check_var]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p > /var > command[check_server1]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% > -p /var/opt/BGw/Server1 > command[check_ora]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p > /var/opt/mediation/ora Looks good, except the first one... call it 'check_root' like this: command[check_root]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p / > > > > And then on the nagios monitoring server I have this: > > define service{ > use generic-service ; Name of > service template to use > host_name emm4 > service_description Root Partition > check_command check_nrpe!check_disk!15%!10%! > } > > define service{ > use generic-service ; Name of > service template to use > host_name emm4 > service_description var Partition > check_command check_nrpe!check_var!30%!20%! > } > > define service{ > use generic-service ; Name of > service template to use > host_name emm4 > service_description server1 Partition > check_command check_nrpe!check_server1!30%!20%! > } > > define service{ > use generic-service ; Name of > service template to use > host_name emm4 > service_description ora Partition > check_command check_nrpe!check_ora!30%!20%! > } You're still half way between the two approaches with your service definitions, because you're passing arguments :-) Try this instead: define service{ use generic-service ; Name of service template to use host_name emm4 service_description Root Partition check_command check_nrpe!check_root } define service{ use generic-service ; Name of service template to use host_name emm4 service_description var Partition check_command check_nrpe!check_var } define service{ use generic-service ; Name of service template to use host_name emm4 service_description server1 Partition check_command check_nrpe!check_server1 } define service{ use generic-service ; Name of service template to use host_name emm4 service_description ora Partition check_command check_nrpe!check_ora } > > > > When I log into the nagios Web GUI, the services for the ora, server1 and > var partition have been successfully created. However, looking at the > "status information" column, I have the following errors: > > "NRPE: Command 'check_ora' not defined" > "NRPE: Command 'check_server1' not defined" > "NRPE: Command 'check_var' not defined" Try my changes suggested above and see if these errors persist. They might be happening because you're passing arguments to the remote NRPE daemon, I'm not sure... Regards, Guy. > > > This indicates that I need to explicitly define 'check_ora', > 'check_server1' and 'check_var' elsewhere. So, where else must I define > these services so as to clear out the errors above? > > Thanks in advance! > > > > > > > 2009/7/15 Guy Waugh > >> Hi Juki, >> >> As I understand it, you can either (1) 'hard-code' your NRPE checks in the >> nrpe.cfg file on the monitored host (so that the checks are explicitly >> defined), or (2) set up NRPE on the monitored host in such a way that you >> can pass arguments in your NRPE checks. It looks like you're landing >> somewhere between the two with your configuration. >> >> I would either go the Option (1) way, having something like this in your >> nrpe.cfg file on the monitored host: >> *command[check_root]=/usr/local/nagios/libexec/check_disk -w 30% -c 15% >> -p / >> **command[check_var]=/usr/local/nagios/libexec/check_disk -w 30% -c 15%**-p >> /var >> **command[check_server1]=/usr/local/nagios/libexec/check_disk -w 30% -c >> 15%** -p /var/opt/BGw/Server1 >> **command[check_ora]=/usr/local/nagios/libexec/check_disk -w 30% -c 15%**-p >> /var/opt/mediation/ora >> *
Re: [Nagios-users] Monitoring disk usage
Hi Guy, Many thanks for your post. I decided to go with Option (1) as per your description and did the following in the nrpe.cfg file on the monitored host: command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p / command[check_var]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p /var command[check_server1]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p /var/opt/BGw/Server1 command[check_ora]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p /var/opt/mediation/ora And then on the nagios monitoring server I have this: define service{ use generic-service ; Name of service template to use host_name emm4 service_description Root Partition check_command check_nrpe!check_disk!15%!10%! } define service{ use generic-service ; Name of service template to use host_name emm4 service_description var Partition check_command check_nrpe!check_var!30%!20%! } define service{ use generic-service ; Name of service template to use host_name emm4 service_description server1 Partition check_command check_nrpe!check_server1!30%!20%! } define service{ use generic-service ; Name of service template to use host_name emm4 service_description ora Partition check_command check_nrpe!check_ora!30%!20%! } When I log into the nagios Web GUI, the services for the ora, server1 and var partition have been successfully created. However, looking at the "status information" column, I have the following errors: "NRPE: Command 'check_ora' not defined" "NRPE: Command 'check_server1' not defined" "NRPE: Command 'check_var' not defined" This indicates that I need to explicitly define 'check_ora', 'check_server1' and 'check_var' elsewhere. So, where else must I define these services so as to clear out the errors above? Thanks in advance! 2009/7/15 Guy Waugh > Hi Juki, > > As I understand it, you can either (1) 'hard-code' your NRPE checks in the > nrpe.cfg file on the monitored host (so that the checks are explicitly > defined), or (2) set up NRPE on the monitored host in such a way that you > can pass arguments in your NRPE checks. It looks like you're landing > somewhere between the two with your configuration. > > I would either go the Option (1) way, having something like this in your > nrpe.cfg file on the monitored host: > *command[check_root]=/usr/local/nagios/libexec/check_disk -w 30% -c 15% -p > / > **command[check_var]=/usr/local/nagios/libexec/check_disk -w 30% -c 15%**-p > /var > **command[check_server1]=/usr/local/nagios/libexec/check_disk -w 30% -c > 15%** -p /var/opt/BGw/Server1 > **command[check_ora]=/usr/local/nagios/libexec/check_disk -w 30% -c 15%**-p > /var/opt/mediation/ora > * > > , and then having services on the nagios host like this: > > *define service{ > use generic-service ; Name of > service template to use > host_name emm4 > service_description Root Partition > check_command check_nrpe!check_root > }* > > Alternatively, for the Option (2) method, on the monitored host, in > nrpe.cfg: > > *command[check_disk]=/usr/local/nagios/libexec/check_disk -w $ARG1$ -c > $ARG2$ -p $ARG3$* > > , and a service example on the nagios host: > > *define service{ > use generic-service ; Name of > service template to use > host_name emm4 > service_description Root Partition > check_command check_nrpe!check_disk!30!15!/ > }* > > I may not have the syntax in the 'check_command' of that last service > exactly right, but that's the general idea. > > To go down the Option 2 road, you have to have configured the NRPE daemon > on the monitored host with '--enable-command-args', and have to have a line > in your nrpe.cfg file that says 'dont_blame_nrpe=1'. This is because > allowing command arguments in this fashion can be a security risk. The > default nrpe.cfg file has some comments that refer to this. > > HTH, > Guy. > -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge_
Re: [Nagios-users] Monitoring disk usage
Hi Juki, As I understand it, you can either (1) 'hard-code' your NRPE checks in the nrpe.cfg file on the monitored host (so that the checks are explicitly defined), or (2) set up NRPE on the monitored host in such a way that you can pass arguments in your NRPE checks. It looks like you're landing somewhere between the two with your configuration. I would either go the Option (1) way, having something like this in your nrpe.cfg file on the monitored host: *command[check_root]=/usr/local/nagios/libexec/check_disk -w 30% -c 15% -p / **command[check_var]=/usr/local/nagios/libexec/check_disk -w 30% -c 15%** -p /var **command[check_server1]=/usr/local/nagios/libexec/check_disk -w 30% -c 15%* * -p /var/opt/BGw/Server1 **command[check_ora]=/usr/local/nagios/libexec/check_disk -w 30% -c 15%** -p /var/opt/mediation/ora* , and then having services on the nagios host like this: *define service{ use generic-service ; Name of service template to use host_name emm4 service_description Root Partition check_command check_nrpe!check_root }* Alternatively, for the Option (2) method, on the monitored host, in nrpe.cfg: *command[check_disk]=/usr/local/nagios/libexec/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$* , and a service example on the nagios host: *define service{ use generic-service ; Name of service template to use host_name emm4 service_description Root Partition check_command check_nrpe!check_disk!30!15!/ }* I may not have the syntax in the 'check_command' of that last service exactly right, but that's the general idea. To go down the Option 2 road, you have to have configured the NRPE daemon on the monitored host with '--enable-command-args', and have to have a line in your nrpe.cfg file that says 'dont_blame_nrpe=1'. This is because allowing command arguments in this fashion can be a security risk. The default nrpe.cfg file has some comments that refer to this. HTH, Guy. On Wed, Jul 15, 2009 at 7:24 AM, Juki wrote: > Hello again, > > Thanks for the tips so far. Now some responses; > > On the monitored host, I have setup nrpe.cfg and therein I have set the > option "allowed_hosts=" and also revised the format > as shown below; > > > *#specific commands# > command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c 15% -p / > -p /var -p /var/opt/BGw/Server1 -p /var/opt/mediation/ora > command[check_swap]= /usr/local/nagios/libexec/check_swap -a -w 10% -c 5% > command[check_procs]=/usr/local/nagios/libexec/check_procs -w 300 -c 400 > command[check_ssh]=/usr/local/nagios/libexec/check_ssh -4 -t 10 -p 22 > localhost > command[check_ftp]=/usr/local/nagios/libexec/check_ftp -t 5 > > *Then on the monitoring server, I have also setup the monitored host file > (host.cfg) with the service description as follows; > > *define service{ > use generic-service ; Name of > service template to use > host_name emm4 > service_description Root Partition > check_command check_nrpe!check_disk!15%!10%! > }* > * > *After setting up all this, I refreshed the nrpe client on the monitored > host and also restarted the nagios service on the monitoring server, this is > after I checked that there were no errors whatsoever in the nagios > configuration files. And yes, I logged into the monitored host as the > nagios user and was able to verify that the check_nrpe command definition > does run successfully. > > When I log into the nagios web GUI, I'm able to see the alerts, warnings > and details for the check_swap, check_procs, check_ssh and for check_ftp. > For the case of check_disk, I only see alerts/warnings and details only for > the root (/) partition and NOT those for the /var, /var/opt/BGw/Server, > /var/opt/mediation/ora. > This is despite the fact that these partitions are defined in the nrpe.cfg > file on the monitored host as shown above. > > This is still puzzling* > > * > > -- > Enter the BlackBerry Developer Challenge > This is your chance to win up to $100,000 in prizes! For a limited time, > vendors submitting new applications to BlackBerry App World(TM) will have > the opportunity to enter the BlackBerry Developer Challenge. See full prize > details at: http://p.sf.net/sfu/Challenge > ___ > Nagios-users mailing list > Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null >
Re: [Nagios-users] Monitoring disk usage
Hello again, Thanks for the tips so far. Now some responses; On the monitored host, I have setup nrpe.cfg and therein I have set the option "allowed_hosts=" and also revised the format as shown below; *#specific commands# command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c 15% -p / -p /var -p /var/opt/BGw/Server1 -p /var/opt/mediation/ora command[check_swap]= /usr/local/nagios/libexec/check_swap -a -w 10% -c 5% command[check_procs]=/usr/local/nagios/libexec/check_procs -w 300 -c 400 command[check_ssh]=/usr/local/nagios/libexec/check_ssh -4 -t 10 -p 22 localhost command[check_ftp]=/usr/local/nagios/libexec/check_ftp -t 5 *Then on the monitoring server, I have also setup the monitored host file (host.cfg) with the service description as follows; *define service{ use generic-service ; Name of service template to use host_name emm4 service_description Root Partition check_command check_nrpe!check_disk!15%!10%! }* * *After setting up all this, I refreshed the nrpe client on the monitored host and also restarted the nagios service on the monitoring server, this is after I checked that there were no errors whatsoever in the nagios configuration files. And yes, I logged into the monitored host as the nagios user and was able to verify that the check_nrpe command definition does run successfully. When I log into the nagios web GUI, I'm able to see the alerts, warnings and details for the check_swap, check_procs, check_ssh and for check_ftp. For the case of check_disk, I only see alerts/warnings and details only for the root (/) partition and NOT those for the /var, /var/opt/BGw/Server, /var/opt/mediation/ora. This is despite the fact that these partitions are defined in the nrpe.cfg file on the monitored host as shown above. This is still puzzling* * -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring disk usage
On Tue, 14 Jul 2009, Juki wrote: > Hello, > > I have installed nrpe v2.11 on a monitored host running Solaris 10. An > excerpt from nrpe.cfg on the monitored host is as follows; > > command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c 15% -p / > command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p > /var/opt/BGw/Server1 -p /var/opt/mediation/ora > command[check_swap]= /usr/local/nagios/libexec/check_swap -a -w 10% -c 5% > command[check_procs]=/usr/local/nagios/libexec/check_procs -w 300 -c 400 > command[check_ssh]=/usr/local/nagios/libexec/check_ssh -4 -t 10 -p 22 > localhost > command[check_ftp]=/usr/local/nagios/libexec/check_ftp -t 5 > > My intention is to monitor disk usage on the partitions /var/opt/BGw/Server1 > and /var/opt/mediation/ora. This is not working yet. When I log into the > Nagios web interface on the monitoring server, I do not see any details > (alerts and warnings) to do with the above mentioned partitions. However, I > do see other details like ssh, processes, swap and even those for the / > partition. > > If I run on the monitored host, I get some info; > > bash-3.00# /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p > /var/opt/BGw/Server1 -p /var/opt/mediation/ora > DISK OK - free space: /var/opt/BGw/Server1 16937 MB (31% inode=94%); > /var/opt/mediation/ora 9738 MB (64% inode=99%);| > /var/opt/BGw/Server1=36502MB;37990;43417;0;54272 > /var/opt/mediation/ora=5323MB;10997;12568;0;15711 > > From my understanding, the output above shows that the check_disk plugin > actually works, however on logging into the web interface, there are no > details to this effect and also no alerts/warnings at all. How do I get to > see the details of these partitions in the web interface including warnings > and alerts. You haven't mentioned anything about configuring Nagios to run a check against the service you've set up on NRPE. Have you done that part yet? If you're not sure how, check out the section of the NRPE docs about the Monitoring Host Setup. That should tell you everything you need to know. -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring disk usage
Juki wrote: > Hello, > > I have installed nrpe v2.11 on a monitored host running Solaris 10. An > excerpt from nrpe.cfg on the monitored host is as follows; > > *command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c 15% -p / > > command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p > /var/opt/BGw/Server1 -p /var/opt/mediation/ora > command[check_swap]= /usr/local/nagios/libexec/check_swap -a -w 10% -c 5% > command[check_procs]=/usr/local/nagios/libexec/check_procs -w 300 -c 400 > command[check_ssh]=/usr/local/nagios/libexec/check_ssh -4 -t 10 -p 22 > localhost > command[check_ftp]=/usr/local/nagios/libexec/check_ftp -t 5* > > My intention is to monitor disk usage on the partitions /var/opt/BGw/Server1 > and /var/opt/mediation/ora. This is not working yet. When I log into the > Nagios web interface on the monitoring server, I do not see any details > (alerts and warnings) to do with the above mentioned partitions. However, I > do see other details like ssh, processes, swap and even those for the / > partition. > > If I run on the monitored host, I get some info; > > *bash-3.00# /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p > /var/opt/BGw/Server1 -p /var/opt/mediation/ora > DISK OK - free space: /var/opt/BGw/Server1 16937 MB (31% inode=94%); > /var/opt/mediation/ora 9738 MB (64% inode=99%);| > /var/opt/BGw/Server1=36502MB;37990;43417;0;54272 > /var/opt/mediation/ora=5323MB;10997;12568;0;15711 > * >>From my understanding, the output above shows that the check_disk plugin > actually works, however on logging into the web interface, there are no > details to this effect and also no alerts/warnings at all. How do I get to > see the details of these partitions in the web interface including warnings > and alerts. > > > Thanks in advance, > Juki Did you add a service on the main nagios host to call the nrpe command you just set up? i.e., define service { service_description Check remote disk space host_name the_remote_host check_command check_nrpe!check_disk } HTH, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring disk usage
On Jul 14, 2009, at 3:58 AM, Juki wrote: > Hello, > > I have installed nrpe v2.11 on a monitored host running Solaris 10. > An excerpt from nrpe.cfg on the monitored host is as follows; > > command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c > 15% -p / > command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c > 20% -p /var/opt/BGw/Server1 -p /var/opt/mediation/ora You cannot have two definitions with the same [name], they must be unique. > My intention is to monitor disk usage on the partitions /var/opt/BGw/ > Server1 and /var/opt/mediation/ora. This is not working yet. When I > log into the Nagios web interface on the monitoring server, I do not > see any details (alerts and warnings) to do with the above mentioned > partitions. When reporting problems, please report what you _do_ see. It's often much more useful than what you _don't_ see. "I went to drive to work this morning but didn't see my car" isn't very informative but "Instead, I saw a giant ball of fire in the shape of a car" is much more helpful. Also include the full, complete host{} and service{} definitions for the problem service. You can get them from objects.cache if you don't want to expand templates you may be using yourself. > If I run on the monitored host, I get some info; > > bash-3.00# /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p / > var/opt/BGw/Server1 -p /var/opt/mediation/ora > DISK OK - free space: /var/opt/BGw/Server1 16937 MB (31% > inode=94%); /var/opt/mediation/ora 9738 MB (64% inode=99%);| /var/ > opt/BGw/Server1=36502MB;37990;43417;0;54272 /var/opt/mediation/ > ora=5323MB;10997;12568;0;15711 a) Never test plugins as root. Nagios doesn't run them as 'root' and you're likely to see permissions differences that send you down the completely wrong troubleshooting path. b) When testing something like this, verify you can run your check_nrpe command definition successfully for this check, from the nagios machine, as the nagios user. -- Marc -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Monitoring disk usage
Hello, I have installed nrpe v2.11 on a monitored host running Solaris 10. An excerpt from nrpe.cfg on the monitored host is as follows; *command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c 15% -p / command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p /var/opt/BGw/Server1 -p /var/opt/mediation/ora command[check_swap]= /usr/local/nagios/libexec/check_swap -a -w 10% -c 5% command[check_procs]=/usr/local/nagios/libexec/check_procs -w 300 -c 400 command[check_ssh]=/usr/local/nagios/libexec/check_ssh -4 -t 10 -p 22 localhost command[check_ftp]=/usr/local/nagios/libexec/check_ftp -t 5* My intention is to monitor disk usage on the partitions /var/opt/BGw/Server1 and /var/opt/mediation/ora. This is not working yet. When I log into the Nagios web interface on the monitoring server, I do not see any details (alerts and warnings) to do with the above mentioned partitions. However, I do see other details like ssh, processes, swap and even those for the / partition. If I run on the monitored host, I get some info; *bash-3.00# /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p /var/opt/BGw/Server1 -p /var/opt/mediation/ora DISK OK - free space: /var/opt/BGw/Server1 16937 MB (31% inode=94%); /var/opt/mediation/ora 9738 MB (64% inode=99%);| /var/opt/BGw/Server1=36502MB;37990;43417;0;54272 /var/opt/mediation/ora=5323MB;10997;12568;0;15711 * >From my understanding, the output above shows that the check_disk plugin actually works, however on logging into the web interface, there are no details to this effect and also no alerts/warnings at all. How do I get to see the details of these partitions in the web interface including warnings and alerts. Thanks in advance, Juki -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitoring disk usage of remote server
> I can check the disk usage locally with the command check_disk. But I need to monitor the disk usage of remote servers. Check_disk command doesn't have any option to specify the name of the remote server to be monitored... Use the NRPE add-on: http://nagios.sourceforge.net/docs/3_0/monitoring-linux.html - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitoring disk usage of remote server
Chandra, >>> jeffrey Lang 08/26/2008 11:38:05 AM >>> One way i figured out how to do this is to setup CIFS/SMB and use check_disk_smb. Share the file system out with samba which is easy to setup. Just share out as read only and limit who can access the shares. Then use check_disk_smb to verify the disk usage. Chandra Bahadur wrote: Hi I can check the disk usage locally with the command check_disk. But I need to monitor the disk usage of remote servers. Check_disk command doesn't have any option to specify the name of the remote server to be monitored... How do I monitor the disk usage of remote servers ?? _ I have SNMP running on the servers I want to check. The Nagios command I use is check_snmp_disk_monitor and it gives a response like /usr/local/groundwork/nagios/libexec/check_snmp_disk_monitor.pl -H aragorn.somewhere.wa.gov -t 30 -C notpublic -v -s -t 30 -m '/' mountpoint / has 16% used, 141426626560 bytes and 21926182912 used Disk OK - 16% used on /|/,141426626560,21926182912 You can find it in quite a few places on the web. - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ _ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null Chandra,>>> jeffrey Lang 08/26/2008 11:38:05 AM >>>One way i figured out how to do this is to setup CIFS/SMB and use check_disk_smb. Share the file system out with samba which is easy to setup. Just share out as read only and limit who can access the shares. Then use check_disk_smb to verify the disk usage.Chandra Bahadur wrote: HiI can check the disk usage locally with the command check_disk. But I need to monitor the disk usage of remote servers. Check_disk command doesn't have any option to specify the name of the remote server to be monitored...How do I monitor the disk usage of remote servers ??_ I have SNMP running on the servers I want to check. The Nagios command I use is check_snmp_disk_monitor and it gives a response like /usr/local/groundwork/nagios/libexec/check_snmp_disk_monitor.pl -H aragorn.somewhere.wa.gov -t 30 -C notpublic -v -s -t 30 -m '/'mountpoint / has 16% used, 141426626560 bytes and 21926182912 used Disk OK - 16% used on /|/,141426626560,21926182912 You can find it in quite a few places on the web. -This SF.Net email is sponsored by the Moblin Your Move Developer's challengeBuild the coolest Linux based applications with Moblin SDK & win great prizesGrand prize is a trip for two to an Open Source event anywhere in the worldhttp://moblin-contest.org/redirect.php?banner_id=100&url=""> _ ___Nagios-users mailing listNagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null- This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitoring disk usage of remote server
One way i figured out how to do this is to setup CIFS/SMB and use check_disk_smb.Share the file system out with samba which is easy to setup. Just share out as read only and limit who can access the shares. Then use check_disk_smb to verify the disk usage. Chandra Bahadur wrote: Hi I can check the disk usage locally with the command check_disk. But I need to monitor the disk usage of remote servers. Check_disk command doesn't have any option to specify the name of the remote server to be monitored... How do I monitor the disk usage of remote servers ?? - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null begin:vcard fn:Jeffrey Lang n:Lang;Jeffrey org:University of Wyoming;Geology and Geophyscis adr:;;1000 E. University Ave;Laramie;WY;72071;USA email;internet:[EMAIL PROTECTED] title:Unix/Linux Systems Admiin tel;work:307-766-3381 x-mozilla-html:TRUE url:http://home.gg.uwyo.edu version:2.1 end:vcard - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitoring disk usage of remote server
On 8/26/08, Chandra Bahadur <[EMAIL PROTECTED]> wrote: > > > Hi > I can check the disk usage locally with the command check_disk. But I > need to monitor the disk usage of remote servers. Check_disk command doesn't > have any option to specify the name of the remote server to be monitored... > > > How do I monitor the disk usage of remote servers ?? > Hey hi ! you could use snmp to do this . use te plugin from this website http://nagios.manubulon.com/snmp_storage.html All the best !! -- Regards Agnello D'souza www.linux-vashi.blogspot.com www.bible-study-india.blogspot.com - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitoring disk usage of remote server
On 26-Aug-2008, at 12:27 , Chandra Bahadur wrote: How do I monitor the disk usage of remote servers ?? By either using check_by_ssh or with a remote agent like NRPE. The check_by_ssh method looks something like this: define command { command_namecheck_disk_remote command_line$USER1$/check_by_ssh -H $HOSTADDRESS$ -C "$USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$" } The documentation for check_by_ssh and check_disk should fill in the blanks as to what's happening there. HTH, Matt PGP.sig Description: This is a digitally signed message part - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] monitoring disk usage of remote server
Hi I can check the disk usage locally with the command check_disk. But I need to monitor the disk usage of remote servers. Check_disk command doesn't have any option to specify the name of the remote server to be monitored... How do I monitor the disk usage of remote servers ?? - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null