Re: [Nagios-users] ping hanging.

2006-01-09 Thread Hammond, Robin-David%KB3IEN


I changed the permissions, but it still hangs. It did two updates in the web
interface and then froze. The web page still reloads, but the data are stale.



load averages:  1.17,  1.32,  1.27                                    06:57:54
55 processes:  1 runnable, 49 sleeping, 4 stopped, 1 on processor
CPU states:  0.0% user,  100% nice,  0.0% system,  0.0% interrupt,  0.0% idle
Memory: 118M Act, 59M Inact, 756K Wired, 8340K Exec, 159M File, 488K Free
Swap: 129M Total, 129M Free

  PID USERNAME PRI NICE   SIZE   RES STATE   TIME   WCPU    CPU COMMAND
 3378 nagios    28    0   184K  948K CPU     0:00  0.00%  0.00% top
 1345 nagios    18    0   188K    4K pause   0:00  0.00%  0.00%
 3476 nagios    10    0  2500K 1392K wait    0:00  0.00%  0.00% nagios
 1346 nagios    10    0   584K 1320K wait    0:00  0.00%  0.00% bash
 1454 nagios    10    0   140K  580K STOP    0:00  0.00%  0.00% sh
 1099 nagios     4    0   120K  680K STOP    0:00  0.00%  0.00% ping
 3771 nagios     2    0  2500K  932K STOP    0:00  0.00%  0.00% nagios
 4336 nagios     2    0   228K  724K STOP    0:00  0.00%  0.00% check_ping




I'm using the minimal.cfg file.





On Mon, 9 Jan 2006, Hugo van der Kooij wrote:


Date: Mon, 9 Jan 2006 23:21:59 +0100 (CET)
From: Hugo van der Kooij <[EMAIL PROTECTED]>
Reply-To: nagios-users@lists.sourceforge.net
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] ping hanging.

On Mon, 9 Jan 2006, Hammond, Robin-David%KB3IEN wrote:



-rw-r-----  1 nagios  nagios   1335 Jan  7 04:30 resource.cfg
-rw-rw----  1 nagios  nagios   1335 Jan  7 04:27 resource.cfg-sample

It looks like all the (relevant) files are available for reading.


And resource.cfg is not relevant? Are you 250% sure everything runs as
the nagios user?

Just for argument's sake, could you `chmod o+r` these files?

Can you trace down the config sections for us?

How did you define the service/host?
How did you define the check?
How did you define the variables?


Hugo.

--
I hate duplicates. Just reply to the relevant mailinglist.
[EMAIL PROTECTED]   http://hvdkooij.xs4all.nl/
Don't meddle in the affairs of magicians,
for they are subtle and quick to anger.


---
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue.
::: Messages without supporting info will risk being sent to /dev/null



 Microsoft: Where do you want to go tomorrow?
 Linux: Where do you want to go today?
 BSD: Are you guys coming, or what?


Robin-David Hammond KB3IEN
www.aresnyc.org.




Re: [Nagios-users] Check_disk problem on remote machines

2006-01-09 Thread Tim O'Hare

Hello Miguel,
In our checkcommands.cfg file we have:

 command_line    $USER1$/check_by_ssh -H $HOSTADDRESS$ -t 60 -n $HOSTNAME$ -s check_sda1_by_ssh -C '/usr/local/nagios/libexec/check_disk -w 20 -c 10 -p /dev/sda1'


and in our services.cfg we have:

# Check sda1 by ssh Service Definition Template
define service{
        name                    check_sda1_by_ssh
        service_description     Check_sda1_by_ssh
        check_command           check_sda1_by_ssh
        use                     generic-service

        contact_groups          support
        notification_interval   0
        notification_period     workhours
        notification_options    c,f     ; Critical, Flapping
        register                0       ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE!
        }

define service{
        use                     check_sda1_by_ssh       ; Name of service template to use

        host_name               megarouter
        }


and it works successfully.

How are you referencing it in your services.cfg file i.e. what are you 
using for the check_command?


cheers, Tim

Miguel Faria wrote:

This is a problem that I've been having for quite some time. Searching
the archives turns up a few other people reporting the same problem, but
I can't find a solution.

I am trying to use check_by_ssh to run the check_disk plugin on a few
remote systems. When I run this from the command line, it always does
the right thing. It returns the correct string and return code (checked
with 'echo $?'). It works flawlessly.

However, when this is set up to run through Nagios, it usually reports
the check in the UNKNOWN state...even though it still gives the correct
"OK..." string. Occasionally, Nagios will change it to OK or WARNING or
whatever is correct at the time, but for a good 90% of the time, it just
sits there in UNKNOWN. It doesn't seem to matter if the target machine
is Linux or Solaris, nor can I find any pattern to when and where it
happens. Again, it *always* works perfectly when run from the command
line.

I've played with giving check_by_ssh a hostname instead of an IP
address. I've tried it both with '-l nagios' and without (it's always
run from nagios account). I've also tried the -f flag (the purpose of
which I don't really understand). Nothing seems to help. For the time
being, I'm just running an SNMP client to get this info into Nagios, but
I'd really rather use check_by_ssh if at all possible.

Has anyone else seen this problem and/or found the cause of it?

Here's the exact command I'm using on the command line. This same thing
is configured in my .cfg file.

check_by_ssh -H 192.168.230.112 -C
'/home/utils/nagios/libexec/check_disk -w 10% -c 5% -p /var'

Thanks!







Re: [Nagios-users] ping hanging.

2006-01-09 Thread Hugo van der Kooij
On Mon, 9 Jan 2006, Hammond, Robin-David%KB3IEN wrote:

>
> -rw-r-----  1 nagios  nagios   1335 Jan  7 04:30 resource.cfg
> -rw-rw----  1 nagios  nagios   1335 Jan  7 04:27 resource.cfg-sample
>
> It looks like all the (relevant) files are available for reading.

And resource.cfg is not relevant? Are you 250% sure everything runs as
the nagios user?

Just for argument's sake, could you `chmod o+r` these files?

Can you trace down the config sections for us?

How did you define the service/host?
How did you define the check?
How did you define the variables?


Hugo.

-- 
I hate duplicates. Just reply to the relevant mailinglist.
[EMAIL PROTECTED]   http://hvdkooij.xs4all.nl/
Don't meddle in the affairs of magicians,
for they are subtle and quick to anger.




Re: [Nagios-users] ping hanging.

2006-01-09 Thread Hammond, Robin-David%KB3IEN


HRM:

drwxrwxr-x  2 nagios  nagios    512 Jan  7 04:40 .
drwxr-xr-x  9 root    wheel     512 Jan  7 04:27 ..
-rw-rw-r--  1 nagios  nagios  30051 Jan  7 04:27 bigger.cfg-sample
-rw-r--r--  1 root    nagios   9580 Jan  7 04:39 cgi.cfg
-rw-rw-r--  1 nagios  nagios   9580 Jan  7 04:27 cgi.cfg-sample
-rw-r--r--  1 nagios  nagios   4826 Jan  7 04:31 checkcommands.cfg
-rw-rw-r--  1 nagios  nagios   4826 Jan  7 04:27 checkcommands.cfg-sample
-rw-r--r--  1 root    nagios     47 Jan  7 04:55 htpasswd.users
-rw-r--r--  1 nagios  nagios  13789 Jan  7 04:35 minimal.cfg
-rw-rw-r--  1 nagios  nagios  13757 Jan  7 04:27 minimal.cfg-sample
-rw-r--r--  1 nagios  nagios   4305 Jan  7 04:31 misccommands.cfg
-rw-rw-r--  1 nagios  nagios   4305 Jan  7 04:27 misccommands.cfg-sample
-rw-r--r--  1 nagios  nagios  30780 Jan  7 04:30 nagios.cfg
-rw-rw-r--  1 nagios  nagios  30780 Jan  7 04:27 nagios.cfg-sample
-rw-r-----  1 nagios  nagios   1335 Jan  7 04:30 resource.cfg
-rw-rw----  1 nagios  nagios   1335 Jan  7 04:27 resource.cfg-sample

It looks like all the (relevant) files are available for reading.

I am compiling lsof now.

Thanks for the suggestion.



On Sun, 8 Jan 2006, Arno Lehmann wrote:


Date: Sun, 08 Jan 2006 13:24:51 +0100
From: Arno Lehmann <[EMAIL PROTECTED]>
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] ping hanging.

Hello,

On 1/7/2006 10:32 PM, Marc Powell wrote:



-Original Message-
From: [EMAIL PROTECTED] [mailto:nagios-users-
[EMAIL PROTECTED] On Behalf Of Hammond, Robin-David%KB3IEN
Sent: Friday, January 06, 2006 10:02 PM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] ping hanging.


...


Warning: Attempting to execute the command "/check_ping -H 4400::34 -w
3000.0,80% -c 5000.0,100% -p 1 -t 10 -6" resulted in a return code of 127.
Make sure the script or binary you are trying to execute actually
exists...

over and over.

this is particularly confusing as I never reran configure nor did I ever
uninstall the /check_ping programme; it is still in place and executable.


Hmm. I *really* hope you never installed the plugins in your root directory.


Any ideas?




Return code 127 means that the plugin was not found in the location
specified in the command definition. Nagios is apparently trying to
execute /check_ping and not /path/to/check_ping (usually
/usr/local/nagios/libexec/check_ping). There are four primary
possibilities --
1) your command definition does not utilize the $USER1$ macro in
   the command_line i.e. 'command_line $USER1$/check_ping -H ...'
2) you have manually specified the incorrect path in the
   command_line i.e. 'command_line /check_ping -H ...'
3) $USER1$ is not set to the correct plugins path in
   resource.cfg
4) you have not included resource.cfg as a resource_file in
   nagios.cfg
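
The 127 status itself can be reproduced outside Nagios. The following is only a minimal sketch, assuming a POSIX /bin/sh is available; the helper name exit_status and the path below are made up for illustration and are not part of Nagios or its plugins. It shows that asking the shell to run a binary at a path that does not exist yields 127, which is what an empty $USER1$ would produce for "/check_ping".

```python
import subprocess


def exit_status(command_line):
    """Run command_line through /bin/sh and return its exit status."""
    result = subprocess.run(
        ["/bin/sh", "-c", command_line],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode


# Hypothetical path, chosen only because it should not exist anywhere.
missing = "/no-such-dir-for-this-example/check_ping -H 127.0.0.1"
print(exit_status(missing))
```

Running the exact command line from the Nagios log through the same helper, as the nagios user, is one way to tell a path problem apart from a plugin problem.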


One other possibility I encountered:
When the resource.cfg file is not readable by nagios, the above happens after
nagios reloaded its configuration. The initial configuration file access
happens before nagios drops its rights (when started as root), but
afterwards, during operation, it's simply no longer permitted to read the
resource.cfg... The easy work-around is a chown on the file followed by
another configuration reload.
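
Whether the mode bits in a listing actually let a given user read the file can be checked mechanically. This is a rough sketch under stated assumptions: the function names are invented for this example, the uid/gid numbers in the comments are placeholders, and it looks only at the classic owner/group/other bits (no ACLs, no special case for root).

```python
import os
import stat


def mode_grants_read(mode, file_uid, file_gid, uid, gids):
    """Apply the classic owner/group/other rule to decide read access."""
    if uid == file_uid:
        return bool(mode & stat.S_IRUSR)
    if file_gid in gids:
        return bool(mode & stat.S_IRGRP)
    return bool(mode & stat.S_IROTH)


def path_readable_by(path, uid, gids):
    """Same check, taking the mode and ownership from the file itself."""
    st = os.stat(path)
    return mode_grants_read(st.st_mode, st.st_uid, st.st_gid, uid, gids)


# Mode 0640 (-rw-r-----): owner and group can read, everyone else cannot.
print(mode_grants_read(0o640, 1000, 1000, 1000, set()))    # owner
print(mode_grants_read(0o640, 1000, 1000, 1001, {1000}))   # group member
print(mode_grants_read(0o640, 1000, 1000, 1001, {1002}))   # other
```

For a real check, the uid and group ids of the account the daemon runs as would be passed to path_readable_by together with the path of resource.cfg.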


Arno


--
Marc






--
IT-Service Lehmann[EMAIL PROTECTED]
Arno Lehmann  http://www.its-lehmann.de






 Microsoft: Where do you want to go tomorrow?
 Linux: Where do you want to go today?
 BSD: Are you guys coming, or what?


Robin-David Hammond KB3IEN
www.aresnyc.org.



Re: [Nagios-users] nagios do not show "Service Status Details For All Hosts"

2006-01-09 Thread Hugo van der Kooij
On Mon, 9 Jan 2006, Matteo Mancini wrote:

> ..I've restarted my nagios box, then I've started nagios manually.
> My "distribution tools" is a line in my rc.local "nagios -d
> /etc/nagios/nagios.cfg". I suppose that this method could be a fresh
> nagios startup.

So what output do you get from `ps ax|grep nagios` on each step? I for one
would like to see them instead of your interpretation.

You might actually have more than just that line in rc.local if you
installed packages.

Hugo.

-- 
I hate duplicates. Just reply to the relevant mailinglist.
[EMAIL PROTECTED]   http://hvdkooij.xs4all.nl/
Don't meddle in the affairs of magicians,
for they are subtle and quick to anger.




Re: [Nagios-users] nagios do not show "Service Status Details For All Hosts"

2006-01-09 Thread Matteo Mancini
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

>
>> Use your distributions tool to stop service processes. Wait some time.
>> Issue something like 'killall nagios; sleep 20; killall -9 nagios'.
>> Verify with ps that no more nagios processes are running. Restart
>> nagios, using your distributions tool.

..I've restarted my nagios box, then I've started nagios manually.
My "distribution tools" is a line in my rc.local "nagios -d
/etc/nagios/nagios.cfg". I suppose that this method could be a fresh
nagios startup.


>
>> Or, alternatively, send me your servers address and grant root access to
>> me... (Invoice would follow, of course).

It will be the next step :-)

>
>> Arno
>

bye

Matteo
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.1 (GNU/Linux)

iD8DBQFDwssG/TjXD9LUVswRAsKhAKCQTB4Kaqt9V3FZQaosKoLgcGF1cwCeOWRK
zentd8LfLBrH0M4+dpEyEXc=
=mU52
-END PGP SIGNATURE-




Re: [Nagios-users] nagios do not show "Service Status Details For All Hosts"

2006-01-09 Thread Matteo Mancini
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Hi list..

>>But I don't understand how can I resolve it ?? I've read other child
>>faqs but I couldn't find any interesting.
> 
> 
> Stop nagios, use 'ps' to see if there are any nagios processes still
> running, kill them if there are and restart nagios.


I've just done it, but with no results..


Thanks

Matteo
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.1 (GNU/Linux)

iD8DBQFDwskA/TjXD9LUVswRAuJeAJ9YAq5U6FP8Dd0v3GTkn2YwNR7KvgCfbaQ5
m9L1qDr+LThRWgfWUmipO00=
=lrjV
-END PGP SIGNATURE-




Re: [Nagios-users] nagios do not show "Service Status Details For All Hosts"

2006-01-09 Thread Arno Lehmann

Hello,

On 1/9/2006 9:07 PM, Matteo Mancini wrote:

-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Hi ..

thanks for your response..this look like my problem.





It's probably this FAQ --

http://www.nagios.org/faqs/viewfaq.php?faq_id=21&expand=false&showdesc=true





But I don't understand how can I resolve it ?? I've read other child
faqs but I couldn't find any interesting.


I quote the mentioned FAQ text:

"Kill all Nagios processes and restart Nagios. This will ensure that 
only one process is running and that the current object definitions are 
loaded."


What's so difficult to understand here?

Use your distribution's tool to stop service processes. Wait some time.
Issue something like 'killall nagios; sleep 20; killall -9 nagios'.
Verify with ps that no more nagios processes are running. Restart
nagios, using your distribution's tool.
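
The "verify with ps" step can be made less error-prone than eyeballing the output. The following is only a sketch: the function name is invented, the column layout assumed is the PID/TTY/STAT/TIME/COMMAND form that `ps ax` prints on Linux, and the sample output (including the binary path) is made up for illustration.

```python
def nagios_pids(ps_output, name="nagios"):
    """Return PIDs from `ps ax`-style output whose executable is `name`."""
    pids = []
    for line in ps_output.splitlines()[1:]:  # skip the header row
        fields = line.split(None, 4)         # PID TTY STAT TIME COMMAND
        if len(fields) < 5:
            continue
        pid, command = fields[0], fields[4]
        # Compare the basename of the executable, not the whole command
        # line, so that "grep nagios" does not count as a match.
        executable = command.split()[0].rsplit("/", 1)[-1]
        if executable == name:
            pids.append(int(pid))
    return pids


# Hypothetical sample, not taken from the thread.
SAMPLE = """\
  PID TTY      STAT   TIME COMMAND
 3476 ?        Ss     0:00 /usr/local/nagios/bin/nagios -d /etc/nagios/nagios.cfg
 3771 ?        S      0:00 /usr/local/nagios/bin/nagios -d /etc/nagios/nagios.cfg
 4000 pts/0    S+     0:00 grep nagios
"""
print(nagios_pids(SAMPLE))
```

An empty list after the stop and kill steps means it is safe to start Nagios again; anything left in the list is a leftover process.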


Or, alternatively, send me your server's address and grant root access to
me... (Invoice would follow, of course).


Arno



thanks

Matteo

-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.1 (GNU/Linux)

iD8DBQFDwsKX/TjXD9LUVswRAvKFAJwKS8hq414Q/HO63V4Hz+xvIrvingCfTvYo
jf3WGQq4+rmJxUVWSuGiDfM=
=Ax8i
-END PGP SIGNATURE-






--
IT-Service Lehmann[EMAIL PROTECTED]
Arno Lehmann  http://www.its-lehmann.de




RE: [Nagios-users] nagios do not show "Service Status Details For All Hosts"

2006-01-09 Thread Marc Powell


> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of Matteo Mancini
> Sent: Monday, January 09, 2006 2:08 PM
> To: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] nagios do not show "Service Status Details
> For All Hosts"
> 
> -BEGIN PGP SIGNED MESSAGE-
> Hash: SHA1
> 
> Hi ..
> 
> thanks for your response..this look like my problem.
> 
> 
> 
> >
> > It's probably this FAQ --
> >
> >
> > http://www.nagios.org/faqs/viewfaq.php?faq_id=21&expand=false&showdesc=true
> >
> 
> 
> But I don't understand how can I resolve it ?? I've read other child
> faqs but I couldn't find any interesting.

Stop nagios, use 'ps' to see if there are any nagios processes still
running, kill them if there are and restart nagios.

--
marc




Re: [Nagios-users] nagios do not show "Service Status Details For All Hosts"

2006-01-09 Thread Matteo Mancini
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Hi ..

Thanks for your response. This looks like my problem.



>
> It's probably this FAQ --
>
> http://www.nagios.org/faqs/viewfaq.php?faq_id=21&expand=false&showdesc=true
>


But I don't understand how I can resolve it. I've read the other child
FAQs but I couldn't find anything relevant.


thanks

Matteo

-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.1 (GNU/Linux)

iD8DBQFDwsKX/TjXD9LUVswRAvKFAJwKS8hq414Q/HO63V4Hz+xvIrvingCfTvYo
jf3WGQq4+rmJxUVWSuGiDfM=
=Ax8i
-END PGP SIGNATURE-




[Nagios-users] Escalation questions

2006-01-09 Thread Justin Shore
Greetings.  I'm trying to set up a quick escalation procedure for 3
hosts.  Host Down alerts are generated for these hosts on a daily basis
due to the load on the first of the 3 items.  This particular problem
always resolves itself by the time next check happens (causing a host
down alert followed by a host up alert 2 minutes later).

What I'd like to do is use an escalation procedure with the first
contact being the user "null" I created on my Nagios box for the
explicit purpose of directing mail to /dev/null (this works already with
a few other contacts).  I've already created a "null" contactgroup for
this purpose.  I'd like the normal alerting process to start again after
15 minutes if the host(s) is still down.  This would prevent half a
dozen people from being paged daily with what equates to a bogus alert
(causing them to slowly but surely stop paying as much attention to the
alerts over time).  I'm a bit confused on how to implement this though.
Here's what I've got so far:

define hostescalation{
host_name   cmts-1.clr
contact_groups  isp-net-admins,cmts-admins
first_notification  1
notification_interval   15
escalation_period   24x7
escalation_options  d,u,r
}

isp-net-admins and cmts-admins are the normal contactgroups for these
hosts.  "null" is the contact(group) I want to use for the first 15
minutes before the escalation takes place on these 3 hosts.  I'm a
little confused on the notification_interval options.  My hosts/services
are by default running at 180 (minutes).  Will the notification_interval
in this hostescalation definition override that?  I'm not sure if I'm
taking the right approach to this or not so I welcome any and all
suggestions.

I've been reviewing the online docs and have made good headway.  I'm
stuck at the moment though.

Thanks 
  Justin

--
Justin Shore, Network Engineer
SKT Business Communication Solutions
Office:  (316) 265-9555 x3217 






Re: [Nagios-users] notification via YM

2006-01-09 Thread Jason Martin
On Mon, Jan 09, 2006 at 02:17:17AM -0800, Nelson Serafica wrote:
> Is there an addon that, when a host goes down, will send a message to a
> Yahoo Messenger account?
Not directly, but I believe there is an addon to Jabber that
can send via YM. You'd have to set up the Jabber server and a
notification method that sends it through Jabber.

-Jason Martin
-- 
How do you write zero in Roman numerals?
This message is PGP/MIME signed.




Re: [Nagios-users] Dependency issue

2006-01-09 Thread John P. Rouillard

In message <[EMAIL PROTECTED]>,
Deborah Martin writes:
>It seems that its very dependent on when the next scheduled check will be.
>If I set max_check_attempts to 5 for services
>B or C, and then have max_check_attempts set to 1 on service A, it makes no
>difference. Service B or C could still
>be scheduled to be checked before service A. In fact, on the first check
>attempt for service B, for example, a critical 
>alert is generated. 
>
>I want Nagios to detect that Service B or C is a dependency on Service A,
>and then if Service A is alerting, suppress alerts for Service B or C. 
>But nagios doesn't seem to do any kind of dependency lookup if a dependency
>is already alerting. Kind of defeats, in my view, the logic
>of dependencies. There is no point in Nagios telling me that Service B or C
>is down, when Service A is down. Service A must be UP for any of its
>dependencies to work. 
>
>If however, Service A is OK, then yes - Service B or C should alert if
>necessary. 
>
>If I were to have 20 dependencies against a master service, I don't want 20
>potential alerts being generated! Plus I want to know the EXACT cause of the
>problem not have to hunt it down every time. Isn't that one of the points of
>Nagios - to be able to finely tune plugin checks so that we can 
>easily investigate a problem ? It seems that I needn't bother with
>dependencies if this is how it's going to work. 
>
>Does anyone else agree / disagree because it seems to me that either the
>docs are not explaining dependencies correctly, or Nagios 
>is not behaving correctly. I would appreciate feedback on this issue. 

Try reading the Nagios docs on soft_state_dependencies, then try setting

   soft_state_dependencies=1

in nagios.cfg.

If A depends on B (aka no alert on A when B in trouble) make sure that:

   retry_check_interval(A) * max_check_attempts(A) >
   normal_check_interval(B)

That should guarantee that at least one check of B occurs during the
trouble interval for A. Without the soft_state_dependencies you must use:

   retry_check_interval(A) * max_check_attempts(A) >
   normal_check_interval(B) +
   (retry_check_interval(B) * max_check_attempts(B))

to make sure that B goes into a hard state during the trouble interval
of A.
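
The two inequalities above are easy to get wrong by hand, so here is a small sketch that evaluates them. The function and parameter names are invented for this example, all intervals are assumed to be in the same unit (minutes here), and the numbers used are illustrative only, not taken from anyone's configuration.

```python
def dependency_window_ok(retry_a, attempts_a, normal_b,
                         retry_b=0, attempts_b=0,
                         soft_state_dependencies=True):
    """Check that B gets checked (or reaches a hard state) while A is retrying.

    A is the dependent service, B the service it depends on.
    """
    trouble_window = retry_a * attempts_a
    required = normal_b
    if not soft_state_dependencies:
        # B must also run through its own retries to reach a hard state.
        required += retry_b * attempts_b
    return trouble_window > required


# With soft_state_dependencies=1: 1 * 5 > 3
print(dependency_window_ok(1, 5, 3))
# Without it: 1 * 5 > 3 + (1 * 3) does not hold
print(dependency_window_ok(1, 5, 3, retry_b=1, attempts_b=3,
                           soft_state_dependencies=False))
```

If the second form fails, either the retry interval or max_check_attempts of A has to go up, or the check intervals of B have to come down.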

-- rouilj
John Rouillard
===
My employers don't acknowledge my existence much less my opinions.




RE: [Nagios-users] Service Check Timed Out

2006-01-09 Thread Marc Powell


> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of Alexandru Ionica
> Sent: Monday, January 09, 2006 8:52 AM
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] Service Check Timed Out
> 
> Hello,
> this is the error i see in the cgi web interface.
> in the error log i see stuff like:
> 
> 
> SERVICE NOTIFICATION:
> noc;host-aaa;FPING;CRITICAL;notify-by-email;(Service Check Timed Out)
> 
> SERVICE ALERT: host-bbb;FPING;CRITICAL;HARD;3;(Service Check Timed Out)
> 
> SERVICE ALERT: host-ccc;FPING;CRITICAL;SOFT;1;(Service Check Timed Out)
> 
> 
> The FPING service is a Perl script I wrote which does an fping and an
> snmpwalk. In the worst-case scenario this should take around 30 seconds
> to run.
> 
> I've set service_check_timeout=90 in the main (nagios.cfg) config file.
> The machine is under medium to heavy load, but I can't understand why
> it's getting a timeout, as 90 seconds seems more than enough for any of
> the scripts to run.

Does it work for any host/service? Did you perform your testing of your
script as the nagios user? I presume that nagios was restarted or
reloaded after modifying the timeout above.

> Is there any possibility that this is generated by the check-host-alive
> plugin and it escalates ? or something like that ... ?

No, the output information from a host check is not down-populated to
all its services. The check really is timing out.
 
> any ideas ?

I would log entry and exit times from your script to see if it really is
taking >90 seconds to run as well as log important milestones within the
script (fping start, fping output, fping end, snmpwalk start, output and
end, etc). I would also verify that it runs as expected by running the
command definition exactly as specified as the nagios user.
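
The milestone logging suggested above could look roughly like this. The original check is a Perl script, so this Python version is only an illustration of the idea; the class name, the label strings and the output format are all made up for the example.

```python
import time


class MilestoneLog:
    """Record how many seconds have elapsed at each named milestone."""

    def __init__(self, clock=time.monotonic):
        self._clock = clock
        self._start = clock()
        self.entries = []

    def mark(self, label):
        """Store and return the seconds elapsed since the log was created."""
        elapsed = self._clock() - self._start
        self.entries.append((label, elapsed))
        return elapsed

    def lines(self):
        """Format the recorded milestones, one per line."""
        return ["%8.3fs %s" % (elapsed, label)
                for label, elapsed in self.entries]


log = MilestoneLog()
log.mark("script start")
# ... run fping here, then log.mark("fping end") ...
# ... run snmpwalk here, then log.mark("snmpwalk end") ...
log.mark("script end")
print("\n".join(log.lines()))
```

Appending these lines to a file the nagios user can write to, on every run, shows whether any single step is what pushes the total past the 90 second timeout.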

--
Marc




Re: [Nagios-users] downtime not working

2006-01-09 Thread John Jolet




Are you also supplying the required comment? That's not automatically
populated.

of course not :)  that did the trick.  and sorry about the posting,  
for some reason, this is the only list that I have to do a reply-all  
on to get back to the list.

--
Marc






RE: [Nagios-users] downtime not working

2006-01-09 Thread Marc Powell
Please always respond on-list so that others may benefit from your
experience.

> -Original Message-
> From: John Jolet [mailto:[EMAIL PROTECTED]
> Sent: Monday, January 09, 2006 10:00 AM
> To: Marc Powell
> Subject: Re: [Nagios-users] downtime not working
> 
> 
> On Jan 9, 2006, at 9:53 AM, Marc Powell wrote:
> 
> >
> >
> >> -Original Message-
> >> From: [EMAIL PROTECTED] [mailto:nagios-users-
> >> [EMAIL PROTECTED] On Behalf Of John Jolet
> >> Sent: Monday, January 09, 2006 9:27 AM
> >> Cc: nagios-users@lists.sourceforge.net
> >> Subject: Re: [Nagios-users] downtime not working
> >>
> >
> > [removed OP]
> >
> >>
> >> I'm also unable to do downtime, but I just get "an error occurred
> >> processing your command".  the rw directory is writeable by apache,
> > which
> >> is what my server runs as, and the pipe is group writable by
apache.
> > I
> >> can't find an error either in the apache logs, nor the nagios logs.
> >
> > This indicates that nagios received the external command but was
> > unable
> > to completely parse it. Typically this is caused by leaving out a
> > required field (i.e. one shown as red). Have you supplied all the
> > required information in the correct format?
> >
> went to a problem that currently exists, clicked on the service, then
> on "schedule downtime for this service" and let it poplulate the
> fields...

Are you also supplying the required comment? That's not automatically
populated.
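
For reference, the downtime form ends up as a single external-command line written to the command file. The following is a minimal sketch of how such a line can be assembled, assuming the field order documented for SCHEDULE_SVC_DOWNTIME (host, service, start, end, fixed, trigger id, duration, author, comment); check it against the external command documentation for your Nagios version. The function name and the sample host, service, and author values are only illustrative.

```python
import time


def build_svc_downtime_command(host, service, start, end, author, comment,
                               fixed=True, trigger_id=0, duration=None,
                               now=None):
    """Return one SCHEDULE_SVC_DOWNTIME line for the external command file.

    Field order is an assumption taken from the Nagios external command
    documentation; verify it for the version you run.
    """
    if not comment:
        # Mirrors the symptom in this thread: a missing required field
        # leaves Nagios unable to parse the command.
        raise ValueError("comment is required")
    if duration is None:
        duration = end - start
    timestamp = int(time.time()) if now is None else now
    fields = [host, service, start, end, int(fixed), trigger_id,
              duration, author, comment]
    return "[%d] SCHEDULE_SVC_DOWNTIME;%s\n" % (
        timestamp, ";".join(str(field) for field in fields))
```

Writing the returned line to the command file (/usr/local/nagios/var/rw/nagios.cmd in this thread) is what the CGI does on your behalf; leaving the comment empty is the case that produced "an error occurred processing your command" here.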

--
Marc




RE: [Nagios-users] downtime not working

2006-01-09 Thread Marc Powell


> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of John Jolet
> Sent: Monday, January 09, 2006 9:27 AM
> Cc: nagios-users@lists.sourceforge.net
> Subject: Re: [Nagios-users] downtime not working
> 

[removed OP]

> I'm also unable to do downtime, but I just get "an error occurred
> processing your command".  the rw directory is writeable by apache,
> which is what my server runs as, and the pipe is group writable by
> apache. I can't find an error either in the apache logs, nor the
> nagios logs.

This indicates that nagios received the external command but was unable
to completely parse it. Typically this is caused by leaving out a
required field (i.e. one shown as red). Have you supplied all the
required information in the correct format?

--
Marc




Re: [Nagios-users] downtime not working

2006-01-09 Thread John Jolet
On Jan 9, 2006, at 8:27 AM, Rafael Bandeira da Costa wrote:

> Hello
>
> Your webserver user must have write access on /usr/local/nagios/var/rw
> directory. Have you set up the nagios command group, and added the
> webserver user to that group?
>
> Then you would chown nagios.nagcmd /usr/local/nagios/var/rw and
> chmod g+x /usr/local/nagios/var/rw.
>
> Just take a close look to adapt it to your needs (nagios homedir,
> nagios user and nagios command group).
>
> --
> Cheers,
> Rafael Costa
> [EMAIL PROTECTED]
> Curitiba / PR / Brasil

I'm also unable to do downtime, but I just get "an error occurred
processing your command". The rw directory is writeable by apache, which
is what my server runs as, and the pipe is group writable by apache. I
can't find an error in either the apache logs or the nagios logs.

> Nelson Serafica wrote:
>
>> Whenever I try to schedule downtime, it shows:
>>
>> Error: Could not open command file
>> '/usr/local/nagios/var/rw/nagios.cmd' for update!
>> The permissions on the external command file and/or directory may be
>> incorrect. Read the FAQs on how to setup proper permissions.
>> An error occurred while attempting to commit your command for
>> processing.
>>
>> I ran "chown -R nagios.nagios " just to make sure everything is under
>> the nagios user. I checked cgi and put "noc" (because this is the user
>> that gets in to my web interface) to have access in all external cgi.
>>
>> What might be the cause of the problem?

[Nagios-users] Service Check Timed Out

2006-01-09 Thread Alexandru Ionica
Hello,
this is the error i see in the cgi web interface.
in the error log i see stuff like:


SERVICE NOTIFICATION:
noc;host-aaa;FPING;CRITICAL;notify-by-email;(Service Check Timed Out)

SERVICE ALERT: host-bbb;FPING;CRITICAL;HARD;3;(Service Check Timed Out)

SERVICE ALERT: host-ccc;FPING;CRITICAL;SOFT;1;(Service Check Timed Out)


The FPING service is a perl script I wrote which does an fping and an
snmpwalk. In the worst case scenario this should take around 30 seconds
to run.

I've set service_check_timeout=90 in the main (nagios.cfg) config file.
The machine is under medium to heavy load, but I can't understand why
it's getting a timeout, as 90 seconds seems more than enough for any of
the scripts to run.
Is there any possibility that this is generated by the check-host-alive
plugin and it escalates, or something like that?
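
One way to test the 30-second assumption is to time the check by hand, as the nagios user and while the machine is under its usual load. The following is a minimal sketch using only the Python standard library; the function name is made up for illustration, and the command you pass in (the path to the perl script and its arguments) is yours to fill in.

```python
import subprocess
import sys
import time


def time_check(argv, timeout_seconds):
    """Run a check command and report how long it took.

    Returns (exit_code, elapsed_seconds, timed_out); exit_code is None
    when the command had to be killed at the timeout.
    """
    started = time.monotonic()
    try:
        completed = subprocess.run(argv, stdout=subprocess.PIPE,
                                   stderr=subprocess.PIPE,
                                   timeout=timeout_seconds)
        return completed.returncode, time.monotonic() - started, False
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising this exception.
        return None, time.monotonic() - started, True
```

Calling time_check with the same argv Nagios uses and timeout_seconds=90 for a few dozen runs would show whether individual runs really stay near 30 seconds or occasionally exceed the configured service_check_timeout.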

The nagios configuration is pretty simple, some of the more important
things are:

inter_check_delay_method=s
service_interleave_factor=s
max_concurrent_checks=0
service_reaper_frequency=10
service_check_timeout=90
host_check_timeout=90
event_handler_timeout=30
notification_timeout=30
ocsp_timeout=15
perfdata_timeout=15
check_for_orphaned_services=0
check_service_freshness=1
freshness_check_interval=60
==

The service template is:

==
define service{
        name                            generic-ping2
        active_checks_enabled           1
        passive_checks_enabled          1
        parallelize_check               1
        obsess_over_service             1
        check_freshness                 0
        notifications_enabled           1
        event_handler_enabled           1
        flap_detection_enabled          1
        process_perf_data               1
        retain_status_information       1
        retain_nonstatus_information    1
        service_description             FPING
        is_volatile                     0
        check_period                    24x7
        max_check_attempts              3
        normal_check_interval           5
        retry_check_interval            1
        contact_groups                  noc-acm
        notification_interval           1800
        notification_period             24x7
        notification_options            c,r
        register                        0
        }


Any ideas?




RE: [Nagios-users] nagios do not show "Service Status Details For All Hosts"

2006-01-09 Thread Marc Powell


> -Original Message-
> From: [EMAIL PROTECTED] [mailto:nagios-users-
> [EMAIL PROTECTED] On Behalf Of Matteo Mancini
> Sent: Sunday, January 08, 2006 1:59 PM
> To: nagios-users@lists.sourceforge.net
> Subject: [Nagios-users] nagios do not show "Service Status Details For
> All Hosts"
>
> -BEGIN PGP SIGNED MESSAGE-
> Hash: SHA1
>
> Hi List..
> I've got a problem with this nagios web interface: when I try to view
> the pages "Service Status Details For All Hosts" and "Status Map", the
> first shows only half the services and the second (status map) does not
> appear. These two problems do not always happen, but very often.

It's probably this FAQ --

http://www.nagios.org/faqs/viewfaq.php?faq_id=21&expand=false&showdesc=true

--
Marc 




Re: [Nagios-users] Check_disk problem on remote machines

2006-01-09 Thread Assaf Flatto


have you tried using the nagios-statd plugin ?

http://www.freshports.org/sysutils/nagios-statd/

i use it to monitor both disk usage and workload .

Assaf


Miguel Faria wrote:

This is a problem that I've been having for quite some time. Searching
the archives turns up a few other people reporting the same problem, but
I can't find a solution.

I am trying to use check_by_ssh to run the check_disk plugin on a few
remote systems. When I run this from the command line, it always does
the right thing. It returns the correct string and return code (checked
with 'echo $?'). It works flawlessly.

However, when this is set up to run through Nagios, it usually reports
the check in the UNKNOWN state...even though it still gives the correct
"OK..." string. Occasionally, Nagios will change it to OK or WARNING or
whatever is correct at the time, but for a good 90% of the time, it just
sits there in UNKNOWN. It doesn't seem to matter if the target machine
is Linux or Solaris, nor can I find any pattern to when and where it
happens. Again, it *always* works perfectly when run from the command
line.

I've played with giving check_by_ssh a hostname instead of an IP
address. I've tried it both with '-l nagios' and without (it's always
run from nagios account). I've also tried the -f flag (the purpose of
which I don't really understand). Nothing seems to help. For the time
being, I'm just running an SNMP client to get this info into Nagios, but
I'd really rather use check_by_ssh if at all possible.

Has anyone else seen this problem and/or found the cause of it?

Here's the exact command I'm using on the command line. This same thing
is configured in my .cfg file.

check_by_ssh -H 192.168.230.112 -C
'/home/utils/nagios/libexec/check_disk -w 10% -c 5% -p /var'

Thanks!



--
Assaf Flatto
m-Wise Operations
Cellular: +972-54-5679230
e-mail: [EMAIL PROTECTED]
MSN : [EMAIL PROTECTED]




Re: [Nagios-users] disk_check for a remote machine

2006-01-09 Thread Yogesh Hasabnis
Hi All,

I managed to configure check_disk for a remote machine using the NRPE
addon. It seemed a little complex earlier, but I found that it's quite
simple to set up once I actually did it.

Thanks

Yogesh

On 1/5/06, Yogesh Hasabnis <[EMAIL PROTECTED]> wrote:
> Hi All,
>
> Thanks for all the suggestions. I will try out the nrpe option today
> and give feedback. If required, I will also try the other options that
> have been suggested.
>
> Thanks
>
> Yogesh
>
> On 1/3/06, Lori Adams <[EMAIL PROTECTED]> wrote:
>> Also look at nagios' brother, nrpe. Your nagios server calls check_nrpe,
>> which contacts the remote server and runs a command on the remote
>> machine, returning an informative output and exit status.
>> -Lori
>>
>>> -Original Message-
>>> From: [EMAIL PROTECTED] [mailto:nagios-users-
>>> [EMAIL PROTECTED]] On Behalf Of Morten Bekkelund
>>> Sent: Monday, January 02, 2006 4:22 AM
>>> To: nagios-users@lists.sourceforge.net
>>> Subject: RE: [Nagios-users] disk_check for a remote machine
>>>
>>> Yeah, check out www.net-snmp.org.
>>> Install net-snmp on the machines you wanna monitor and use the
>>> functionality it provides for diskchecks. It writes the result to a
>>> table in snmp and returns the result when it's being polled from the
>>> nagios-server.




RE: [Nagios-users] Dependency issue

2006-01-09 Thread Deborah Martin
thanks for the tip but this doesn't work. 

It seems that it's very dependent on when the next scheduled check will be.
If I set max_check_attempts to 5 for services
B or C, and then have max_check_attempts set to 1 on service A, it makes no
difference. Service B or C could still
be scheduled to be checked before service A. In fact, on the first check
attempt for service B, for example, a critical 
alert is generated. 

I want Nagios to detect that Service B or C is a dependency on Service A,
and then if Service A is alerting, suppress alerts for Service B or C. 
But nagios doesn't seem to do any kind of dependency lookup if a dependency
is already alerting. Kind of defeats, in my view, the logic
of dependencies. There is no point in Nagios telling me that Service B or C
is down, when Service A is down. Service A must be UP for any of its
dependencies to work. 

If however, Service A is OK, then yes - Service B or C should alert if
necessary. 
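
To make the behaviour I am asking for concrete, here is a small sketch of the decision rule in Python. It is only a model of the desired outcome, not of how Nagios evaluates dependencies internally, and it ignores check scheduling entirely (which is what Tedman's reply points to as the source of the ordering problem). The function and variable names are made up for illustration.

```python
def should_notify(service, states, masters):
    """Decide whether a service should alert under the desired rule.

    states  maps a service name to "OK", "WARNING", "CRITICAL" or "UNKNOWN".
    masters maps a dependent service name to the services it depends on.
    A service alerts only if it is not OK and every master is OK.
    """
    if states[service] == "OK":
        return False
    return all(states[master] == "OK" for master in masters.get(service, []))


# Service B and Service C depend on Service A, as in the diagram from my
# first message.
masters = {"Service B": ["Service A"], "Service C": ["Service A"]}
```

With Service A and Service B both CRITICAL, the rule alerts for Service A only; with Service A OK and Service B CRITICAL, it alerts for Service B.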

If I were to have 20 dependencies against a master service, I don't want 20
potential alerts being generated! Plus I want to know the EXACT cause of the
problem not have to hunt it down every time. Isn't that one of the points of
Nagios - to be able to finely tune plugin checks so that we can 
easily investigate a problem ? It seems that I needn't bother with
dependencies if this is how it's going to work. 

Does anyone else agree / disagree because it seems to me that either the
docs are not explaining dependencies correctly, or Nagios 
is not behaving correctly. I would appreciate feedback on this issue. 

regards,
deborah 




-Original Message-
From: Tedman Eng [mailto:[EMAIL PROTECTED]
Sent: 06 January 2006 20:35
To: 'Deborah Martin'; 'nagios-users@lists.sourceforge.net'
Subject: RE: [Nagios-users] Dependency issue


If Service B happens to be queued to be checked before Service A is, you'll
get notified of its problem first, so that's why you only sometimes
get this behavior.

As for simple solution to your question, increase the max_check_attempts of
Service B/C, so that Service A will error out before Service B/C does, thus
skirting the check queue phenomenon.


-Original Message-
From: Deborah Martin [mailto:[EMAIL PROTECTED]
Sent: Friday, January 06, 2006 3:11 AM
To: 'nagios-users@lists.sourceforge.net'
Subject: [Nagios-users] Dependency issue


Hi, 

I have setup dependencies which all seem to work fine to a point. 

Example : I have 1 host with 3 services 

HOST A
  |
Service A (master service)
  |
  +-------------+
  |             |
Service B   Service C

If service A  fails, checks and notifications are suppressed for Service B
and Service C which is fine. 

However, I've had a couple of situations where Service B or Service C may
fail first, generate an alert AND then Service A fails, which also generates
an alert.
Is there a way in Nagios to flip over to only alerting for Service A
in this situation and suppress anything for Service B and Service C until
recovery occurs?

Do I need to define further service dependencies or use inheritance of
parent ? I've read the docs (several times) but can't seem to find the
answers there. 
   
I'm using Nagios 2.0b4 / Suse 9.2. 

Any help, pointers to docs that can help, or even a padded cell would be
much appreciated! 

regards,
deborah 

 




Re: [Nagios-users] downtime not working

2006-01-09 Thread Rafael Bandeira da Costa




Hello

Your webserver user must have write access on /usr/local/nagios/var/rw
directory.
Have you set up the nagios command group, and added the webserver user
to that group?

Then you would chown nagios.nagcmd /usr/local/nagios/var/rw and
chmod g+x /usr/local/nagios/var/rw.

Just take a close look to adapt it to your needs (nagios homedir,
nagios user and nagios command group).
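
To see what the directory and the pipe actually allow, a quick look at their mode bits helps. This is a minimal sketch using only the Python standard library; the function name is made up for illustration. As general Unix background rather than anything Nagios-specific: creating entries in a directory needs group write and execute on it, while opening an existing pipe for writing needs execute (search) on the directory and write on the pipe itself.

```python
import os
import stat


def describe_group_access(path):
    """Return the owning gid and the group permission bits of path."""
    info = os.stat(path)
    mode = info.st_mode
    return {
        "gid": info.st_gid,
        "group_read": bool(mode & stat.S_IRGRP),
        "group_write": bool(mode & stat.S_IWGRP),
        "group_exec": bool(mode & stat.S_IXGRP),
        "setgid": bool(mode & stat.S_ISGID),
    }
```

Run it once on /usr/local/nagios/var/rw and once on the nagios.cmd pipe inside it, then compare the reported gid with the groups the webserver user belongs to.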

-- 
Cheers,
Rafael Costa
[EMAIL PROTECTED]
Curitiba / PR / Brasil


Nelson Serafica wrote:

> Whenever I try to schedule downtime, it shows:
>
> Error: Could not open command file
> '/usr/local/nagios/var/rw/nagios.cmd' for update!
> The permissions on the external command file and/or directory may be
> incorrect. Read the FAQs on how to setup proper permissions.
> An error occurred while attempting to commit your command for
> processing.
>
> I ran "chown -R nagios.nagios " just to make sure everything is under
> the nagios user.
>
> I checked cgi and put "noc" (because this is the user that gets in to
> my web interface) to have access in all external cgi.
>
> What might be the cause of the problem?





[Nagios-users] downtime not working

2006-01-09 Thread Nelson Serafica
Whenever I try to schedule downtime, it shows:

Error: Could not open command file '/usr/local/nagios/var/rw/nagios.cmd' for update!
The permissions on the external command file and/or directory may be
incorrect. Read the FAQs on how to setup proper permissions.
An error occurred while attempting to commit your command for processing.

I ran "chown -R nagios.nagios " just to make sure everything is under
the nagios user. I checked cgi and put "noc" (because this is the user
that gets in to my web interface) to have access in all external cgi.

What might be the cause of the problem?

[Nagios-users] Check_disk problem on remote machines

2006-01-09 Thread Miguel Faria








This is a problem that I've been having for quite some time. Searching
the archives turns up a few other people reporting the same problem, but
I can't find a solution.

I am trying to use check_by_ssh to run the check_disk plugin on a few
remote systems. When I run this from the command line, it always does
the right thing. It returns the correct string and return code (checked
with 'echo $?'). It works flawlessly.

However, when this is set up to run through Nagios, it usually reports
the check in the UNKNOWN state...even though it still gives the correct
"OK..." string. Occasionally, Nagios will change it to OK or WARNING or
whatever is correct at the time, but for a good 90% of the time, it just
sits there in UNKNOWN. It doesn't seem to matter if the target machine
is Linux or Solaris, nor can I find any pattern to when and where it
happens. Again, it *always* works perfectly when run from the command
line.

I've played with giving check_by_ssh a hostname instead of an IP
address. I've tried it both with '-l nagios' and without (it's always
run from nagios account). I've also tried the -f flag (the purpose of
which I don't really understand). Nothing seems to help. For the time
being, I'm just running an SNMP client to get this info into Nagios, but
I'd really rather use check_by_ssh if at all possible.

Has anyone else seen this problem and/or found the cause of it?

Here's the exact command I'm using on the command line. This same thing
is configured in my .cfg file.

check_by_ssh -H 192.168.230.112 -C
'/home/utils/nagios/libexec/check_disk -w 10% -c 5% -p /var'
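
Since Nagios takes the service state from the plugin's exit code (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN) rather than from the output text, an "OK..." string shown next to UNKNOWN suggests the exit code differs when Nagios runs the command. A minimal sketch for capturing both side by side, using only the Python standard library, might look like this; the function name is made up, and mapping codes outside 0 to 3 to UNKNOWN is an assumption of the sketch.

```python
import subprocess
import sys

# Standard plugin exit codes and the states they stand for.
STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}


def run_plugin(argv):
    """Run a plugin and return (state, exit_code, first_output_line, stderr)."""
    completed = subprocess.run(argv, stdout=subprocess.PIPE,
                               stderr=subprocess.PIPE,
                               universal_newlines=True)
    # Assumption: anything outside 0..3 is reported here as UNKNOWN.
    state = STATES.get(completed.returncode, "UNKNOWN")
    lines = completed.stdout.splitlines()
    first_line = lines[0] if lines else ""
    return state, completed.returncode, first_line, completed.stderr
```

Calling run_plugin with the check_by_ssh argv above from a wrapper that Nagios executes, and logging the result, would show whether the exit code or stderr differs from what the interactive shell reports.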

Thanks!








[Nagios-users] notification via YM

2006-01-09 Thread Nelson Serafica
Is there an addon that sends a message to a Yahoo Messenger account when
a host goes down?

Re: [Nagios-users] notify via sms

2006-01-09 Thread Thibault Genessay

Askar Ali wrote:



> My problem is that my gnokii is running on a different machine than
> nagios. Can someone guide me on how to pipe a message from the nagios
> machine to the machine running the sms gateway (gnokii),
>
> so that " echo $mess | gnokii --sendsms $number" does the sending of
> the sms?


You can use a remote shell like SSH and issue the command
ssh [EMAIL PROTECTED] 'echo $mess | gnokii --sendsms $number'
from your Nagios host.
However, automating this process requires the use of an authentication 
agent like ssh-agent.
You might also try to use NSCA or NRPE, or any custom remote check 
execution service, especially if one is already deployed on your Gnokii 
machine. This avoids the setup of ssh-agent at the cost of being less 
secure.
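
One detail to watch with the ssh form: inside single quotes, $mess and $number are not expanded on the Nagios host, so the remote shell would try to expand them and most likely find them unset. A minimal sketch of building the remote command with the values already substituted and safely quoted, using Python's standard shlex module, could look like this. The function name and the sample target, message, and number are hypothetical.

```python
import shlex


def build_remote_sms_command(ssh_target, message, number):
    """Return an argv list that runs gnokii on the remote SMS gateway.

    The message and number are quoted locally so the remote shell sees
    them as literal text instead of expanding variables itself.
    """
    remote = "echo %s | gnokii --sendsms %s" % (
        shlex.quote(message), shlex.quote(number))
    return ["ssh", ssh_target, remote]
```

The returned list can be passed to subprocess.run from a notification script; the authentication setup described above is still needed for it to run unattended.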


Regards

--
Thibault GENESSAY
ALIADIS
www.aliadis.fr
Tel.  0870 723 724
Fax   04 72 13 90 40 






Re: [Nagios-users] Installing Nagios RPM

2006-01-09 Thread srunschke
[EMAIL PROTECTED] schrieb am 05.01.2006 10:56:11:

> Is it seen as recommended behaviour to use "Antwort:" instead of "Re:"?
> Especially when sending mail to a mailing list that runs in english?
> 
> SCNR ;-)

I didn't take any offense ;)
But it's not something I can do anything about.
Lotus Notes sets it and you cannot change it by any means.
But I do manually edit the subject if it has too many
"Antwort: Re: Antwort: Re:" prefixes.

> BTW, a footer is separated with/preceded by "-- " (without the ") :->
> not by a line full of "-".

I know, but yet again, there is nothing I could do about it. Well
I could, but I would risk getting a phonecall from the management
if they ever find out...

This footer is mandatory in our company. Even though I know it
does not follow common netiquette, I think I am valuable
enough to this mailing list that people can bear with it ;)

regards
sash

PS: Yes, I could manually add a "-- " at the end of every post of mine.
Sorry, but I hope everyone understands that I am SO not going to do
it, for the sake of laziness ;)

--
Sascha Runschke
Netzwerk Administration
IT-Services

ABIT AG
Robert-Bosch-Str. 1
40668 Meerbusch

Tel.:+49 (0) 2150.9153.226
Mobil:+49 (0) 173.5419665
mailto:[EMAIL PROTECTED]

http://www.abit.net
http://www.abit-epos.net
-
Sicherheitshinweis zur E-Mail Kommunikation /
  Security note regarding email communication:
http://www.abit.net/sicherheitshinweis.html

