Re: [Nagios-users] Interesting problem while trying to monitor Oracle RAC services [Solved]

2009-04-08 Thread Kumar, Ashish
>> check the environment of the users launching the script. Which user do you
>> "use" to launch the script locally? And which one from remote?
>
> On nagios server I have tried executing it as root user as well as
> nagios user but the problem remains.
>


Hello all,

I made a mistake in the plug-in.  Both Perl script and KSH script were
residing in the same directory.  When executing locally it knew where
to look for external shell script. The code was as follows:

my $PIPED = qx# ksh check_oracle_services.sh $SERVICE #;

But when executed from nagios server, NRPE daemon on monitored host
wouldn't know where to look for the shell script hence the wrong
output.  Adding absolute path to the check_oracle_services.sh fixed
the problem

my $PIPED = qx# ksh /home/nagios/nrpe/libexec/check_oracle_services.sh
$SERVICE #;

The new code is as follows (may be someone would find it useful):

check_oracle_services.pl


#!/usr/bin/env perl

use strict;
use Getopt::Std;

my %return_value = (
OK => 0,
CRIT => 2,
UNKNOWN => 3
);

my $message = "nagios";
my $exit_status;

my %opt=();
getopts("p:h", \%opt);

sub usage(){
print "Usage: $0 -p service_name\n";
exit $return_value{'UNKNOWN'};
}

usage() if defined $opt{'h'};

my $SERVICE = $opt{'p'} if defined $opt{'p'} || usage();

my $PIPED = qx# ksh /home/nagios/nrpe/libexec/check_oracle_services.sh
$SERVICE #;

if ($PIPED =~ /OFFLINE/g) {
$exit_status = $return_value{'CRIT'};
$message = "Critical: $SERVICE is not running.";
} else {
$exit_status = $return_value{'OK'};
$message = "OK: $SERVICE is running.";
}

print "$message\n";
exit $exit_status;




check_oracle_services.sh


#!/usr/bin/ksh

RSC_KEY=$1

/oracle/crs_home/bin/crs_stat -u | awk \
'BEGIN { FS="="; state = 0; } \
$1~/NAME/ && $2~/'$RSC_KEY'/ {appname = $2; state=1}; \
state == 0 {next;} \
$1~/TARGET/ && state == 1 {apptarget = $2; state=2;} \
$1~/STATE/ && state == 2 {appstate = $2; state=3;} \
state == 3 {printf "%-45s %-18s\n", appname, appstate; state=0;}'



Sorry for the inconvenience caused.

Thanks

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] monitor primergy servers with esx

2009-04-08 Thread Natxo Asenjo
hi,

I have done some tests. One needs the RAID.mib file and this file is
only in the windows package for serverviewraid (tsk, tsk).

Anyway, once we have this file we can test stuff.

Important OIDS:
.1.3.6.1.4.1.231.2.49.1.5.2.1 ->
.iso.org.dod.internet.private.enterprises.sni.sniProductMibs.fscRAIDMIB.svrObjects.svrPhysicalDeviceInfo.svrPhysicalDeviceTable.svrPhysicalDeviceEntry

.1.3.6.1.4.1.231.2.49.1.3 ->
.iso.org.dod.internet.private.enterprises.sni.sniProductMibs.fscRAIDMIB.svrObjects.svrStatus

Situation 1: disks are online and working fine:

$ snmpwalk server -c public -v 1 -m FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.3

FSC-RAID-MIB::svrStatusLogicalDrives.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrStatusPhysicalDevices.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrStatusControllers.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrStatusOverall.0 = INTEGER: ok(1)

[j...@pc2668-210307 mibs]$ snmpwalk izvm01 -c public -v 1 -m
FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.5.2.1

FSC-RAID-MIB::svrPhysicalDeviceCtrlNr.1.1.0.0 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceCtrlNr.1.2.3.0 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceChannel.1.1.0.0 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceChannel.1.2.3.0 = INTEGER: 2
FSC-RAID-MIB::svrPhysicalDeviceTarget.1.1.0.0 = INTEGER: 0
FSC-RAID-MIB::svrPhysicalDeviceTarget.1.2.3.0 = INTEGER: 3
FSC-RAID-MIB::svrPhysicalDeviceLUN.1.1.0.0 = INTEGER: 0
FSC-RAID-MIB::svrPhysicalDeviceLUN.1.2.3.0 = INTEGER: 0
FSC-RAID-MIB::svrPhysicalDeviceModelName.1.1.0.0 = STRING: "ST373455SS"
FSC-RAID-MIB::svrPhysicalDeviceModelName.1.2.3.0 = STRING: "ST373455SS"
FSC-RAID-MIB::svrPhysicalDeviceVendorName.1.1.0.0 = STRING: "SEAGATE"
FSC-RAID-MIB::svrPhysicalDeviceVendorName.1.2.3.0 = STRING: "SEAGATE"
FSC-RAID-MIB::svrPhysicalDeviceCapacity.1.1.0.0 = INTEGER: 68
FSC-RAID-MIB::svrPhysicalDeviceCapacity.1.2.3.0 = INTEGER: 68
FSC-RAID-MIB::svrPhysicalDeviceMaxTransferRate.1.1.0.0 = INTEGER: 300
FSC-RAID-MIB::svrPhysicalDeviceMaxTransferRate.1.2.3.0 = INTEGER: 300
FSC-RAID-MIB::svrPhysicalDeviceType.1.1.0.0 = INTEGER: disk(2)
FSC-RAID-MIB::svrPhysicalDeviceType.1.2.3.0 = INTEGER: disk(2)
FSC-RAID-MIB::svrPhysicalDeviceConfiguredDisk.1.1.0.0 = INTEGER: true(2)
FSC-RAID-MIB::svrPhysicalDeviceConfiguredDisk.1.2.3.0 = INTEGER: true(2)
FSC-RAID-MIB::svrPhysicalDeviceInterface.1.1.0.0 = INTEGER: sas(6)
FSC-RAID-MIB::svrPhysicalDeviceInterface.1.2.3.0 = INTEGER: sas(6)
FSC-RAID-MIB::svrPhysicalDeviceErrors.1.1.0.0 = Counter32: 0
FSC-RAID-MIB::svrPhysicalDeviceErrors.1.2.3.0 = Counter32: 0
FSC-RAID-MIB::svrPhysicalDeviceNrBadBlocks.1.1.0.0 = Counter32: 0
FSC-RAID-MIB::svrPhysicalDeviceNrBadBlocks.1.2.3.0 = Counter32: 0
FSC-RAID-MIB::svrPhysicalDeviceSmartStatus.1.1.0.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrPhysicalDeviceSmartStatus.1.2.3.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrPhysicalDeviceStatus.1.1.0.0 = INTEGER: online(3)
FSC-RAID-MIB::svrPhysicalDeviceStatus.1.2.3.0 = INTEGER: online(3)
FSC-RAID-MIB::svrPhysicalDeviceFirmwareRevision.1.1.0.0 = STRING: "1651"
FSC-RAID-MIB::svrPhysicalDeviceFirmwareRevision.1.2.3.0 = STRING: "1651"
FSC-RAID-MIB::svrPhysicalDeviceSerialNumber.1.1.0.0 = STRING: "3LQ0DA03"
FSC-RAID-MIB::svrPhysicalDeviceSerialNumber.1.2.3.0 = STRING: "3LQ0DAD7"
FSC-RAID-MIB::svrPhysicalDeviceForeignConfig.1.1.0.0 = INTEGER: false(1)
FSC-RAID-MIB::svrPhysicalDeviceForeignConfig.1.2.3.0 = INTEGER: false(1)
FSC-RAID-MIB::svrPhysicalDeviceIdx.1.1.0.0 = INTEGER: 11
FSC-RAID-MIB::svrPhysicalDeviceIdx.1.2.3.0 = INTEGER: 12
FSC-RAID-MIB::svrPhysicalDeviceEntry.20.1.1.0.0 = INTEGER: 4
FSC-RAID-MIB::svrPhysicalDeviceEntry.20.1.2.3.0 = INTEGER: 4
FSC-RAID-MIB::svrPhysicalDeviceEntry.21.1.1.0.0 = INTEGER: 70007
FSC-RAID-MIB::svrPhysicalDeviceEntry.21.1.2.3.0 = INTEGER: 70007

Disks are online

Situation 2: I remove one disk from its bay

$ snmpwalk server -c public -v 1 -m FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.3
FSC-RAID-MIB::svrStatusLogicalDrives.0 = INTEGER: prefailure(2)
FSC-RAID-MIB::svrStatusPhysicalDevices.0 = INTEGER: failure(3)
FSC-RAID-MIB::svrStatusControllers.0 = INTEGER: prefailure(2)
FSC-RAID-MIB::svrStatusOverall.0 = INTEGER: prefailure(2)

Everything is 'prefailure', except for physicaldevices, it's a
'failure' (disk is physically removed from the bay). I forgot to check
the other OID for this one, I'll post the results later.

Situation 3: 'failed' disk is back in bay, rebuilding starts:

$ snmpwalk server -c public -v 1 -m FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.3

FSC-RAID-MIB::svrStatusLogicalDrives.0 = INTEGER: prefailure(2)
FSC-RAID-MIB::svrStatusPhysicalDevices.0 = INTEGER: ok(1)
FSC-RAID-MIB::svrStatusControllers.0 = INTEGER: prefailure(2)
FSC-RAID-MIB::svrStatusOverall.0 = INTEGER: prefailure(2)

everthing is 'prefailure' except for svrStatusPhysicalDevices.0, it is
'ok', disks is in bay.

$ snmpwalk server -c public -v 1 -m FSC-RAID-MIB .1.3.6.1.4.1.231.2.49.1.5.2.1
FSC-RAID-MIB::svrPhysicalDeviceCtrlNr.1.0.0.0 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceCtrlNr.1.3.0.3 = INTEGER: 1
FSC-RAID-MIB::svrPhysicalDeviceChannel

Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread Marco Tirado
Hello:

There are a couple of examples in the nagios exchange page of different
approachs for integrating nagios and cacti. You should check that out.

I believe the synchronization is going to cost you time and money, a better
approach is to use nagios + pnp4naigos (this generates nice graphs) +
check_snmp_int.pl (this for bandwidth tests). That way you have only one
place to place your configuration.  There are tons of other snmp plugins you
can use for other tests (CPU, Memory, etc),

//Marco

On Wed, Apr 8, 2009 at 11:15 AM, Christopher McAtackney
wrote:

> Hi all,
>
> I've been looking into making use of Cacti to act as an SNMP
> management tool which runs alongside my Nagios instance.
>
> Ideally, what I would like to do is have Cacti monitor various
> SNMP-exposed metrics on my hosts, and then have a service check in
> Nagios which parses Cacti's results (which I believe are RRD files)
> and send alerts etc.
>
> Nagios itself will still be used for running directly checks for
> services running, errors in log files etc.
>
> Does this approach make sense?
>
> One issue that I can think of is the difficulty in keeping the config
> files of Nagios and Cacti synchronised.  I was planning on using Lilac
> Platform to act as my Nagios config file management tool, but how that
> is kept in synch with Cacti is a problem. Has anyone ever set up an
> arrangement like this before?
>
> Cheers,
> Chris
>
>
> --
> This SF.net email is sponsored by:
> High Quality Requirements in a Collaborative Environment.
> Download a free trial of Rational Requirements Composer Now!
> http://p.sf.net/sfu/www-ibm-com
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Nagios and Cacti

2009-04-08 Thread Christopher McAtackney
Hi all,

I've been looking into making use of Cacti to act as an SNMP
management tool which runs alongside my Nagios instance.

Ideally, what I would like to do is have Cacti monitor various
SNMP-exposed metrics on my hosts, and then have a service check in
Nagios which parses Cacti's results (which I believe are RRD files)
and send alerts etc.

Nagios itself will still be used for running directly checks for
services running, errors in log files etc.

Does this approach make sense?

One issue that I can think of is the difficulty in keeping the config
files of Nagios and Cacti synchronised.  I was planning on using Lilac
Platform to act as my Nagios config file management tool, but how that
is kept in synch with Cacti is a problem. Has anyone ever set up an
arrangement like this before?

Cheers,
Chris

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Name or service not known

2009-04-08 Thread Thierry Lavallée
Hi,
I am getting the following about 50 times per day:

HTTPCRITICAL04-08-2009 09:37:29 0d 0h 1m 50s1/4 Name or 
service not known
SSH UNKNOWN 04-08-2009 09:34:39 0d 0h 4m 40s1/4 
Usage:check_ssh
[-46] [-t ] [-r ] [-p ] 

I am somewhat at lost here...
Can anyone help with this?
-- 
Thierry

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Name or service not known

2009-04-08 Thread Assaf Flatto
can you provide the relevant conf so we may understand more about your problem ?



On Wednesday 08 April 2009 14:41:27 Thierry Lavallée wrote:
> Hi,
> I am getting the following about 50 times per day:
>
> HTTP  CRITICAL04-08-2009 09:37:29 0d 0h 1m 50s1/4 Name or 
> service not
> known SSH UNKNOWN 04-08-2009 09:34:39 0d 0h 4m 40s1/4 
> Usage:check_ssh
> [-46] [-t ] [-r ] [-p ] 
>
> I am somewhat at lost here...
> Can anyone help with this?



-- 
Assaf Flatto
SSP Ops Team
Linux System Administrator
169 Euston Road, London, NW1 2AE





IMPORTANT . this email and the information in it may be confidential, legally
privileged and/or protected by law. It is intended solely for the use of the
person to whom it is addressed. If you are not the intended recipient, please
notify the sender immediately and do not disclose the contents to any other
person, use it for any purpose, or store or copy the information in any medium.
Please also delete all copies of this email and any attachments from your
system.

We cannot guarantee the security or confidentiality of email communications. We
do not accept any liability for losses or damages that you may suffer as a
result of your receipt of this email including but not limited to computer
service or system failure, access delays or interruption, data non-delivery or
mis-delivery, computer viruses or other harmful components.

Copyright in this email and any attachments belong to Select Service Partner UK
Limited. Should you communicate with anyone at Select Service Partner UK 
Limited by
email, you consent to us monitoring and reading any such correspondence.

Nothing in this email shall be taken or read as suggesting, proposing or
relating to any agreement concerted practice or other practice that could
infringe UK or EC competition legislation.

Select Service Partner UK Limited is a company registered in England and Wales
(company number 05687183) whose registered office is at 1 The Heights, 
Brooklands, Weybridge. Surrey. KT13 0NY
 
 

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Name or service not known

2009-04-08 Thread Thierry Lavallée
thanks a lot for your reply Assaf,

I am not sure which conf you mean but here are a few things in the
meantime. Maybe you could point this Nagios newbie what you need? :/

thanks!
-- 
Thierry


HostDescription Max. Check Attempts Normal Check Interval   Retry
Check Interal   Check Command   Check PeriodParallelize Volatile
Obsess Over Enable Active ChecksEnable Passive Checks   Check
Freshness   Freshness Threshold Default Contacts/Groups Enable
Notifications   Notification Interval   First Notification Delay
Notification OptionsNotification Period Event Handler   Enable 
Event
Handler Stalking OptionsEnable Flap Detection   Low Flap
Threshold   High Flap Threshold Flap Detection Options  Process
Performance DataEnable Failure Prediction   Failure Prediction
Options Notes   Notes URL   Action URL  Logo Image  Image Alt   
Retention
Options
localhost   Current Load4   0h 5m 0s0h 1m 0s
check_local_load!5.0,4.0,3.0!10.0,6.0,4.0   24x7Yes No  
Yes Yes
Yes No  Auto-determined value   admins  Yes 1h 0m 0s
0h 0m 0s
Unknown, Warning, Critical, Recovery24x7Yes None
Yes
Program-wide value  Program-wide value  Ok, Warning, Unknown,
CriticalYes Yes 
Status Information, Non-Status
Information
localhost   Current Users   4   0h 5m 0s0h 1m 0s
check_local_users!20!50 24x7Yes No  Yes Yes 
Yes No
Auto-determined value   admins  Yes 1h 0m 0s0h 0m 0s
Unknown,
Warning, Critical, Recovery 24x7Yes NoneYes 
Program-wide
value   Program-wide value  Ok, Warning, Unknown, Critical  Yes Yes 
Status Information, Non-Status 
Information
localhost   HTTP4   0h 5m 0s0h 1m 0scheck_http  
24x7Yes No  Yes
Yes Yes No  Auto-determined value   admins  No  1h 0m 
0s0h 0m 0s
Unknown, Warning, Critical, Recovery24x7Yes None
Yes
Program-wide value  Program-wide value  Ok, Warning, Unknown,
CriticalYes Yes 
Status Information, Non-Status
Information
localhost   PING4   0h 5m 0s0h 1m 0s
check_ping!100.0,20%!500.0,60%
24x7Yes No  Yes Yes Yes No  Auto-determined 
value   admins  Yes
1h 0m 0s0h 0m 0sUnknown, Warning, Critical, Recovery
24x7
Yes NoneYes Program-wide value  Program-wide value  
Ok, Warning,
Unknown, Critical   Yes Yes 
Status Information,
Non-Status Information
localhost   Root Partition  4   0h 5m 0s0h 1m 0s
check_local_disk!20%!10%!/  24x7Yes No  Yes Yes 
Yes No
Auto-determined value   admins  Yes 1h 0m 0s0h 0m 0s
Unknown,
Warning, Critical, Recovery 24x7Yes NoneYes 
Program-wide
value   Program-wide value  Ok, Warning, Unknown, Critical  Yes Yes 
Status Information, Non-Status 
Information
localhost   SSH 4   0h 5m 0s0h 1m 0scheck_ssh   
24x7Yes No  Yes
Yes Yes No  Auto-determined value   admins  No  1h 0m 
0s0h 0m 0s
Unknown, Warning, Critical, Recovery24x7Yes None
Yes
Program-wide value  Program-wide value  Ok, Warning, Unknown,
CriticalYes Yes 
Status Information, Non-Status
Information
localhost   Swap Usage  4   0h 5m 0s0h 1m 0s
check_local_swap!20!10
24x7Yes No  Yes Yes Yes No  Auto-determined 
value   admins  Yes
1h 0m 0s0h 0m 0sUnknown, Warning, Critical, Recovery
24x7
Yes NoneYes Program-wide value  Program-wide value  
Ok, Warning,
Unknown, Critical   Yes Yes 
Status Information,
Non-Status Information
localhost   Total Processes 4   0h 5m 0s0h 1m 0s
check_local_procs!250!400!RSZDT 24x7Yes No  Yes 
Yes Yes No
Auto-determined value   admins  Yes 1h 0m 0s0h 0m 0s
Unknown,
Warning, Critical, Recovery 24x7Yes NoneYes 
Program-wide
value   Program-wide value  Ok, Warning, Unknown, Critical  

Re: [Nagios-users] Name or service not known

2009-04-08 Thread Thierry Lavallée
or maybe you mean:


check_http

check-host-alive$USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c
5000.0,100% -p 5
check_dhcp  $USER1$/check_dhcp $ARG1$
check_ftp   $USER1$/check_ftp -H $HOSTADDRESS$ $ARG1$
check_hpjd  $USER1$/check_hpjd -H $HOSTADDRESS$ $ARG1$
check_http  $USER1$/check_http -I $HOSTADDRESS$ $ARG1$
check_imap  $USER1$/check_imap -H $HOSTADDRESS$ $ARG1$
check_local_disk$USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$
check_local_load$USER1$/check_load -w $ARG1$ -c $ARG2$
check_local_mrtgtraf$USER1$/check_mrtgtraf -F $ARG1$ -a $ARG2$ -w
$ARG3$ -c $ARG4$ -e $ARG5$
check_local_procs   $USER1$/check_procs -w $ARG1$ -c $ARG2$ -s $ARG3$
check_local_swap$USER1$/check_swap -w $ARG1$ -c $ARG2$
check_local_users   $USER1$/check_users -w $ARG1$ -c $ARG2$
check_nt$USER1$/check_nt -H $HOSTADDRESS$ -p 12489 -v $ARG1$ $ARG2$
check_ping  $USER1$/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -p 5
check_pop   $USER1$/check_pop -H $HOSTADDRESS$ $ARG1$
check_smtp  $USER1$/check_smtp -H $HOSTADDRESS$ $ARG1$
check_snmp  $USER1$/check_snmp -H $HOSTADDRESS$ $ARG1$
check_ssh   $USER1$/check_ssh $ARG1$ $HOSTADDRESS$
check_tcp   $USER1$/check_tcp -H $HOSTADDRESS$ -p $ARG1$ $ARG2$
check_udp   $USER1$/check_udp -H $HOSTADDRESS$ -p $ARG1$ $ARG2$
notify-host-by-email/usr/bin/printf "%b" "* Nagios
*\n\nNotification Type: $NOTIFICATIONTYPE$\nHost:
$HOSTNAME$\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo:
$HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | /bin/mail -s "**
$NOTIFICATIONTYPE$ Host Alert: $HOSTNAME$ is $HOSTSTATE$ **"
$CONTACTEMAIL$
notify-service-by-email /usr/bin/printf "%b" "* Nagios
*\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService:
$SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState:
$SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional
Info:\n\n$SERVICEOUTPUT$" | /bin/mail -s "** $NOTIFICATIONTYPE$
Service Alert: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **"
$CONTACTEMAIL$
process-host-perfdata   /usr/bin/printf "%b"
"$LASTHOSTCHECK$\t$HOSTNAME$\t$HOSTSTATE$\t$HOSTATTEMPT$\t$HOSTSTATETYPE$\t$HOSTEXECUTIONTIME$\t$HOSTOUTPUT$\t$HOSTPERFDATA$\n"
>> /usr/local/nagios/var/host-perfdata.out
process-service-perfdata/usr/bin/printf "%b"
"$LASTSERVICECHECK$\t$HOSTNAME$\t$SERVICEDESC$\t$SERVICESTATE$\t$SERVICEATTEMPT$\t$SERVICESTATETYPE$\t$SERVICEEXECUTIONTIME$\t$SERVICELATENCY$\t$SERVICEOUTPUT$\t$SERVICEPERFDATA$\n"
>> /usr/local/nagios/var/service-perfdata.out



check_SSH


check-host-alive$USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c
5000.0,100% -p 5
check_dhcp  $USER1$/check_dhcp $ARG1$
check_ftp   $USER1$/check_ftp -H $HOSTADDRESS$ $ARG1$
check_hpjd  $USER1$/check_hpjd -H $HOSTADDRESS$ $ARG1$
check_http  $USER1$/check_http -I $HOSTADDRESS$ $ARG1$
check_imap  $USER1$/check_imap -H $HOSTADDRESS$ $ARG1$
check_local_disk$USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$
check_local_load$USER1$/check_load -w $ARG1$ -c $ARG2$
check_local_mrtgtraf$USER1$/check_mrtgtraf -F $ARG1$ -a $ARG2$ -w
$ARG3$ -c $ARG4$ -e $ARG5$
check_local_procs   $USER1$/check_procs -w $ARG1$ -c $ARG2$ -s $ARG3$
check_local_swap$USER1$/check_swap -w $ARG1$ -c $ARG2$
check_local_users   $USER1$/check_users -w $ARG1$ -c $ARG2$
check_nt$USER1$/check_nt -H $HOSTADDRESS$ -p 12489 -v $ARG1$ $ARG2$
check_ping  $USER1$/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -p 5
check_pop   $USER1$/check_pop -H $HOSTADDRESS$ $ARG1$
check_smtp  $USER1$/check_smtp -H $HOSTADDRESS$ $ARG1$
check_snmp  $USER1$/check_snmp -H $HOSTADDRESS$ $ARG1$
check_ssh   $USER1$/check_ssh $ARG1$ $HOSTADDRESS$
check_tcp   $USER1$/check_tcp -H $HOSTADDRESS$ -p $ARG1$ $ARG2$
check_udp   $USER1$/check_udp -H $HOSTADDRESS$ -p $ARG1$ $ARG2$
notify-host-by-email/usr/bin/printf "%b" "* Nagios
*\n\nNotification Type: $NOTIFICATIONTYPE$\nHost:
$HOSTNAME$\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo:
$HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | /bin/mail -s "**
$NOTIFICATIONTYPE$ Host Alert: $HOSTNAME$ is $HOSTSTATE$ **"
$CONTACTEMAIL$
notify-service-by-email /usr/bin/printf "%b" "* Nagios
*\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService:
$SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState:
$SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional
Info:\n\n$SERVICEOUTPUT$" | /bin/mail -s "** $NOTIFICATIONTYPE$
Service Alert: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **"
$CONTACTEMAIL$
process-host-perfdata   /usr/bin/printf "%b"
"$LASTHOSTCHECK$\t$HOSTNAME$\t$HOSTSTATE$\t$HOSTATTEMPT$\t$HOSTSTATETYPE$\t$HOSTEXECUTIONTIME$\t$HOSTOUTPUT$\t$HOSTPERFDATA$\n"
>> /usr/local/nagios/var/ho

Re: [Nagios-users] Name or service not known

2009-04-08 Thread Assaf Flatto
Thierry

The information you need to look into is the configuration files of the nagios 
, located in the etc 
directory of the nagios installation directory .

which version of nagios are you running ?
did you install from source or packages ?
what distro are you using ?

how did you preform the initial installation ?

Assaf


On Wednesday 08 April 2009 15:29:02 Thierry Lavallée wrote:
> thanks a lot for your reply Assaf,
>
> I am not sure which conf you mean but here are a few things in the
> meantime. Maybe you could point this Nagios newbie what you need? :/
>
> thanks!
> --
> Thierry
>
> > --
> > Assaf Flatto
> > SSP Ops Team
> > Linux System Administrator
> > 169 Euston Road, London, NW1 2AE





IMPORTANT . this email and the information in it may be confidential, legally
privileged and/or protected by law. It is intended solely for the use of the
person to whom it is addressed. If you are not the intended recipient, please
notify the sender immediately and do not disclose the contents to any other
person, use it for any purpose, or store or copy the information in any medium.
Please also delete all copies of this email and any attachments from your
system.

We cannot guarantee the security or confidentiality of email communications. We
do not accept any liability for losses or damages that you may suffer as a
result of your receipt of this email including but not limited to computer
service or system failure, access delays or interruption, data non-delivery or
mis-delivery, computer viruses or other harmful components.

Copyright in this email and any attachments belong to Select Service Partner UK
Limited. Should you communicate with anyone at Select Service Partner UK 
Limited by
email, you consent to us monitoring and reading any such correspondence.

Nothing in this email shall be taken or read as suggesting, proposing or
relating to any agreement concerted practice or other practice that could
infringe UK or EC competition legislation.

Select Service Partner UK Limited is a company registered in England and Wales
(company number 05687183) whose registered office is at 1 The Heights, 
Brooklands, Weybridge. Surrey. KT13 0NY
 
 

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Name or service not known

2009-04-08 Thread Assaf Flatto
Thierry

please send all your replies to the list and not to individual members - so 
other may learn from the 
information gathered by other members .

second :
this is a command declaration in the commands file , have you associated a 
service to the host ?

and even before that - did you install the nagios-plugins ? 
the nagios-plugins are the actual check scripts executed to preform the checks 
, and without them 
you will not be able to use any of nagios capabilities.

Assaf

> check_ssh   $USER1$/check_ssh $ARG1$ $HOSTADDRESS$



-- 
Assaf Flatto
SSP Ops Team
Linux System Administrator
169 Euston Road, London, NW1 2AE





IMPORTANT . this email and the information in it may be confidential, legally
privileged and/or protected by law. It is intended solely for the use of the
person to whom it is addressed. If you are not the intended recipient, please
notify the sender immediately and do not disclose the contents to any other
person, use it for any purpose, or store or copy the information in any medium.
Please also delete all copies of this email and any attachments from your
system.

We cannot guarantee the security or confidentiality of email communications. We
do not accept any liability for losses or damages that you may suffer as a
result of your receipt of this email including but not limited to computer
service or system failure, access delays or interruption, data non-delivery or
mis-delivery, computer viruses or other harmful components.

Copyright in this email and any attachments belong to Select Service Partner UK
Limited. Should you communicate with anyone at Select Service Partner UK 
Limited by
email, you consent to us monitoring and reading any such correspondence.

Nothing in this email shall be taken or read as suggesting, proposing or
relating to any agreement concerted practice or other practice that could
infringe UK or EC competition legislation.

Select Service Partner UK Limited is a company registered in England and Wales
(company number 05687183) whose registered office is at 1 The Heights, 
Brooklands, Weybridge. Surrey. KT13 0NY
 
 

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Adaptive Monitoring: Broken?

2009-04-08 Thread Marc Powell

On Apr 7, 2009, at 1:26 PM, Patrick Morris wrote:

> Here are the important stats:
>
> Nagios Version: Version 3.1.0
> Proficiency Level: Pretty damned high

> While the first command works fine, and sets the service to an OK  
> state,
> the next two (which I've tried in various combinations) show up in the
> Nagios logs as having been sent, but do nothing. The check that  
> appears
> in the config files keeps running instead of my check_ok check.
>
> Here's how it shows up in the logs:
>
> [1239128528] EXTERNAL COMMAND: CHANGE_SVC_EVENT_HANDLER;dummy- 
> host;DNS;check_ok
> [1239128528] EXTERNAL COMMAND: CHANGE_SVC_CHECK_COMMAND;dummy- 
> host;DNS;check_ok
>
> I've noticed the message is different if I use an invalid command, so
> I'm relatively sure I'm using the right ones; they just don't do
> anything.
>
> Event handlers are enabled for these services, but even if they  
> weren't
> the check command should change, right?
>
> Am I doing something wrong here, or have I run into a bug?

I'm not using 3.x yet but just to provide some feedback, what you're  
doing looks reasonable from my reading of the documentation. I do see  
this in 3.1.0's commands.c though --

 /* SECURITY PATCH - disable these for the time being */
 switch(cmd){
 case CMD_CHANGE_GLOBAL_HOST_EVENT_HANDLER:
 case CMD_CHANGE_GLOBAL_SVC_EVENT_HANDLER:
 case CMD_CHANGE_HOST_EVENT_HANDLER:
 case CMD_CHANGE_SVC_EVENT_HANDLER:
 case CMD_CHANGE_HOST_CHECK_COMMAND:
 case CMD_CHANGE_SVC_CHECK_COMMAND:
 return ERROR;
 }

That's in the right section and my reading of the code is that it does  
exactly that; prevent changing of those values... Maybe it's something  
being worked on in the development branch?

--
Marc


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread Andrew Davis
And just an FYI from my own experience... putting Nagios & Cacti on the 
same server has been somewhat problematic for us. We have over 400 
network devices between switches, routers, WAPs, etc. We also have about 
300 monitored servers. Initially I had Nagios and Cacti both on one 
server with Cacti running via cron every 5 minutes. About every 5 
minutes, my shells would become unresponsive for roughly 30 to 90 
seconds. Turning off either Nagios or Cacti resolved the issue. Running 
both seems to have hammered the server a bit (4Gb of RAM, 2 x dual core 
2.x Ghz CPUs). We don't integrate Cacti and Nagios, however. Nagios does 
both trending and alerts of all servers. Cacti does trending only of all 
network devices/ports. Once I moved Cacti to its own server, all was 
fine as far as load/latency went.


 A. Davis
 Email: ncc...@gmail.com

 "There is no limit to what a man can accomplish
  if he doesn't care who gets the credit." - Ronald Reagan



Marco Tirado wrote:

Hello:

There are a couple of examples in the nagios exchange page of 
different approachs for integrating nagios and cacti. You should check 
that out.


I believe the synchronization is going to cost you time and money, a 
better approach is to use nagios + pnp4naigos (this generates nice 
graphs) + check_snmp_int.pl (this for bandwidth tests). That way you 
have only one place to place your configuration.  There are tons of 
other snmp plugins you can use for other tests (CPU, Memory, etc),


//Marco

On Wed, Apr 8, 2009 at 11:15 AM, Christopher McAtackney 
mailto:crist...@gmail.com>> wrote:


Hi all,

I've been looking into making use of Cacti to act as an SNMP
management tool which runs alongside my Nagios instance.

Ideally, what I would like to do is have Cacti monitor various
SNMP-exposed metrics on my hosts, and then have a service check in
Nagios which parses Cacti's results (which I believe are RRD files)
and send alerts etc.

Nagios itself will still be used for running directly checks for
services running, errors in log files etc.

Does this approach make sense?

One issue that I can think of is the difficulty in keeping the config
files of Nagios and Cacti synchronised.  I was planning on using Lilac
Platform to act as my Nagios config file management tool, but how that
is kept in synch with Cacti is a problem. Has anyone ever set up an
arrangement like this before?

Cheers,
Chris


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net

https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when
reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null




--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com


___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread Christopher McAtackney
2009/4/8 Andrew Davis :
> And just an FYI from my own experience... putting Nagios & Cacti on the same
> server has been somewhat problematic for us. We have over 400 network
> devices between switches, routers, WAPs, etc. We also have about 300
> monitored servers. Initially I had Nagios and Cacti both on one server with
> Cacti running via cron every 5 minutes. About every 5 minutes, my shells
> would become unresponsive for roughly 30 to 90 seconds. Turning off either
> Nagios or Cacti resolved the issue. Running both seems to have hammered the
> server a bit (4Gb of RAM, 2 x dual core 2.x Ghz CPUs). We don't integrate
> Cacti and Nagios, however. Nagios does both trending and alerts of all
> servers. Cacti does trending only of all network devices/ports. Once I moved
> Cacti to its own server, all was fine as far as load/latency went.

That's useful to know Andrew, thanks.

Regarding the trending of network devices - is there any reason why
this can't be done by Nagios? I intend to install PNP4Nagios to take
care of graphing anyway, but I think it would be nice to have all my
monitored resources under the one system (for notifications and ease
of administration).

Is there some major advantage that Cacti provides when it comes to
SNMP monitoring of network devices that cannot be achieved with Nagios
and the various SNMP plug-ins available for it (e.g. like these ones
http://nagios.manubulon.com) ?

Cheers,
Chris

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Name or service not known

2009-04-08 Thread Thierry Lavallée
thanks Assaf,
But I cannot get passed step #1 Relax - it's going to take some time. ;)
hehe.

Really, I don't want to redo the whole install as I am not THAT server
techy and I try to rely on a service called Supreme Support that are
supposed to make my life better. I need to point to Supreme Support
where the problem is.

Do you think you can point it out quickly from the config files I sent?

Because most seems installed correctly, it looks there are loose handles.

thanks again!
-- 
Thierry

2009/4/8 Assaf Flatto :
>
> In that case I suggest you start here :
> http://nagios.sourceforge.net/docs/3_0/beginners.html
>
>
> read the documentation  and most of your questions will be answered.
>
>
> Assaf
>
>
> On Wednesday 08 April 2009 16:00:35 you wrote:
>> thanks Assaf.
>> I did not do the installation, but the support who did the
>> installation (Supreme Support) do not seem knowledgeable enough.
>>
>> I am attaching my config files (please tell me if not secure to send like
>> this) I am runnig 3.0.6
>> No idea about Distro
>>
>> hoping you can still help me.
>> thanks!
>
>
> --
> Assaf Flatto
> SSP Ops Team
> Linux System Administrator
> 169 Euston Road, London, NW1 2AE
>
>
>
>
>
> IMPORTANT . this email and the information in it may be confidential, legally
> privileged and/or protected by law. It is intended solely for the use of the
> person to whom it is addressed. If you are not the intended recipient, please
> notify the sender immediately and do not disclose the contents to any other
> person, use it for any purpose, or store or copy the information in any 
> medium.
> Please also delete all copies of this email and any attachments from your
> system.
>
> We cannot guarantee the security or confidentiality of email communications. 
> We
> do not accept any liability for losses or damages that you may suffer as a
> result of your receipt of this email including but not limited to computer
> service or system failure, access delays or interruption, data non-delivery or
> mis-delivery, computer viruses or other harmful components.
>
> Copyright in this email and any attachments belong to Select Service Partner 
> UK
> Limited. Should you communicate with anyone at Select Service Partner UK 
> Limited by
> email, you consent to us monitoring and reading any such correspondence.
>
> Nothing in this email shall be taken or read as suggesting, proposing or
> relating to any agreement concerted practice or other practice that could
> infringe UK or EC competition legislation.
>
> Select Service Partner UK Limited is a company registered in England and Wales
> (company number 05687183) whose registered office is at 1 The Heights, 
> Brooklands, Weybridge. Surrey. KT13 0NY
>
>

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] problems with log rotation

2009-04-08 Thread Eric Doutreleau
hi

i used nagios3.1
i have configured a daily log rotation

but after several days i suspect there s a problem with the log rotation

i found several instance of nagios which the majority are defunct ones
and they all begun at 00:00 but on different days.

and the scheduler is going mad.
a lot of check are not launched and i got a lot of these messages

[1239201355] Warning: The check of service 'memoire disponible' on host
'www-tp' looks like it was orphaned (results never came back).  I'm
scheduling an immediate check of the service...
[1239201355] Warning: The check of service 'nombre total de process' on
host 'www-tp' looks like it was orphaned (results never came back).  I'm
scheduling an immediate check of the service...
[1239201355] Warning: The check of service 'HTTP' on host 'yum' looks
like it was orphaned (results never came back).  I'm scheduling an
immediate check of the service...
[1239201355] Warning: The check of service 'charge' on host 'yum' looks
like it was orphaned (results never came back).  I'm scheduling an
immediate check of the service...
[1239201355] Warning: The check of service 'nombre d utilisateur' on
host 'yum' looks like it was orphaned (results never came back).  I'm
scheduling an immediate check of the service...
[1239201414] Warning: The check of service 'ciscoswitch' on host
'Indicateurs' looks like it was orphaned (results never came back).  I'm
scheduling an immediate check of the service...
[1239201414] Warning: The check of service 'smtpsout' on host
'Indicateurs_detail' looks like it was orphaned (results never came
back).  I'm scheduling an immediate check of the service...

Does someone knows how to solve that problems?

thanks in advance for any help


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Name or service not known

2009-04-08 Thread Assaf Flatto

>From the first glance it looks like they just installed it and never bothered 
>to configure anything 
in it .

not nagios is a great tool , but it requires quite a bit of initial 
configuration and setup , which 
looks like the didn't do .

Assaf



On Wednesday 08 April 2009 16:20:06 Thierry Lavallée wrote:
> thanks Assaf,
> But I cannot get passed step #1 Relax - it's going to take some time. ;)
> hehe.
>
> Really, I don't want to redo the whole install as I am not THAT server
> techy and I try to rely on a service called Supreme Support that are
> supposed to make my life better. I need to point to Supreme Support
> where the problem is.
>
> Do you think you can point it out quickly from the config files I sent?
>
> Because most seems installed correctly, it looks there are loose handles.
>
> thanks again!



-- 
Assaf Flatto
SSP Ops Team
Linux System Administrator
169 Euston Road, London, NW1 2AE






IMPORTANT . this email and the information in it may be confidential, legally
privileged and/or protected by law. It is intended solely for the use of the
person to whom it is addressed. If you are not the intended recipient, please
notify the sender immediately and do not disclose the contents to any other
person, use it for any purpose, or store or copy the information in any medium.
Please also delete all copies of this email and any attachments from your
system.

We cannot guarantee the security or confidentiality of email communications. We
do not accept any liability for losses or damages that you may suffer as a
result of your receipt of this email including but not limited to computer
service or system failure, access delays or interruption, data non-delivery or
mis-delivery, computer viruses or other harmful components.

Copyright in this email and any attachments belong to Select Service Partner UK
Limited. Should you communicate with anyone at Select Service Partner UK 
Limited by
email, you consent to us monitoring and reading any such correspondence.

Nothing in this email shall be taken or read as suggesting, proposing or
relating to any agreement concerted practice or other practice that could
infringe UK or EC competition legislation.

Select Service Partner UK Limited is a company registered in England and Wales
(company number 05687183) whose registered office is at 1 The Heights, 
Brooklands, Weybridge. Surrey. KT13 0NY
 
 

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread Daniel Emmanuel Feinsmith

It depends on the intensity of your snmp usage. Cacti has a native  
daemon to do large scale snmp getting, and it does a great job of it.  
So if u have hundreds of devices, each with a lot of interfaces, u  
will probably like cacti. The user interface is also well done for  
graphing snmp data and thresholding on it using the threshold plugin.

=
Daniel Feinsmith
=
{sent from iPhone}

On Apr 8, 2009, at 8:15 AM, Christopher McAtackney  
 wrote:

> 2009/4/8 Andrew Davis :
>> And just an FYI from my own experience... putting Nagios & Cacti on  
>> the same
>> server has been somewhat problematic for us. We have over 400 network
>> devices between switches, routers, WAPs, etc. We also have about 300
>> monitored servers. Initially I had Nagios and Cacti both on one  
>> server with
>> Cacti running via cron every 5 minutes. About every 5 minutes, my  
>> shells
>> would become unresponsive for roughly 30 to 90 seconds. Turning off  
>> either
>> Nagios or Cacti resolved the issue. Running both seems to have  
>> hammered the
>> server a bit (4Gb of RAM, 2 x dual core 2.x Ghz CPUs). We don't  
>> integrate
>> Cacti and Nagios, however. Nagios does both trending and alerts of  
>> all
>> servers. Cacti does trending only of all network devices/ports.  
>> Once I moved
>> Cacti to its own server, all was fine as far as load/latency went.
>
> That's useful to know Andrew, thanks.
>
> Regarding the trending of network devices - is there any reason why
> this can't be done by Nagios? I intend to install PNP4Nagios to take
> care of graphing anyway, but I think it would be nice to have all my
> monitored resources under the one system (for notifications and ease
> of administration).
>
> Is there some major advantage that Cacti provides when it comes to
> SNMP monitoring of network devices that cannot be achieved with Nagios
> and the various SNMP plug-ins available for it (e.g. like these ones
> http://nagios.manubulon.com) ?
>
> Cheers,
> Chris
>
> --- 
> --- 
> --- 
> -
> This SF.net email is sponsored by:
> High Quality Requirements in a Collaborative Environment.
> Download a free trial of Rational Requirements Composer Now!
> http://p.sf.net/sfu/www-ibm-com
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when  
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread Daniel Emmanuel Feinsmith
If you move your mysql instance to another server, you can get much  
better performance on a nagios/cacti server. Check top while cacti is  
running a large install and you will see that mysql is hoarding CPU  
and memory resources not leaving much for nagios.

=
Daniel Feinsmith
=
{sent from iPhone}

On Apr 8, 2009, at 8:03 AM, Andrew Davis  wrote:

> And just an FYI from my own experience... putting Nagios & Cacti on  
> the same server has been somewhat problematic for us. We have over  
> 400 network devices between switches, routers, WAPs, etc. We also  
> have about 300 monitored servers. Initially I had Nagios and Cacti  
> both on one server with Cacti running via cron every 5 minutes.  
> About every 5 minutes, my shells would become unresponsive for  
> roughly 30 to 90 seconds. Turning off either Nagios or Cacti  
> resolved the issue. Running both seems to have hammered the server a  
> bit (4Gb of RAM, 2 x dual core 2.x Ghz CPUs). We don't integrate  
> Cacti and Nagios, however. Nagios does both trending and alerts of  
> all servers. Cacti does trending only of all network devices/ports.  
> Once I moved Cacti to its own server, all was fine as far as load/ 
> latency went.
>   A. Davis
>   Email: ncc...@gmail.com
>
>   "There is no limit to what a man can accomplish
>if he doesn't care who gets the credit." - Ronald Reagan
>
>
> Marco Tirado wrote:
>>
>> Hello:
>>
>> There are a couple of examples in the nagios exchange page of  
>> different approachs for integrating nagios and cacti. You should  
>> check that out.
>>
>> I believe the synchronization is going to cost you time and money,  
>> a better approach is to use nagios + pnp4naigos (this generates  
>> nice graphs) + check_snmp_int.pl (this for bandwidth tests). That  
>> way you have only one place to place your configuration.  There are  
>> tons of other snmp plugins you can use for other tests (CPU,  
>> Memory, etc),
>>
>> //Marco
>>
>> On Wed, Apr 8, 2009 at 11:15 AM, Christopher McAtackney > > wrote:
>> Hi all,
>>
>> I've been looking into making use of Cacti to act as an SNMP
>> management tool which runs alongside my Nagios instance.
>>
>> Ideally, what I would like to do is have Cacti monitor various
>> SNMP-exposed metrics on my hosts, and then have a service check in
>> Nagios which parses Cacti's results (which I believe are RRD files)
>> and send alerts etc.
>>
>> Nagios itself will still be used for running directly checks for
>> services running, errors in log files etc.
>>
>> Does this approach make sense?
>>
>> One issue that I can think of is the difficulty in keeping the config
>> files of Nagios and Cacti synchronised.  I was planning on using  
>> Lilac
>> Platform to act as my Nagios config file management tool, but how  
>> that
>> is kept in synch with Cacti is a problem. Has anyone ever set up an
>> arrangement like this before?
>>
>> Cheers,
>> Chris
>>
>> --- 
>> --- 
>> --- 
>> -
>> This SF.net email is sponsored by:
>> High Quality Requirements in a Collaborative Environment.
>> Download a free trial of Rational Requirements Composer Now!
>> http://p.sf.net/sfu/www-ibm-com
>> ___
>> Nagios-users mailing list
>> Nagios-users@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/nagios-users
>> ::: Please include Nagios version, plugin version (-v) and OS when  
>> reporting any issue.
>> ::: Messages without supporting info will risk being sent to /dev/ 
>> null
>>
>>
>> --- 
>> --- 
>> --- 
>> -
>> This SF.net email is sponsored by:
>> High Quality Requirements in a Collaborative Environment.
>> Download a free trial of Rational Requirements Composer Now!
>> http://p.sf.net/sfu/www-ibm-com
>>
>> ___
>> Nagios-users mailing list
>> Nagios-users@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/nagios-users
>> ::: Please include Nagios version, plugin version (-v) and OS when  
>> reporting any issue.
>> ::: Messages without supporting info will risk being sent to /dev/ 
>> null
>
> --- 
> --- 
> --- 
> -
> This SF.net email is sponsored by:
> High Quality Requirements in a Collaborative Environment.
> Download a free trial of Rational Requirements Composer Now!
> http://p.sf.net/sfu/www-ibm-com
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when  
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
--
This SF.net email is spons

Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread Max
On Wed, Apr 8, 2009 at 11:52 AM, Daniel Emmanuel Feinsmith
 wrote:
>
> It depends on the intensity of your snmp usage. Cacti has a native
> daemon to do large scale snmp getting, and it does a great job of it.
> So if u have hundreds of devices, each with a lot of interfaces, u
> will probably like cacti. The user interface is also well done for
> graphing snmp data and thresholding on it using the threshold plugin.

With parallel checks in Nagios 3 and some configuration tuning and
well-written SNMP checks, I'd argue that Nagios is as good if not a
better poller than cactid :).   our instance is not huge, but
currently we do 7000+ SNMP-based checks in 3 minutes on a dual
quad-core Linux-based server.

Before PNP I used to use Cacti and Nagios.  I like Cacti, but with PNP
around I would never go back to that combination again .. Nagios + PNP
really does simplify life for Nagios administrators and provides a lot
of flexibility as far as how you scale your graphing as your node base
grows.

- Max

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread Andrew Davis
I agree. Initially I had Nagios doing all the trending. But with 400+ 
network devices and many of them with multiple 48 port blades, I found 
Cacti was easier to configure... it scaled a lot better. For a smaller 
network, you could easily do just Nagios. I've had no issues at all with 
Nagios + PNP for alerts and trending. In fact, Nagios still watches my 
core network devices (but not all the ports of them... ie: Nagios 
watches that switch1 is up and available and trends its CPU and memory 
usage... however I use Cacti for trending the 6 blades each with 48 
ports in switch1). This way, if switch1 fails or utilization is too 
high, Nagios tells me, but if a particular user is hogging all our 
bandwidth or having lots of packet loss, I find that via Cacti.


 A. Davis
 Email: ncc...@gmail.com

 "There is no limit to what a man can accomplish
  if he doesn't care who gets the credit." - Ronald Reagan



Daniel Emmanuel Feinsmith wrote:
It depends on the intensity of your snmp usage. Cacti has a native  
daemon to do large scale snmp getting, and it does a great job of it.  
So if u have hundreds of devices, each with a lot of interfaces, u  
will probably like cacti. The user interface is also well done for  
graphing snmp data and thresholding on it using the threshold plugin.


=
Daniel Feinsmith
=
{sent from iPhone}

On Apr 8, 2009, at 8:15 AM, Christopher McAtackney  
 wrote:


  

2009/4/8 Andrew Davis :

And just an FYI from my own experience... putting Nagios & Cacti on  
the same

server has been somewhat problematic for us. We have over 400 network
devices between switches, routers, WAPs, etc. We also have about 300
monitored servers. Initially I had Nagios and Cacti both on one  
server with
Cacti running via cron every 5 minutes. About every 5 minutes, my  
shells
would become unresponsive for roughly 30 to 90 seconds. Turning off  
either
Nagios or Cacti resolved the issue. Running both seems to have  
hammered the
server a bit (4Gb of RAM, 2 x dual core 2.x Ghz CPUs). We don't  
integrate
Cacti and Nagios, however. Nagios does both trending and alerts of  
all
servers. Cacti does trending only of all network devices/ports.  
Once I moved

Cacti to its own server, all was fine as far as load/latency went.
  

That's useful to know Andrew, thanks.

Regarding the trending of network devices - is there any reason why
this can't be done by Nagios? I intend to install PNP4Nagios to take
care of graphing anyway, but I think it would be nice to have all my
monitored resources under the one system (for notifications and ease
of administration).

Is there some major advantage that Cacti provides when it comes to
SNMP monitoring of network devices that cannot be achieved with Nagios
and the various SNMP plug-ins available for it (e.g. like these ones
http://nagios.manubulon.com) ?

Cheers,
Chris

--- 
--- 
--- 
-

This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when  
reporting any issue.

::: Messages without supporting info will risk being sent to /dev/null




--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null
  
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Children "unreachable" on soft down?

2009-04-08 Thread Israel Brewster
So is this just something I'll have to live with? I don't seem to be  
getting much feedback on the subject. :(
---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---



On Apr 6, 2009, at 10:55 AM, Israel Brewster wrote:

> On Apr 6, 2009, at 9:03 AM, Giorgio Zarrelli wrote:
>
>> Hi,
>>
>> I've not quite clear what happens to you,
>
> Thanks for the response. For clarification, the exact sequence of
> events is as follows:
>
> 1) The link between the nagios box and one of our routers, which we
> will refer to as the parent host glitches for 30 seconds or so. Due to
> the nature of the link (satellite connection) this is semi-expected,
> and happens a couple of times a day.
>
> 2) Nagios catches this glitch in one of it's regularly scheduled host
> checks, and puts the parent host into a soft down state. Again, normal
> and expected - even good.
>
> 3) At the same time, Nagios puts the children of the parent host into
> an "unreachable" state. Makes sense, at least, but leads to the issue
>
> 4) The parent host is now in recheck mode (as it is only in a soft
> down state and has three rechecks set), so it checks again a minute
> later. This check succeeds, as the outage was transitory. The parent
> host is put back into an "UP" state. As it never was in a hard "down"
> state, no notification is sent. This is good.
>
> 5) since the parent is now up, the child host now is changed to a
> (soft I think) "down" state.
>
> 6) check continue on a normal schedule. As the link does not glitch
> again for several hours, parent remains up and child remains
> (correctly) down. Three checks later, child enters a hard "down" state
> (since it was unreachable and only just switched back to down). Down
> notification is sent for child.
>
> 7) Everything remains good for the next several hours until the link
> glitches again. Repeat from step one.
>
> The notification in step 6 is the problem here - the child host was
> down before the glitch, the child host is still down after. But
> because the child host was temporarily put in an unreachable state, we
> get notified again that it is down, resulting in a string of "DOWN"
> messages with no up or real change in status.
>
>> but one thing I have in mind is try
>>
>> soft_state_dependencies=0
>>
>> Besides that, the problems seems to be in the roots of the check.
>> It's not
>> healty to have a ping check failing every 2 strikes. Try to change
>> the host
>> alive check, using a ssh check instead.
>
> The check is not failing every 2 strikes. It's failing once, briefly,
> every few hours - just barely long enough to make one check fail and
> throw the parent host into a soft down state. The first recheck (one
> minute later) works fine, bringing the parent back to an up state. The
> next several hundred or more checks also work fine (as the problem was
> transitory and brief). For this reason, changing the check wouldn't
> help - for the duration of that single check, the host really is down
> (or more precisely, unreachable, as it is a link issue), and any check
> I used would say so.
>
>> Another approach, not so useful, would be to increase the timeout
>> for the ping
>> (-W) so it will have less chances to fail.
>
> except that it's not a timeout issue. It is a very real, albeit brief
> (around 30 seconds or so), outage. Not long enough or frequent enough
> to really impact productivity or anything, but long enough for nagios
> to catch it (for a single check).
>
> ---
> Israel Brewster
> Computer Support Technician II
> Frontier Flying Service Inc.
> 5245 Airport Industrial Rd
> Fairbanks, AK 99709
> (907) 450-7250 x293
> ---
>>
>> Giorgio
>>
>> Israel Brewster (isr...@frontierflying.com) scritto:
>>>
>>> So does anyone have any ideas as to how I can resolve this  
>>> situation?
>>> It continues to be an annoyance. Thanks.
>>>
>>> ---
>>> Israel Brewster
>>> Computer Support Technician II
>>> Frontier Flying Service Inc.
>>> 5245 Airport Industrial Rd
>>> Fairbanks, AK 99709
>>> (907) 450-7250 x293
>>> ---
>>>
>>>
>>>
>>> On Mar 31, 2009, at 8:17 AM, Israel Brewster wrote:
>>>
 On Mar 31, 2009, at 1:09 AM, Andreas Ericsson wrote:

> Israel Brewster wrote:
>> Does nagios (3.0.3) mark a child host as unreachable when its
>> parent  enters a soft down state? I am finding myself getting
>> repeated down  messages for a host (which is, in fact, down),  
>> even
>> though I have  notifications set to only send a single message.
>> Looking at the logs,  it would appear that what is happening is
>> that the host is flipping  between "down" (which notifies me) and
>> "unreachable"

Re: [Nagios-users] Nagios Warning Bug/Misconfiguration

2009-04-08 Thread Thomas Donnelly
Got the issue. Turns out someone installed the Cisco MIBS along with the 
net snmp mibs. Thank you very much for your help.

For anyone searching and finds this check

/usr/local/share/snmp/mibs

and see if you have some-snmp-mib.txt and some-snmp-mib.my


Thanks Again!
-=Tom Donnelly


Patrick Morris wrote:
> On Tue, 07 Apr 2009, Thomas Donnelly wrote:
>
>   
>> Thanks for the quick reply!
>>
>> Ran from the command line got:
>>
>> # ./check_snmp -H 192.168.97.71 -o mib-2.33.1.2.4.0 -C secret -w 95: -c 75:
>> SNMP WARNING - 100 | SNMPv2-SMI::mib-2.33.1.2.4.0=100
>>
>> # ./check_snmp -H 192.168.97.71 -o mib-2.33.1.2.4.0 -C secret -w 95 -c 75:
>> SNMP WARNING - *100* | SNMPv2-SMI::mib-2.33.1.2.4.0=100
>>
>> So by intentionally triggering it again (remove ":"), it shows the *'s
>> 
>
> How about if you add a "-v" to get verbose output?
>
> Also, you may want to check the return code from the manual run on an OK
> resultx (for example, by running "echo $?" aafter your check_snmp
> command to make sure it matches what you see in the output).
>
> What happens in my case occasionally is that I install a screwed-up MIB
> for an unrelated service. It won't show any obvious errors, but it will
> cause check_snmp to return a warning result code regardless of whether
> the SNMP result falls within my thresholds.
>
> In effect, it's warning me that my MIBs are hosed, based on
> the fact that it got a non-OK result from snmpget (which is what
> check_snmp calls to do the actual SNMP getting).
>
>
>
>   
>> Not really sure what they mean by:
>>
>> 1. Prevent check_snmp from loading the MIBs (default behaviour) by using 
>> numeric oids AND using the "-m :" option
>> 
>
> If you a numeric OID rather than "mib-2.33.1.2.4.0" and pass the "-m :"
> then check_snmp (and, by extension, snmpget) don't need to load the MIBs
> at all, so you don't get an error if you've got a bad MIB.
>
>   
>>
>>
>> Patrick Morris wrote:
>> 
>>> What happens when you run it manually? 
>>>
>>> This, maybe?
>>>
>>> http://www.nagios.org/faqs/viewfaq.php?faq_id=208
>>>
>>> On Tue, 07 Apr 2009, Thomas Donnelly wrote:
>>>
>>>   
>>>   
 Hi all,

 I am having an issue with all of the devices I added showing warning all 
 the time. It is a simple snmp check to see if the amps are above 
 160warn/180critical. They always say warning even though they are less 
 than the specified 160. One thing to note is once it hits the 160 mark 
 it gets the * value * in turn, showing that it is actually in the 
 warning range. I have shown the neccesary data I hope below. Any/all 
 help is greatly appreciated.



 # uname -a
 FreeBSD server.example.net 5.5-RELEASE-p2 FreeBSD 5.5-RELEASE-p2 #3: Tue 
 Oct  9 22:39:13 EST 2007 
 r...@server.example.net:/usr/obj/usr/src/sys/MONITOR  i386

 Nagios
 Version 2.0b3


 # ./check_snmp -V
 check_snmp (nagios-plugins 1.4.3) 1.58



  From the webui

 APC-RR-R3-1.hou
  check_rr_amp
  WARNING  04-07-2009 13:00:17   8d 3h 17m 2s  10/10  SNMP WARNING - 90

  APC-RR-R3-2.hou
  check_rr_amp
  WARNING  04-07-2009 12:57:52  18d 2h 46m 48s  10/10  SNMP WARNING - *160*

 ^note the * 160 * for the one that actually is in the warning range.


 from checkcommands.cfg

 define command {
command_name check_rr_amp
command_line$USER1$/check_snmp -H $HOSTADDRESS$ -o 
 mib-2.33.1.4.4.1.3.1 -C cPanel -w $ARG1$ -c $AR
 }


  From the hosts config file.

 define service{
host_name   APC-RR-R1-1.hou
service_description check_rr_amp
check_command   check_rr_amp!159!179
max_check_attempts  10
normal_check_interval   5
retry_check_interval3
check_period24x7
notification_interval   30
notification_period 24x7
notification_optionsw,c,r
contact_groups  backup-admins
 }



 Thanks!
 -=Tom


 --
 This SF.net email is sponsored by:
 High Quality Requirements in a Collaborative Environment.
 Download a free trial of Rational Requirements Composer Now!
 http://p.sf.net/sfu/www-ibm-com
 ___
 Nagios-users mailing list
 Nagios-users@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/nagios-users
 ::: Please include Nagios version, plugin version (-v) and OS when 
 reporting any issue. 
 ::: Messages without supporting info will risk being sent to /dev/null
 
 


-

Re: [Nagios-users] Children "unreachable" on soft down?

2009-04-08 Thread Marc Powell

On Apr 8, 2009, at 11:44 AM, Israel Brewster wrote:

> So is this just something I'll have to live with? I don't seem to be
> getting much feedback on the subject. :(

Well, my response would be to fix the problem that's causing the  
outages in the first place or adjust the way you're monitoring the  
parents so that the plugin used recognizes when this temporary event  
is occurring. What you're asking for is that nagios track that the  
child went from down->unreachable->down without an intermediate OK  
state and suppress notifications in that case. That would appear to be  
a code change and would be better discussed on nagios-devel but I  
would encourage the check plugin approach first.

--
Marc


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread gmarzot

> Is there some major advantage that Cacti provides when it comes to
> SNMP monitoring of network devices that cannot be achieved with Nagios
> and the various SNMP plug-ins available for it (e.g. like these ones
> http://nagios.manubulon.com) ?

Also does anyone have some nagios config examples integrating PNP and
these SNMP plugins...

I have been trying to get an idea how to create the commands.cfg and
services.cfg using these parts... Any examples of host based checks
would be great. 

I have tried to read the relevant docs but have not found explicit
nagios .cfg examples... if they exist a gentle pointer would also be
great.

thank you, Giovanni


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Children "unreachable" on soft down?

2009-04-08 Thread Israel Brewster




On Apr 8, 2009, at 9:28 AM, Marc Powell wrote:

>
> On Apr 8, 2009, at 11:44 AM, Israel Brewster wrote:
>
>> So is this just something I'll have to live with? I don't seem to be
>> getting much feedback on the subject. :(
>
> Well, my response would be to fix the problem that's causing the
> outages in the first place or adjust the way you're monitoring the
> parents so that the plugin used recognizes when this temporary event
> is occurring.

Ok, fair enough. There is nothing we can do about the outages (as I  
explained in one of my e-mail, they are an artifact of the connection  
type), so that leaves us with adjusting the monitoring. Now I thought  
that the recheck options were there exactly for this reason: to catch  
brief outages and not alert. And for the parent host that seems to be  
the case, but apparently that logic doesn't carry on to the child  
hosts. As such, somehow things would need to be adjusted so it never  
even sees the outages, even enough to go into a soft down state.  
Anyone have any suggestions for how I can accomplish this? Adjusting  
the timeout or using, say, an ssh check rather than icmp won't do it -  
the packets are still lost, and the ssh check would still timeout..  
Perhaps if I sent more pings at longer intervals (so that if it  
doesn't get a response the single check retries at 15 second intervals  
or so before returning a response), but then the check would start  
taking several seconds or more to complete, and that wouldn't be a  
good thing. Assuming nagios even allowed a check to run that long -  
doesn't it have a mechanism to kill a check that doesn't return in a  
given time frame? I'm a little stumped here how I can adjust things.

> What you're asking for is that nagios track that the
> child went from down->unreachable->down without an intermediate OK
> state and suppress notifications in that case. That would appear to be
> a code change and would be better discussed on nagios-devel but I
> would encourage the check plugin approach first.

Ok. I know there is code in there that know who it sent down messages  
to and doesn't send up messages to people that didn't get a down  
(primarily dealing with escalations) so I was hoping that maybe there  
would be something similar for this, i.e. seeing that the last  
notification sent was a down notification, and as such there is no  
need to send another. But if not, so be it. Thanks for the response!

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
>
> --
> Marc
>
>
> --
> This SF.net email is sponsored by:
> High Quality Requirements in a Collaborative Environment.
> Download a free trial of Rational Requirements Composer Now!
> http://p.sf.net/sfu/www-ibm-com
> ___
> Nagios-users mailing list
> Nagios-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when  
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] multiple parents

2009-04-08 Thread Lori Adams
We have several hosts that have multiple parents.

Will the child notify as down if only one of the parents is down?  Or will the 
child suppress notifications because one of its parents is down?

-Lori
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Children "unreachable" on soft down?

2009-04-08 Thread Christopher Burke
I wonder if there is something you can do with notification escalations?
I know you can control how the notifications are sent out, but I don't
know if a state change from down to unreachable to down will cause the
escalation to reset.

 

 

From: Israel Brewster [mailto:isr...@frontierflying.com] 
Sent: Wednesday, April 08, 2009 2:32 PM
To: Marc Powell
Cc: nagios-users@lists.sourceforge.net Users
Subject: Re: [Nagios-users] Children "unreachable" on soft down?

 





On Apr 8, 2009, at 9:28 AM, Marc Powell wrote:

>
> On Apr 8, 2009, at 11:44 AM, Israel Brewster wrote:
>
>> So is this just something I'll have to live with? I don't seem to be
>> getting much feedback on the subject. :(
>
> Well, my response would be to fix the problem that's causing the
> outages in the first place or adjust the way you're monitoring the
> parents so that the plugin used recognizes when this temporary event
> is occurring.

Ok, fair enough. There is nothing we can do about the outages (as I 
explained in one of my e-mail, they are an artifact of the connection 
type), so that leaves us with adjusting the monitoring. Now I thought 
that the recheck options were there exactly for this reason: to catch 
brief outages and not alert. And for the parent host that seems to be 
the case, but apparently that logic doesn't carry on to the child 
hosts. As such, somehow things would need to be adjusted so it never 
even sees the outages, even enough to go into a soft down state. 
Anyone have any suggestions for how I can accomplish this? Adjusting 
the timeout or using, say, an ssh check rather than icmp won't do it - 
the packets are still lost, and the ssh check would still timeout.. 
Perhaps if I sent more pings at longer intervals (so that if it 
doesn't get a response the single check retries at 15 second intervals 
or so before returning a response), but then the check would start 
taking several seconds or more to complete, and that wouldn't be a 
good thing. Assuming nagios even allowed a check to run that long - 
doesn't it have a mechanism to kill a check that doesn't return in a 
given time frame? I'm a little stumped here how I can adjust things.

> What you're asking for is that nagios track that the
> child went from down->unreachable->down without an intermediate OK
> state and suppress notifications in that case. That would appear to be
> a code change and would be better discussed on nagios-devel but I
> would encourage the check plugin approach first.

Ok. I know there is code in there that know who it sent down messages 
to and doesn't send up messages to people that didn't get a down 
(primarily dealing with escalations) so I was hoping that maybe there 
would be something similar for this, i.e. seeing that the last 
notification sent was a down notification, and as such there is no 
need to send another. But if not, so be it. Thanks for the response!

---
Israel Brewster
Computer Support Technician II
Frontier Flying Service Inc.
5245 Airport Industrial Rd
Fairbanks, AK 99709
(907) 450-7250 x293
---
>
> --
> Marc



--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Nagios and Cacti

2009-04-08 Thread jmoseley
I agree with Daniel's post below.  We have Nagios and Cacti running on the
same system; Nagios monitors 691 hosts and 1800 services while Cacti is
pulling stats for about the same number of hosts, but something like 3200
data sources.  They run on a dual Xeon 2.8 Ghz box with only 2 Gb or RAM
(no swapping going on).  Average load is about 1.5 and peaks at 3 about 3-4
times a day.
The key is that mysql operations are on a dedicated box with 15k SCSI
drives and RAID 10.


James Moseley




   
 Daniel Emmanuel   
 Feinsmith 
   
cc 
 04/08/2009 10:36  Nagios Users
 AM   
   Subject 
   Re: [Nagios-users] Nagios and Cacti 
   
   
   
   
   
   




If you move your mysql instance to another server, you can get much better
performance on a nagios/cacti server. Check top while cacti is running a
large install and you will see that mysql is hoarding CPU and memory
resources not leaving much for nagios.

=
Daniel Feinsmith
=
{sent from iPhone}

On Apr 8, 2009, at 8:03 AM, Andrew Davis  wrote:

  And just an FYI from my own experience... putting Nagios & Cacti on
  the same server has been somewhat problematic for us. We have over
  400 network devices between switches, routers, WAPs, etc. We also
  have about 300 monitored servers. Initially I had Nagios and Cacti
  both on one server with Cacti running via cron every 5 minutes. About
  every 5 minutes, my shells would become unresponsive for roughly 30
  to 90 seconds. Turning off either Nagios or Cacti resolved the issue.
  Running both seems to have hammered the server a bit (4Gb of RAM, 2 x
  dual core 2.x Ghz CPUs). We don't integrate Cacti and Nagios,
  however. Nagios does both trending and alerts of all servers. Cacti
  does trending only of all network devices/ports. Once I moved Cacti
  to its own server, all was fine as far as load/latency went.
A. Davis
Email: ncc...@gmail.com

"There is no limit to what a man can accomplish
 if he doesn't care who gets the credit." - Ronald Reagan



  Marco Tirado wrote:
Hello:

There are a couple of examples in the nagios exchange page of
different approachs for integrating nagios and cacti. You
should check that out.

I believe the synchronization is going to cost you time and
money, a better approach is to use nagios + pnp4naigos (this
generates nice graphs) + check_snmp_int.pl (this for bandwidth
tests). That way you have only one place to place your
configuration.  There are tons of other snmp plugins you can
use for other tests (CPU, Memory, etc),

//Marco

On Wed, Apr 8, 2009 at 11:15 AM, Christopher McAtackney <
crist...@gmail.com> wrote:
  Hi all,

  I've been looking into making use of Cacti to act as an SNMP
  management tool which runs alongside my Nagios instance.

  Ideally, what I would like to do is have Cacti monitor
  various
  SNMP-exposed metrics on my hosts, and then have a service
  check in
  Nagios which parses Cacti's results (which I believe are RRD
  files)
  and send alerts etc.

  Nagios itself will still be used for running directly checks
  for
  services running, errors in log files etc.

  Does this approach make sense?

  One issue that I can think of is the difficulty in keeping
  the config
  files of Nagios and Cacti synchronised.  I was planning on
  using Lilac
  Platform to act as my Nagios config file management tool, but
  how that
  is kept in synch with Cacti is a problem. Has anyone ever set
  up an
  arrangement like this before?

  Cheers,
  Chris

 

Re: [Nagios-users] Adaptive Monitoring: Broken?

2009-04-08 Thread Patrick Morris
On Wed, 08 Apr 2009, Marc Powell wrote:

> 
> On Apr 7, 2009, at 1:26 PM, Patrick Morris wrote:
> 
> > Here are the important stats:
> >
> > Nagios Version: Version 3.1.0
> > Proficiency Level: Pretty damned high
> 
> > While the first command works fine, and sets the service to an OK  
> > state,
> > the next two (which I've tried in various combinations) show up in the
> > Nagios logs as having been sent, but do nothing. The check that  
> > appears
> > in the config files keeps running instead of my check_ok check.
> >
> > Here's how it shows up in the logs:
> >
> > [1239128528] EXTERNAL COMMAND: CHANGE_SVC_EVENT_HANDLER;dummy- 
> > host;DNS;check_ok
> > [1239128528] EXTERNAL COMMAND: CHANGE_SVC_CHECK_COMMAND;dummy- 
> > host;DNS;check_ok
> >
> > I've noticed the message is different if I use an invalid command, so
> > I'm relatively sure I'm using the right ones; they just don't do
> > anything.
> >
> > Event handlers are enabled for these services, but even if they  
> > weren't
> > the check command should change, right?
> >
> > Am I doing something wrong here, or have I run into a bug?
> 
> I'm not using 3.x yet but just to provide some feedback, what you're  
> doing looks reasonable from my reading of the documentation. I do see  
> this in 3.1.0's commands.c though --
> 
>  /* SECURITY PATCH - disable these for the time being */
>  switch(cmd){
>  case CMD_CHANGE_GLOBAL_HOST_EVENT_HANDLER:
>  case CMD_CHANGE_GLOBAL_SVC_EVENT_HANDLER:
>  case CMD_CHANGE_HOST_EVENT_HANDLER:
>  case CMD_CHANGE_SVC_EVENT_HANDLER:
>  case CMD_CHANGE_HOST_CHECK_COMMAND:
>  case CMD_CHANGE_SVC_CHECK_COMMAND:
>  return ERROR;
>  }
> 
> That's in the right section and my reading of the code is that it does  
> exactly that; prevent changing of those values... Maybe it's something  
> being worked on in the development branch?

Thanks! I should have done some code-diving, because that goes pretty
far toward explaining why those commands don't work for me as currently
documented.



--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] how to add Q1,Q2,Q3,Q4 reports

2009-04-08 Thread XYZ XYZ
Any idea how do i add few custom reports to nagios "report period" drop down 
list in "availability report" like Q1(first quarter), Q2(second quarter)... etc.





  --
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] how to add Q1,Q2,Q3,Q4 reports

2009-04-08 Thread Andy Shellam
XYZ XYZ wrote:
> Any idea how do i add few custom reports to nagios "report period" 
> drop down list in "availability report" like Q1(first quarter), 
> Q2(second quarter)... etc.
>
>
>

Edit the source code and recompile?

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Adaptive Monitoring: Broken?

2009-04-08 Thread Andreas Ericsson
Marc Powell wrote:
> On Apr 7, 2009, at 1:26 PM, Patrick Morris wrote:
> 
>> Here are the important stats:
>>
>> Nagios Version: Version 3.1.0
>> Proficiency Level: Pretty damned high
> 
>> While the first command works fine, and sets the service to an OK  
>> state,
>> the next two (which I've tried in various combinations) show up in the
>> Nagios logs as having been sent, but do nothing. The check that  
>> appears
>> in the config files keeps running instead of my check_ok check.
>>
>> Here's how it shows up in the logs:
>>
>> [1239128528] EXTERNAL COMMAND: CHANGE_SVC_EVENT_HANDLER;dummy- 
>> host;DNS;check_ok
>> [1239128528] EXTERNAL COMMAND: CHANGE_SVC_CHECK_COMMAND;dummy- 
>> host;DNS;check_ok
>>
>> I've noticed the message is different if I use an invalid command, so
>> I'm relatively sure I'm using the right ones; they just don't do
>> anything.
>>
>> Event handlers are enabled for these services, but even if they  
>> weren't
>> the check command should change, right?
>>
>> Am I doing something wrong here, or have I run into a bug?
> 
> I'm not using 3.x yet but just to provide some feedback, what you're  
> doing looks reasonable from my reading of the documentation. I do see  
> this in 3.1.0's commands.c though --
> 
>  /* SECURITY PATCH - disable these for the time being */
>  switch(cmd){
>  case CMD_CHANGE_GLOBAL_HOST_EVENT_HANDLER:
>  case CMD_CHANGE_GLOBAL_SVC_EVENT_HANDLER:
>  case CMD_CHANGE_HOST_EVENT_HANDLER:
>  case CMD_CHANGE_SVC_EVENT_HANDLER:
>  case CMD_CHANGE_HOST_CHECK_COMMAND:
>  case CMD_CHANGE_SVC_CHECK_COMMAND:
>  return ERROR;
>  }
> 
> That's in the right section and my reading of the code is that it does  
> exactly that; prevent changing of those values... Maybe it's something  
> being worked on in the development branch?
> 

It's not. That snippet comes from Nov 30 2008 as a measure to prevent
CVE-2008-5027 (cmd.cgi authorization bypass vulnerability) and
CVE-2008-5028 (cross-site request forgery) from becoming remote command
execution vulnerabilities.

Ethan added that snippet as an extra security measure. It's been in
Nagios since 3.0.4.

Assuming both the patches I sent are applied, it's safe to remove that
particular snippet and recompile Nagios.


I wrote about the two vulnerabilities here in case anyone needs to
refresh their memory:
http://blogs.op5.org/blog4.php/2008/11/11/nagios-cmd-cgi-authorization-bypass-vuln
http://blogs.op5.org/blog4.php/2008/11/11/cross-site-request-forgery-vulnerability-6

The patches to prevent them are available here:
http://git.op5.org/git/?p=nagios.git;a=shortlog;h=refs/heads/security

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] multiple parents

2009-04-08 Thread Andreas Ericsson
Lori Adams wrote:
> We have several hosts that have multiple parents.
> 
> Will the child notify as down if only one of the parents is down?  Or
> will the child suppress notifications because one of its parents is
> down?
> 

All parents of a host has to be down for it to become unreachable.
If you have configured the host (and your contacts) to send notifications
on unreachable states, you may still get notifications for it, but not
HOST DOWN ones.

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Hostnames and regex

2009-04-08 Thread Niall O Broin
I recently changed a Nagios configuration to use a regex with  
hostnames but it hasn't been entirely satisfactory. I have e.g.

define service{
 use generic-service
 host_name host*
 service_description Total Processes
 check_command snmp_procs!120!150

but for some hosts, I need to use different thresholds. I tried to use

define service{
 use generic-service
 host_name hostNN
 service_description Total Processes
 check_command snmp_procs!200!250

but hostNN still alerts based on the host* thresholds. I tried placing  
the definition for host NN before AND after the host* definition - it  
made no difference. From what I read of Nagios regex, they're not full  
regex so it wouldn't be possible to write one which matched host* but  
didn't match hostNN.

Is there a way of doing what I want, apart from the obvious one of  
renaming the hosts which I don't want to match the regex?


__
Kindest regards,


Niall  O Broin
MakaluMedia Group | http://makalumedia.com


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] NSClient not providing results

2009-04-08 Thread Murali Krishnan S
Hi,

  When I set NSClient++ to start automatically in services, its running but
not providing the results. All I'm getting is

"could not fetch information from server" .

Even when I run the "check_nt" plugin manually also, I'm getting the same
result.

[r...@nagios libexec]# /usr/local/nagios/libexec/check_nt -H 192.168.0.119
-p 12489 -v MEMUSE  -w 80 -c 90
could not fetch information from server

After searching and reading mailing lists I've tweaked certain things like
"disabling firewall",  "enabling DEP", etc. Still the same result.

Only one way I could make the NSClient to respond is,  running it in Command
line with "-test" option.

c:\NSClient++>"NSClient++.exe" -test

When I run this, I'm getting the CHECKS GREEN..

[r...@nagios libexec]# /usr/local/nagios/libexec/check_nt -H 192.168.0.119
-p 12489 -v MEMUSE  -w 80 -c 90
Memory usage: total:4308.71 Mb - used: 1606.41 Mb (37%) - free: 2702.30 Mb
(63%) | 'Memory usage'=1606.41Mb;3446.97;3877.84;0.00;4308.71

Everytime I reboot the machine. I need to start this in command line.  Any
solution, inputs ?   Please provide...

Thanks.

--
Regards
Mkrish
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Monitor netstat connection states using nagios.

2009-04-08 Thread asam30
Hi All,

I am using check_tcp to check status of a particular port on the server.
This is working good. I would also need to monitor LISTEN status (SYN_RECV)
of that port. for ex,

I have a ldap service running on port 3890, so the command

netstat  -anp  | grep 3890

tcp0  0 0.0.0.0:3890 0.0.0.0:*
   LISTEN 16029/java
tcp0  0 10.121.30.121:3890  10.121.6.1:8831
ESTABLISHED 16029/java
tcp0  0 10.121.30.121:3890  10.121.6.1:61052
ESTABLISHED 16029/java
tcp  228  0 10.121.30.121:3890  10.121.6.1:49440
ESTABLISHED 16029/java
tcp0  0 10.121.30.121:3890  10.121.6.1:11664
  SYN_RECV16029/java

The establish connections are ok to allow, but we need to monitor SYN_RECV
status. If there is any such(SYN_RECV) connection appears, we immediately
get an alert from nagios. Is there any way to monitor such states with
nagios or check_tcp?

I have written some shell script to monitor such events, but also I would
like to integrate that scripts into nagios? Is that possible?

Please help me

Thanks


-- 
Shankar
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Monitor netstat connection states using nagios.

2009-04-08 Thread asam30
Hi All,

I am using check_tcp to check status of a particular port on the server.
This is working good. I would also need to monitor LISTEN status (SYN_RECV)
of that port. for ex,

I have a ldap service running on port 3890, so the command

netstat  -anp  | grep 3890

tcp0  0 0.0.0.0:3890 0.0.0.0:*
   LISTEN 16029/java
tcp0  0 10.121.30.121:3890  10.121.6.1:8831
ESTABLISHED 16029/java
tcp0  0 10.121.30.121:3890  10.121.6.1:61052
ESTABLISHED 16029/java
tcp  228  0 10.121.30.121:3890  10.121.6.1:49440
ESTABLISHED 16029/java
tcp0  0 10.121.30.121:3890  10.121.6.1:11664
  SYN_RECV16029/java

The establish connections are ok to allow, but we need to monitor SYN_RECV
status. If there is any such(SYN_RECV) connection appears, we immediately
get an alert from nagios. Is there any way to monitor such states with
nagios or check_tcp?

I have written some shell script to monitor such events, but also I would
like to integrate that scripts into nagios? Is that possible?

Please help me or provide some suggestions

-- 
Shankar
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Nagios for IRIX

2009-04-08 Thread amol.bute
Greeting.

 

 

I have IRIX 6.5 servers which I like to monitor using Nagios 3.0. I am
able to monitor other servers like Linux & Windows but not able to
configure nagios on IRIX OS servers.

 

Please guide me how can I install nagios plug-in on IRIX OS and
configure. Where do we will get nagios plug-ins for IRIX.

 

 

 

Thanks and Regards

Amol Bute

 

**
Email Disclaimer:

Information contained and transmitted by this e-mail (including any 
attachments) is confidential, proprietary and legally privileged data of Tata 
Technologies that is intended for use only by the addressee. If you are not the 
intended recipient, you are notified that any review, use, dissemination, 
distribution, copying or printing of this e-mail is strictly prohibited. You 
are requested to delete this e-mail or any copies immediately and notify the 
sender by reply email. Internet communications cannot be guaranteed to be 
timely, secure, error or virus-free.  Tata Technologies does not accept any 
liability for virus infected email or errors or omissions or consequences which 
may arise as a result of this e-mail transmission. To know more about Tata 
Technologies please visit http://www.tatatechnologies.com

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Service Dependency

2009-04-08 Thread Nair
Friends,
I need to define mutal dependancy between two web services in different hosts.

PLease see the config below which end up throwing error:
---
Error: A circular execution dependency (which could result in a deadlock) 
exists for service 'www' on host 'host2'!
---
# Service Dependency Definition for cfservd 
define servicedependency{
host_namehost1
service_description   www
dependent_host_name host2
dependent_service_description   www
execution_failure_criteriao
notification_failure_criteria  c,p
}

define servicedependency{
host_namehost2
service_descriptionwww
dependent_host_name   host1
dependent_service_description  www  
execution_failure_criteriao
notification_failure_criteria  c,p
}

Please share your thoughts for getting the config right.

Thank you in advance,
Nair

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Nagios for IRIX

2009-04-08 Thread ankit . mehrotra
HI,

Not very sure about the problem u are facing 
To my understanding of your, problem you can compile the source files of 
the add-ons and the plug-ins and use them for monitoring.

If this is not the issue, then send  the problem in detail !

Cheers,
Ankit 




 
04/09/2009 10:40 AM

To

cc

Subject
[Nagios-users] Nagios for IRIX






Greeting.
 
 
I have IRIX 6.5 servers which I like to monitor using Nagios 3.0. I am 
able to monitor other servers like Linux & Windows but not able to 
configure nagios on IRIX OS servers.
 
Please guide me how can I install nagios plug-in on IRIX OS and configure. 
Where do we will get nagios plug-ins for IRIX.
 
 
 
Thanks and Regards
Amol Bute
 
**
Email Disclaimer:

Information contained and transmitted by this e-mail (including any 
attachments) is confidential, proprietary and legally privileged data of 
Tata Technologies that is intended for use only by the addressee. If you 
are not the intended recipient, you are notified that any review, use, 
dissemination, distribution, copying or printing of this e-mail is 
strictly prohibited. You are requested to delete this e-mail or any copies 
immediately and notify the sender by reply email. Internet communications 
cannot be guaranteed to be timely, secure, error or virus-free.  Tata 
Technologies does not accept any liability for virus infected email or 
errors or omissions or consequences which may arise as a result of this 
e-mail transmission. To know more about Tata Technologies please visit 
http://www.tatatechnologies.com

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when 
reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null
ForwardSourceID:NT00034FB2 
=-=-=
Notice: The information contained in this e-mail
message and/or attachments to it may contain 
confidential or privileged information. If you are 
not the intended recipient, any dissemination, use, 
review, distribution, printing or copying of the 
information contained in this e-mail message 
and/or attachments to it are strictly prohibited. If 
you have received this communication in error, 
please notify us by reply e-mail or telephone and 
immediately and permanently delete the message 
and any attachments. Thank you


--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null