Re: [Nagios-users] Bonding/Teaming Monitoring

2010-08-30 Thread diego . roccia
On the server side, you should:
 - monitor the physical interfaces (SNMP)
 - monitor the bonding status (in /proc/net/bonding/bond0, or the sysfs equivalent)

You could also monitor the network interfaces on the switch side. For example, 
you can use custom variables on the server's host definition to store its switch ports.

Sent from my BlackBerry® wireless device

-Original Message-
From: Jones, Stuart stuart.jo...@health.wa.gov.au
Date: Mon, 30 Aug 2010 09:29:00 
To: Nagios Users List nagios-users@lists.sourceforge.net
Reply-To: Nagios Users List nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Bonding/Teaming Monitoring

--
Sell apps to millions through the Intel(R) Atom(Tm) Developer Program
Be part of this innovative community and reach millions of netbook users 
worldwide. Take advantage of special opportunities to increase revenue and 
speed time-to-market. Join now, and jumpstart your future.
http://p.sf.net/sfu/intel-atom-d2d
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios script output issue

2010-08-30 Thread diego . roccia
You aren't passing that http://xyz.com thing to the server, are you?

Sent from my BlackBerry® wireless device

-Original Message-
From: newme me allan...@gmail.com
Date: Sun, 29 Aug 2010 18:12:10 
To: nagios-users@lists.sourceforge.net
Reply-To: Nagios Users List nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Nagios script output issue



[Nagios-users] Are service contacts 'inherited' from respective host?

2010-08-30 Thread Danilo Godec
 Hi,

I'm monitoring a bunch of hosts, and each of these hosts has at least two
services used to monitor and graph ping RTA and packet loss. Host
configurations are organized in several .cfg files (by building), and there
is also a hostgroup definition grouping all hosts in a building.
However, hosts in a building can have different contacts...

I defined services using hostgroups, like this:

define service {
        use             template
        hostgroup_name  MYHOSTGROUP
}

I was hoping a service that doesn't have a contact definition would
'inherit' the contact from the host, but it seems that's not the case.

Is there a way to make that happen? Or will I need to define services
one by one and add a contact definition to each?

  Danilo


-- 
Danilo Godec, sistemska podpora / system administration

Visit our updated web page at www.agenda.si 

OPEN SOURCE AND LINUX 
SERVICES : BUSINESS SOLUTIONS : IT MANAGEMENT : IT INFRASTRUCTURE : TRAINING : 
SOFTWARE 




[Nagios-users] Suggestion: Macro cleansing/clean_macro_chars() in macros.c should not allow trailing \

2010-08-30 Thread Peter Valdemar Mørch (Lists)
Hi,

clean_macro_chars() basically removes the illegal_output_chars from char 
*macro if (options & STRIP_ILLEGAL_MACRO_CHARS).

Now, by default '\' is not an illegal output char, so it is not 
stripped. However that can cause problems if e.g. we have a

command_line plugin arg0 $STRIPPED_MACRO$ arg2

Consider:

$STRIPPED_MACRO$ = normal

plugin sees:

arg0: arg0
arg1: normal
arg2: arg2

All is well. However, if:

$STRIPPED_MACRO$ = foo\

plugin sees:

arg0: arg0
arg1: foo arg2
arg2: NULL

The trailing backslash from foo causes the following space to be escaped. This 
can be avoided by putting two spaces after $STRIPPED_MACRO$ in the 
command_line, but that isn't quite intuitive, is it? This value could 
arise from several pre-stripping values of $STRIPPED_MACRO$: foo\, 
foo\$, foo\ etc.

I suggest that if stripping is to be truly safe, somehow it should be 
possible to avoid trailing backslashes as that messes up the args given 
to plugins.
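
The effect can be reproduced outside Nagios. The sketch below uses Python's shlex module as a stand-in for POSIX shell word splitting; strip_trailing_backslashes is a hypothetical cleanup step for illustration, not what clean_macro_chars() currently does.

```python
import shlex

COMMAND_LINE = "plugin arg0 $STRIPPED_MACRO$ arg2"

def strip_trailing_backslashes(value):
    """Hypothetical cleanup: drop any backslashes at the end of a macro value."""
    return value.rstrip("\\")

def plugin_args(macro_value):
    """Expand the macro, then split the line the way a POSIX shell would."""
    expanded = COMMAND_LINE.replace("$STRIPPED_MACRO$", macro_value)
    return shlex.split(expanded)

# A trailing backslash escapes the space that follows it, so two
# arguments are merged into one.
print(plugin_args("normal"))
print(plugin_args("foo\\"))
print(plugin_args(strip_trailing_backslashes("foo\\")))
```

With the value foo\ the split yields "foo arg2" as a single argument; after the hypothetical cleanup, "foo" and "arg2" are separate again.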

Peter

(Yes, in another thread in nagios-devel I'm talking about Custom Object 
Variables and cleansing, but I think this applies to cleansing in 
general so I post it as a separate subject. Hope that is ok.)
-- 
Peter Valdemar Mørch
http://www.morch.com



Re: [Nagios-users] Are service contacts 'inherited' from respective host?

2010-08-30 Thread Ortner, Gerald


Hi,

Which Nagios version are you using?
Since Nagios 3.0, contact_groups, notification_interval and notification_period 
are inherited from the related host definition if you don't explicitly specify 
them in the service object or in a template it inherits from.

http://nagios.sourceforge.net/docs/3_0/objectinheritance.html

Gerald

OOe. Gesundheits- und Spitals-AG
Techcenter, Hafenstrasse 47-51, 4020 Linz DVR 2107870, ATU 51928204, 
Firmenbuchgericht: Landesgericht Linz, FN 210146 p



[Nagios-users] Help needed: How do you configure dynamic thresholds depending on timeframes.

2010-08-30 Thread Alex Peeters

Example: -w = 80 -c = 90 during business hours, but -w = 90 -c = 95 
outside business hours.

How can we configure dynamic thresholds in Nagios depending on 
timeframes? (Part II)

define service{
        use                     local-service   ; Name of service template to use
        host_name               localhost
        service_description     Current Users
        check_command           check_local_users!20!50
        check_period            nonworkhours
        notification_period     nonworkhours
        }

define service{
        use                     local-service   ; Name of service template to use
        host_name               localhost
        service_description     Current Users
        check_command           check_local_users!40!60
        check_period            workhours
        notification_period     workhours
        }

In the example above I configured the same test twice. The two 
timeframes 'nonworkhours' and 'workhours' together equal 24x7.

Is this way of configuring allowed? It would solve my problem.

1) How will the Nagios scheduling react to this configuration?

2) How will the display react to this configuration?


# 'workhours' timeperiod definition
define timeperiod{
        timeperiod_name workhours
        alias           Normal Working Hours
        monday          09:00-17:00
        tuesday         09:00-17:00
        wednesday       09:00-17:00
        thursday        09:00-17:00
        friday          09:00-17:00
        }


# 'nonworkhours' timeperiod definition
define timeperiod{
        timeperiod_name nonworkhours
        alias           Non-Work Hours
        sunday          00:00-24:00
        monday          00:00-09:00,17:00-24:00
        tuesday         00:00-09:00,17:00-24:00
        wednesday       00:00-09:00,17:00-24:00
        thursday        00:00-09:00,17:00-24:00
        friday          00:00-09:00,17:00-24:00
        saturday        00:00-24:00
        }
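
One possible alternative to two services that share a host_name and service_description (which Nagios may treat as a duplicate definition) is a single wrapper plugin that picks its thresholds from the clock. The sketch below only shows the selection step; the function name thresholds_for is hypothetical, and the values and hours are copied from the service and timeperiod definitions above.

```python
from datetime import datetime, time

WORK_START = time(9, 0)    # start of 'workhours'
WORK_END = time(17, 0)     # end of 'workhours'

def thresholds_for(moment):
    """Return (warning, critical) for check_local_users at the given datetime."""
    is_weekday = moment.weekday() < 5          # Monday is 0, Sunday is 6
    if is_weekday and WORK_START <= moment.time() < WORK_END:
        return 40, 60                          # 'workhours' thresholds
    return 20, 50                              # 'nonworkhours' thresholds

warning, critical = thresholds_for(datetime.now())
print("-w %d -c %d" % (warning, critical))
```

The wrapper would then run the real check with these values and pass its output and exit code through unchanged.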


check_period: This directive is used to specify the short name of the
time period during which active checks of this host can be made.

check_period: This directive is used to specify the short name of the
time period during which active checks of this service can be made.

If you do not use the check_period directive to specify a timeperiod,
Nagios will be able to schedule active checks of the host or service
anytime it needs to. This is essentially a 24x7 monitoring scenario.

Specifying a timeperiod in the check_period directive allows you to
restrict the time that Nagios performs regularly scheduled, active checks
of the host or service. When Nagios attempts to reschedule a host or
service check, it will make sure that the next check falls within a
valid time range within the defined timeperiod. If it doesn't, Nagios
will adjust the next check time to coincide with the next valid time in
the specified timeperiod. This means that the host or service may not
get checked again for another hour, day, or week, etc.

Timeperiods:
Exclusions and Host/Service Checks - There is a bug in the service/host
check scheduling logic that rears its head when you use timeperiod
definitions that use the exclude directive. The problem occurs when
Nagios Core tries to re-schedule the next check. In this case, the
scheduling logic may incorrectly schedule the next check further out in
the future than it should. In essence, it skips over the (missing) logic
where it could determine an earlier possible time using the exception
times. Imperfect Solution: Don't use timeperiod definitions that exclude
other timeperiods for your host/service check periods. A fix is being
worked on, and will hopefully make it into a 3.4.x release.




Re: [Nagios-users] Internal Server Error Occuring when trying to access Nagios on the browser

2010-08-30 Thread Alex Dehaini
What ports do Tomcat and Postgres listen on?
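
Once the ports are known, a plain TCP connect is enough to tell whether each service is accepting connections. This is a rough sketch with a hypothetical function name check_tcp; the ports in the usage lines (8080 for Tomcat, 5432 for Postgres) are only the common defaults and are an assumption about this setup.

```python
import socket

def check_tcp(host, port, timeout=3.0):
    """Return (exit_code, message) depending on whether a TCP connect succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return 0, "OK - %s:%d accepts connections" % (host, port)
    except OSError as error:
        # Covers refused connections, timeouts and name resolution failures.
        return 2, "CRITICAL - %s:%d unreachable (%s)" % (host, port, error)

# Assumed default ports; adjust to the actual configuration.
for name, port in (("Tomcat", 8080), ("Postgres", 5432)):
    code, message = check_tcp("127.0.0.1", port, timeout=1.0)
    print(name, message)
```

The stock check_tcp and check_http plugins cover the same ground; the sketch only illustrates what such a check does.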

On Fri, Aug 27, 2010 at 3:40 AM, Ifeanyi Agu ifepet2...@yahoo.com wrote:

   Hello Alex,

 I want to monitor Apache Tomcat (i.e. to know whether it is running or not). I
 also want to monitor the Postgres service.

 My Nagios is working well now, but I noticed that it does not send me mail.
 I am using CentOS. Is there any other configuration I need to do outside or
 within Nagios to enable mail to be sent?

 Ifeanyi Agu
 08063728802


 --- On *Thu, 26/8/10, Alex Dehaini alexdeha...@gmail.com* wrote:


 From: Alex Dehaini alexdeha...@gmail.com
 Subject: Re: [Nagios-users] Internal Server Error Occuring when trying to
 access Nagios on the browser
 To: Nagios Users List nagios-users@lists.sourceforge.net
 Date: Thursday, 26 August, 2010, 7:24 PM


 Yep,

 Nagios has tons and tons of plugins that can monitor virtually any service
 - what exactly do you want to monitor, a service, connection to a port,
 what? Let us know and we can help

 Regards,
 Alex

 PS: Review the docs; common questions are usually answered there. :)



Re: [Nagios-users] Nagios script output issue

2010-08-30 Thread Marc Powell

On Aug 29, 2010, at 8:12 PM, newme me wrote:

 ./check_http -H xyz.com -p  -u /abc -t 3
 OK
 
 Result from Nagios Interface - 
 nodename nor servname provided, or not known
 HTTP CRITICAL - Unable to open TCP socket
 
 I am thinking that this has something to do with Nagios as the box is 
 behaving just fine based on the commandline result(and verified on the URL 
 through the browser).

It seems like you may have multiple nagios daemons running at the same time; 
one that's using a previously bad config, and one with a good new config.

Stop nagios, verify that all nagios processes are dead and restart.

--
Marc




Re: [Nagios-users] Nagios script output issue

2010-08-30 Thread newme me
Hi Marc,

Thanks for your reply. I checked for multiple Nagios processes, but there
is only one running.

Any other thing you can think of which might be causing this odd behaviour?

Thanks,
Allan.

On Sun, Aug 29, 2010 at 6:12 PM, newme me allan...@gmail.com wrote:

 Folks,

 I switched to the check_http plugin (an official Nagios plugin), but it
 has the same problem.

 When I first start Nagios, the status on the Nagios web interface
 is the same as the one returned from the command line.

 After some time the status becomes CRITICAL and no longer matches the
 command line, which still returns the correct status of OK.

 Result from the command line (and browser) -
 ./check_http -H xyz.com -p  -u /abc -t 3
 OK

 Result from the Nagios interface -
 nodename nor servname provided, or not known
 HTTP CRITICAL - Unable to open TCP socket

 I think this has something to do with Nagios, as the box is
 behaving just fine based on the command-line result (also verified against
 the URL through the browser).

 Please help!

 Thanks,
 Allan.


 On Thu, Aug 26, 2010 at 10:26 PM, newme me allan...@gmail.com wrote:

 Hi Folks,

 Nagios is acting a little weird for me. I have an external script
 hooked into Nagios; it merely does a curl/wget on a URL and returns a
 status based on a string in the content/output. For the first 2-3 hours the
 script returns the right status and Nagios reports it correctly, i.e. OK, WARN
 or ERROR based on the script's exit code. After 2-3 hours, output that was (and
 should be) OK or WARN starts coming back as CRITICAL, and the output line says
 "Application is " and not even "Application is ERROR" or "Application is
 FATAL".

 There is nothing in the logs to suggest what the problem could be. Have
 you experienced this before, and can you tell me the corrective action? I am
 running Nagios on Mac OS X.

 Here is the script for the curious -

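 #!/bin/bash

 # Read the URL to check from the file named by the first argument.
 read URL < "$1"

 # Take the second double-quote-delimited field of the "summary" line.
 STATUS=`curl -s "$URL" | grep summary | awk -F\" '{print $2}'`
 echo "Application is $STATUS"
 echo "curl $URL"

 # Map the status onto Nagios exit codes; anything unexpected is CRITICAL.
 case "$STATUS" in
 OK)
    exit 0
    ;;
 WARN)
    exit 1
    ;;
 ERROR)
    exit 2
    ;;
 FATAL)
    exit 2
    ;;
 *)
    exit 2
    ;;
 esac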

 Thanks,

 Allan.



--
This SF.net Dev2Dev email is sponsored by:

Show off your parallel programming skills.
Enter the Intel(R) Threading Challenge 2010.
http://p.sf.net/sfu/intel-thread-sfd

[Nagios-users] Help hacking a script to monitor my AppServers?

2010-08-30 Thread Mirza Dedic
I am trying to monitor our OpenEdge Database Application Servers on our AIX box 
with Nagios, and to send an alert based on a tricky situation, described 
below:

The tool that I run locally on the AIX host to check our servers outputs the 
following:

# asbman -name production -query
PID      State      Port   nRq     nRcvd   nSent   Started             Last Change
2175264  AVAILABLE  15614  006968  007643  008661  Aug 29, 2010 19:03  Aug 30, 2010 11:16
922080   AVAILABLE  15616  002675  002970  003465  Aug 30, 2010 09:22  Aug 30, 2010 11:16
483832   AVAILABLE  15607  001740  001939  002204  Aug 30, 2010 10:02  Aug 30, 2010 11:16
2936902  AVAILABLE  15620  001753  001955  002236  Aug 30, 2010 10:02  Aug 30, 2010 11:16
3658058  AVAILABLE  15623  001742  001921  002190  Aug 30, 2010 10:02  Aug 30, 2010 11:16
3027394  AVAILABLE  15608  001297  001377  001493  Aug 30, 2010 10:21  Aug 30, 2010 11:16
3965150  SENDING    15611  001290  001352  001465  Aug 30, 2010 10:21  Aug 30, 2010 06:12
4186168  AVAILABLE  15615  001300  001379  001549  Aug 30, 2010 10:21  Aug 30, 2010 11:16
2044322  AVAILABLE  15617  001298  001369  001558  Aug 30, 2010 10:21  Aug 30, 2010 11:16

I would like Nagios to raise an alert if State = SENDING for more than 6 hours 
(based on Last Change). Would this be possible?

Does anyone have an idea of how I would do this?

I guess one way would be to make a shell script that runs "asbman -name 
production -query | grep SENDING > status.txt" and have a "check log" type 
check read the file and raise an error if anything is found. But that does not 
implement the "Last Change" date/time idea: we will always have PIDs going from 
AVAILABLE to SENDING and back, and only those that are stuck in SENDING for 6+ 
hours are of concern.

Any help on this is appreciated.


The Oppenheimer Group  CONFIDENTIAL

This message is for the designated recipient only and may contain privileged, 
proprietary, or otherwise private information. If you have received it in 
error, please notify the sender immediately and delete the original. Any other 
use of the email by you is prohibited.


Re: [Nagios-users] Help hacking a script to monitor my AppServers?

2010-08-30 Thread Herb J.
You will have to write your script so that it parses the date fields in 
that output and calculates the difference between the Last Change time 
and the current time.


Is it possible to have that program output UNIX timestamps instead? If 
not, it will require you to get creative with awk, Perl, shell, etc. 
scripting to parse the fields properly in order to do the required math 
to determine how long it has been in that state.


Depending on how long that binary takes to run (e.g., if it takes less 
than a few seconds), you may not need to have it run separately and 
output data to a text file. You may be able to just have your custom 
plugin script run it and parse the output directly.
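
The parsing step could look roughly like the sketch below. It assumes each row of the asbman output arrives on a single line with the columns shown earlier in this thread (PID, State, Port, three counters, Started, Last Change) and that month names are in English; the function names are hypothetical.

```python
from datetime import datetime, timedelta

def stuck_agents(output, now, state="SENDING", limit=timedelta(hours=6)):
    """Return [(pid, port, age)] for rows that have been in `state` longer than `limit`."""
    stuck = []
    for line in output.splitlines():
        fields = line.split()
        # A data row has 14 whitespace-separated fields: PID, state, port,
        # three counters, then two 4-token timestamps like "Aug 30, 2010 06:12".
        if len(fields) != 14 or fields[1] != state:
            continue
        last_change = datetime.strptime(" ".join(fields[10:14]), "%b %d, %Y %H:%M")
        age = now - last_change
        if age > limit:
            stuck.append((fields[0], fields[2], age))
    return stuck

def check(output, now):
    """Map the result onto Nagios exit codes (0 = OK, 2 = CRITICAL)."""
    stuck = stuck_agents(output, now)
    if stuck:
        pids = ", ".join(pid for pid, _port, _age in stuck)
        return 2, "CRITICAL - stuck in SENDING for over 6 hours: PID " + pids
    return 0, "OK - no agent stuck in SENDING"
```

A plugin would capture the tool's output (for example with subprocess), call check(output, datetime.now()), print the message and exit with the returned code.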







Re: [Nagios-users] Nagios script output issue

2010-08-30 Thread newme me
One more thing I would like to add here: I have 5 URL checks, 2 through
check_http and 3 through my own curl script, and all of them fail at the same
time. For the first 2-3 hours all of them return the right status.

The error that I am getting from Nagios (check_http), "nodename nor
servname provided, or not known HTTP CRITICAL - Unable to open TCP socket",
looks as if the DNS server becomes inaccessible, but when I run the
commands from the command line, the status is fine. Is there something
within Nagios that I need to set/configure?
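
"nodename nor servname provided, or not known" is the message getaddrinfo gives when a name cannot be resolved, so one way to narrow this down is to run a resolution-only check as a Nagios service and compare it with the same check run from a shell. A minimal sketch, with a hypothetical function name check_resolution and an injectable resolver so it can be exercised without a network:

```python
import socket

def check_resolution(hostname, resolver=socket.getaddrinfo):
    """Return (exit_code, message) describing whether `hostname` resolves."""
    try:
        results = resolver(hostname, None)
    except socket.gaierror as error:
        return 2, "CRITICAL - cannot resolve %s: %s" % (hostname, error)
    # Each getaddrinfo entry ends with a sockaddr tuple whose first item is the address.
    addresses = sorted({entry[4][0] for entry in results})
    return 0, "OK - %s resolves to %s" % (hostname, ", ".join(addresses))
```

If this check also turns CRITICAL after a few hours while the shell still resolves the name, the difference is likely in the environment the Nagios daemon runs in rather than in the plugins.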

Please help!

Thanks,
Allan.




