Re: [Nagios-users] sms notifications

2012-09-11 Thread Frater, Greg J
Hi Frank, 

 

We just bought the MultiTech MultiModem iCell and need to interface it
to Nagios.  Care to share what your using (home grown script, SMS
gateways, etc.) to send alerts from Nagios through your iSMS?

 

Thanks,

 

-greg

 

From: Frank Bulk [mailto:frnk...@iname.com] 
Sent: Wednesday, February 01, 2012 6:55 PM
To: NAGIOS
Subject: Re: [Nagios-users] sms notifications

 

We use the MultiTech MultiModem iSMS - works great.

 

Frank

 

From: Marco Borsani [mailto:m.bors...@it.net] 
Sent: Wednesday, February 01, 2012 9:04 AM
To: NAGIOS
Subject: [Nagios-users] sms notifications

 

Hi all

 

To send sms notifications, I am using since 2003 a couple of radiomodem,
connected via serial to my server and smstools SW.

 

Now, I need something "stronger" compared that configuration because for
specific checks I have to send many and many sms at the same time.

 

What can I use ?

 

Regards

Marco

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] qpage - OT

2010-12-16 Thread Frater, Greg J
Thanks for the response Benny, I appreciate any help I can get

>> qpage error:
>> <502 MESSAGE REJECTED - STX OR EOT EXPECTED>

>It would have been nice to see your qpage.cf file...  ;)
That seems obvious, see below

>Be sure you have 'parity=even' in your config.  When you run a test
with verbose and interactive flags set, do you fail five or six times
before you get that message?

I've never tried the interactive flag, I will do so.  As far as the
failures go when I had the retry set to 20 it would to fail 5 times in a
row and then reset the modem or something, I can't fully interpret the
logs, and then retry again possible 20 times? as in 20 sets of 5. 

The interactive (-i) option seems to require a page to be sent right
now.  As of yet I have been unable to get a failure when sending a page
manually but I think I've really only sent a small number 10-20 pages
manually.  The only times it has failed so far is when it's running in
daemon mode.  Do you guys use USB modems with qpage?  These problems got
much worse after switching to a USB modem.

qpage.cf:
#
# QuickPage configuration file
#
administrator=xxx...@xxx.xxx
queuedir=/var/spool/qpage
pidfile=/var/spool/qpage/qpage.pid

modem=modem1
  device=/dev/modem

service=att
  device=modem1
  phone=9,18009094602
  baudrate=9600
  parity=even
  allowpid=yes
  #maxtries=6
  maxtries=3
  msgprefix=false
  #maxmsgsize=250
  maxmsgsize=500


--
Lotusphere 2011
Register now for Lotusphere 2011 and learn how
to connect the dots, take your collaborative environment
to the next level, and enter the era of Social Business.
http://p.sf.net/sfu/lotusphere-d2d
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] qpage - OT

2010-12-13 Thread Frater, Greg J
Cheers All, 

I know a number of people use qpage for sending alerts via modem as we
do.  I'm hoping someone can help me with a qpage problem we're having,
there does not seem to be a qpage mailing list and as of yet the
developer has not responded to my email.

We are getting *random* alert pages that fail to send with the following
error.  I say random because I have not figured out any patterns or
commonality between them (other than the obvious they are all from
Nagios, etc.).  Roughly once a day a page will fail to send but others
before and after it will send fine, they all use the same send script.
We've always had some problems but they recently got worse after we
replaced a failed PCI modem with a USB modem from US Robotics.  Is
anyone else using USB modems, if so what brand/model?  Has anyone seen
problems like this?  Is the a better forum or mailing list I could look
to for help?  Sorry for the off topic post.

qpage error:
<502 MESSAGE REJECTED - STX OR EOT EXPECTED>

Thanks,  

Greg Frater
System Admin

--
Lotusphere 2011
Register now for Lotusphere 2011 and learn how
to connect the dots, take your collaborative environment
to the next level, and enter the era of Social Business.
http://p.sf.net/sfu/lotusphere-d2d___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] trying to fix problem with excessive latency

2010-10-08 Thread Frater, Greg J
Hi All, 

This is just a me too post to hopefully help others down the road! 

 I have inherited maintenance of a medium-sized Nagios installation.

 We currently have 649 hosts and 5415 services. Our setup works 
 nicely, with one exception: Nagios falls behind on host/service 
 checks. Our usual latency once Nagios has been running for a while 
 is about 190-200 seconds. Our Nagios host is reasonably powerful
and 
 isn't struggling; it seems that Nagios itself is limited somehow.

We are similar in size 530 hosts, 3300 service checks with a service
check latency of around 300 seconds and 350 for hosts.  There are no
obvious performance bottlenecks on our sever the closest thing to it for
us was the CPU's ave of around 30% utilization across 4 procs with 4 GB
of RAM.  Our current configuration did not have any latency issues
running Nagios 3.1.2, our latency issues started a couple of weeks ago
with an upgrade to 3.2.2. 

>It looks like use_large_installation_tweaks did the trick. I could have
sworn I tried that before; likely I set it to 0 instead of 1 or
something foolish like that.

We've avoided this in an effort to keep our configuration as plain as
possible and because I remember reading that this was not expected to be
effective unless you had something like 1000+ hosts and many more
service checks.  Apparently that is not correct, at least under 3.2.2.

>Anyway, I wanted to test the individual components of
large_installation_tweaks to see what helped.
>* child_processes_fork_twice=0: latency drops from 149 to 18

This change took our latency from ~300s to .228 for services and ~350
seconds to .458 for hosts

>* free_child_process_memory=0:  latency drops from 149 to 0.6

I made this change with the 'child_processes_fork_twice=0' change still
in place and saw no measurable drop in latencies.

>* disabling summary macros in environment variables doesn't seem to be
>  individually configurable, but I expect that's disabled already by
>  enable_environment_macros=0.

Did not do this, was not deemed necessary

>So, thanks again to everybody who helped. Latency is now 0.35% of what
it used to be.
>
>-Corey

Ditto, this was a big help to us as well, thanks.

-greg

--
Beautiful is writing same markup. Internet Explorer 9 supports
standards for HTML5, CSS3, SVG 1.1,  ECMAScript5, and DOM L2 & L3.
Spend less time writing and  rewriting code and more time creating great
experiences on the web. Be a part of the beta today.
http://p.sf.net/sfu/beautyoftheweb
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Nagios did not send notification to some contacts --again

2010-09-01 Thread Frater, Greg J
Hi All,

We had an event several weeks ago, it has happened again, I'm posted
after the first time and reposting to the same thread because they are
related.  Sorry if this makes the it confusing.  Marc thanks for the
reply to the first event, see below for my responses.

In the second event a host went down due to a power outage but only a
portion of the contacts were sent notifications (which is the same
problem as the first event).  This is the second time, that I know of,
that Nagios has failed to send to some of the contacts.  The problem has
now occurred on two different hosts.  I can't explain why it's happening
which does not instill confidence in our customers.  Any help or
suggestions in fixing this are greatly appreciated.

This next part is from/for the first event

 first event --
>> There was a routing issue on our WAN that caused this event, the SMTP
server we use is across the WAN.  Could the routing issue have prevented
some of the SMTP notifications from being sent, wouldn't they just queue
up and go once the problem was resolved?

>They would be queued by the SMTP server running on your nagios machine.
Redelivery attempts would occur based on the configuration there.

Okay, makes sense.

>>  I have seen messages that did not arrive at the recipients phone but
I've never seen Nagios not generate notifications for contacts that are
configured for that host or service.  Has anyone else seen this, any
suggestions on a cause or how to troubleshoot?

>- Check nagios.log for a HOST NOTIFICATION event for that group. Make
sure there were no errors logged. 

nagios.log only shows notifications sent to some of the contacts, these
notifications were received.

>- Check your local SMTP server logs to see if the messages were
received there and no errors were reported.

Not necessary, nagios did not send the notifications

>- Make sure that nagios has been restarted since adding this group and
contacts.

Done.  The contact groups in question have been in place for many
months.

>- Make sure you don't have multiple nagios daemons running at the same
time.

Done. Only a single instance is running.
--- end of first event
-


 Second event with logs and configs
-
Below are the configs for the host from the second event.  If you look
the log at the bottom you'll see that 11 of 16 contacts were sent
notifications, some but not all from each of the contact groups
configured. I'm trying to figure out why.  Does anyone see a problem
with my configs?


Host in question:

CONFIGS:
define host {
host_name   Host_A
alias   Host_A
parents Host_B
use upshost
contact_groups  +network-email,onguard
register1
}

define contactgroup {
contactgroup_name   network-email
alias   Users who monitor the
network - email only
members
netuser1,netuser2,netuser3
}

define contactgroup {
contactgroup_name   onguard
alias   On Guard Admins
members
og_user1-phone,og_user2-phone,og_user3,og_user3-home,og_user3-phone,og_u
ser4,og_user4-phone,og_user5-phone,og_user6,og_user6-phone,og_user7,og_u
ser7-phone,og_user8
}

define host {
   name upshost
   aliasNetInfra UPS' template
   check_commandcheck-host-alive
   use  generic-pnp,generic-host
   max_check_attempts   5
   check_interval   60
   retry_interval   3
   active_checks_enabled1
   passive_checks_enabled   1
   flap_detection_enabled   1
   process_perf_data1
   retain_status_information1
   retain_nonstatus_information 1
   contact_groups   network
   notification_interval60
   notification_period  24x7
   notification_options d,u,r
   notifications_enabled1
   register 0

}



Excerpt from nagios.log
[1283265540] HOST NOTIFICATION:
netuser2-cell;Host_A;UNREACHABLE;alert-host-by-sms;PING CRITICAL -
Packet loss = 100%
[1283265540] HOST NOTIFICATION:
netuser2-pager;Host_A;UNREACHABLE;alert-host-by-modem;PING CRITICAL -
Packet loss = 100%
[1283265540] HOST NOTIFICATION:
netuser2;Host_A;UNREACHABLE;alert-host-by-email-long;PING CRITICAL -
Packet loss = 100%
[1283265540] H

[Nagios-users] Nagios did not send notification to some contacts

2010-08-03 Thread Frater, Greg J
Hi All, 

I'm looking for troubleshooting suggestions to a problem we had.  During
a recent host down event Nagios sent notifications to 8 of the 18
configured contacts.  The contacts are configured in two contact groups,
of the 8 contacts that were sent notifications all were from the same
contact group but there are another 4 contacts in that group that did
not get notifications sent.  No contacts from the second group had
notifications generated.  There was a routing issue on our WAN that
caused this event, the SMTP server we use is across the WAN.  Could the
routing issue have prevented some of the SMTP notifications from being
sent, wouldn't they just queue up and go once the problem was resolved?
I have seen messages that did not arrive at the recipients phone but
I've never seen Nagios not generate notifications for contacts that are
configured for that host or service.  Has anyone else seen this, any
suggestions on a cause or how to troubleshoot?

Nagios 3.2.0 on RHEL 5.5

Thanks, 

-greg  
--
The Palm PDK Hot Apps Program offers developers who use the
Plug-In Development Kit to bring their C/C++ apps to Palm for a share
of $1 Million in cash or HP Products. Visit us here for more details:
http://p.sf.net/sfu/dev2dev-palm___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] regularly schedule host checks?

2009-07-09 Thread Frater, Greg J
Thanks Max!  I'll read up on cached checks, a lot has changed since 1.0.

Regards, 

-greg

-Original Message-
From: max.schub...@gmail.com [mailto:max.schub...@gmail.com] On Behalf Of Max
Sent: Thursday, July 09, 2009 9:48 AM
To: Frater, Greg J
Cc: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] regularly schedule host checks?

On Thu, Jul 9, 2009 at 11:58 AM, Frater, Greg J wrote:
> Hi All,
>
> I'm wondering if there is any consensus from Nagios users regarding 
> scheduled host checks.  In general is it better to have Nagios 
> schedule host checks or is it better to use the old logic i.e. only 
> run the host check if a service check fails.  How do you turn off the 
> scheduled host checks in version 3?  I've been messing with some of 
> the host check settings but there's no obvious 'disable scheduled host 
> checks'.

recommended method with Nagios 3 is to run regularly scheduled host checks and 
then use host check caching as this will keep Nagios from re-running the host 
check every time a service fails :):

http://nagios.sourceforge.net/docs/3_0/cachedchecks.html

If you really do not want to use regularly scheduled host checks, set

active_checks_enabled 0

in your host definition, then a host check will only be run on-demand when a 
service check fails.

--
Enter the BlackBerry Developer Challenge  
This is your chance to win up to $100,000 in prizes! For a limited time, 
vendors submitting new applications to BlackBerry App World(TM) will have
the opportunity to enter the BlackBerry Developer Challenge. See full prize  
details at: http://p.sf.net/sfu/Challenge
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] regularly schedule host checks?

2009-07-09 Thread Frater, Greg J
Hi All, 

I'm wondering if there is any consensus from Nagios users regarding
scheduled host checks.  In general is it better to have Nagios schedule
host checks or is it better to use the old logic i.e. only run the host
check if a service check fails.  How do you turn off the scheduled host
checks in version 3?  I've been messing with some of the host check
settings but there's no obvious 'disable scheduled host checks'.

Thanks,

-greg
--
Enter the BlackBerry Developer Challenge  
This is your chance to win up to $100,000 in prizes! For a limited time, 
vendors submitting new applications to BlackBerry App World(TM) will have
the opportunity to enter the BlackBerry Developer Challenge. See full prize  
details at: http://p.sf.net/sfu/Challenge___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Checking URL on windows-client using NSClient++

2009-06-30 Thread Frater, Greg J
Dennis, 
 
Your almost there, you need to set your vbs script to output an exit
code of 0-4 according to the nagios plug in documentation
(http://nagiosplug.sourceforge.net/developer-guidelines.html#PLUGOUTPUT)
.  Then setup the nsclient to run your script as a plug in check.  To do
that you will need to follow these basic steps.
 
1. add the configuration to the nsc.ini file on each xp workstation you
want to run this check on.  It should look something like this:
 
[NRPE Handlers]
webpage_check=c:\windows\system32\cscript.exe //nologo "c:\program
files\nsclient++\scripts\web_page_check.vbs"
 
2. Setup check in Nagios 
 
In your commands definition file:
define command {
   command_name webpage_check
   command_line$USER1$/check_nrpe -H
$HOSTADDRESS$ -t 60 -c webpage_check
}
 
In your services definition file:
 
define service {
host_name   XPhost01, XPhost02
service_description check web page on XP host
display_namewebpage check
check_command webpage_check
use service-template
.
}
 
That should do it, and it uses your existing script (i.e. no rework).  
 
Good luck, hope that helps,
 
-greg

 
 


From: jvc_dvl...@hotmail.com [mailto:jvc_dvl...@hotmail.com] 
Sent: Tuesday, June 30, 2009 4:08 AM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Checking URL on windows-client using NSClient++


Hi,
 
I would like to check if a certain website is available on some of my
windows xp clients. These clients are public self-service computers to
make bookings for a railway organisation. Every now and then, some of
these self-service computers show a blank screen without the default
start-page. 
 
I would like to check if the page is shown up and if it's not, it
notifies Nagios which notifies me by email.
 
I've tried to run an external script (vbs) which checks the availability
of the site and that goes well if you use it manually but i dont want
any popups saying: "OK". It should run in the background. So i've edited
the vbs file and now it does check it and doesnt give any output
anymore.
 
Has anyone else tried something like this before?
 
I use NSClient++ and Nagios v3.0.6.
 
Kind Regards,
 
Dennis de Vries.
--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] weirdness in the scheduling of host checks

2009-06-29 Thread Frater, Greg J
I figured out my problem, I had two instances of Nagios running.  That
would explain a lot of the scheduling weirdness, maybe all of it.  :-)


> Is anyone else seeing weird things in the scheduling of checks?  I
don't have a good sense of what is wrong but, it's definitely not the
way it was under Nagios 1.0 (or the way it should  be).  I've been
watching the scheduling queue on our Nagios 3 box for a week or so,
here's a list of what I've seen:

 > Under Nagios 3.0.6: 
  > - host checks staying at the top of the queue for a long time (over
an hour sometimes) even when they have a timeout set at 30 seconds

 > Under Nagios 3.1.6 
 >  - host check showing up unexpectedly in the scheduling queue, this
morning when I looked at the queue the top event was about 15 minutes
behind the current time but things were moving along okay, when I last
checked there was a host check at the top of the queue with a next check
time from 4 days ago.

 >  - We had a host go down yesterday (Sunday) but we did not get
alerted.  When I looked at it in Nagios I noticed the host check was in
an OKAY state and the 'last check' value for it was from 12 days ago
(6/17/2009)!

 >  - Host checks don't seem to be getting stuck in the queue like they
were under 3.0.6, at least not for as long 

 > I'm going to submit a ticket to tracker.nagios.org but would like to
have more empirical evidence of the problem first, all I have so far are
symptoms, no good data points (logs, errors, etc.).  Is anyone else
seeing this type of behavior?

 > Nagios 3.1.2 (also had trouble with 3.0.6) 
 > RHEL 5 64 bit 

 

--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] weirdness in the scheduling of host checks

2009-06-29 Thread Frater, Greg J
Greetings All, 

Is anyone else seeing weird things in the scheduling of checks?  I don't
have a good sense of what is wrong but, it's definitely not the way it
was under Nagios 1.0 (or the way it should be).  I've been watching the
scheduling queue on our Nagios 3 box for a week or so, here's a list of
what I've seen:

Under Nagios 3.0.6:
 - host checks staying at the top of the queue for a long time (over an
hour sometimes) even when they have a timeout set at 30 seconds

Under Nagios 3.1.6
 - host check showing up unexpectedly in the scheduling queue, this
morning when I looked at the queue the top event was about 15 minutes
behind the current time but things were moving along okay, when I last
checked there was a host check at the top of the queue with a next check
time from 4 days ago.
 - We had a host go down yesterday (Sunday) but we did not get alerted.
When I looked at it in Nagios I noticed the host check was in an OKAY
state and the 'last check' value for it was from 12 days ago
(6/17/2009)!
 - Host checks don't seem to be getting stuck in the queue like they
were under 3.0.6, at least not for as long

I'm going to submit a ticket to tracker.nagios.org but would like to
have more empirical evidence of the problem first, all I have so far are
symptoms, no good data points (logs, errors, etc.).  Is anyone else
seeing this type of behavior?

Nagios 3.1.2 (also had trouble with 3.0.6)
RHEL 5 64 bit

Regards, 

-greg
--
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] disk IO for windows?

2009-06-11 Thread Frater, Greg J
Hi All,

We just started monitoring IO on our Windows servers.  We measure two PerfMon 
counters (using the nsclient++ agent).

System\File Control Operations/sec

Description: File Control Operations/sec is the combined rate of file system 
operations that are neither reads nor writes, such as file system control 
requests and requests for information about device characteristics or status.  
This is the inverse of System: File Data Operations/sec and is measured in 
number of operations perf second.  This counter displays the difference between 
the values observed in the last two samples, divided by the duration of the 
sample interval. 

System\Data Operations/sec

Description: File Data Operations/ sec is the combined rate of read and write 
operations on all logical disks on the computer.  This is the inverse of 
System: File Control Operations/sec.  This counter displays the difference 
between the values observed in the last two samples, divided by the duration of 
the sample interval. 

I'm not sure what good thresholds are for these, currently we have a warning at 
1 and critical at 2 on our test servers.  We've had a couple of servers 
cross those thresholds but I don't have a good sense for what they should be 
yet.  I've never seen any documentation regarding what a Windows box can do (in 
terms of IO).

>From a command line they look like this:

check_nrpe -H $HOSTADDRESS$ -c CheckCounter -a "\\System\\File Data 
Operations/Sec" ShowAll MaxWarn=1 MaxCrit=2 

Regards, 

-greg

-Original Message-
From: Curtis LaMasters [mailto:curtislamast...@gmail.com] 
Sent: Tuesday, June 09, 2009 8:15 PM
To: Anthony Montibello
Cc: Andreas Ericsson; Nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] disk IO for windows?

I use Disk Idle time as an indicator.  Not an original idea :(  I was told to 
mimic the monitoring built into Windows SBS.

Curtis LaMasters
http://www.curtis-lamasters.com
http://www.builtnetworks.com



On Tue, Jun 9, 2009 at 8:27 PM, Anthony Montibello wrote:
> That is partialy right,
>
> The Alt delete thing is To get to TaskManager but only on New versions 
> of windows does it give access to some counter names,
>
> The best place to go is Performance Monitor Since thats on all version 
> of windows since 2000 Control Panel--> Administrative Tools--> 
> Computer Managment--> then Performance Counter on newer systems from 
> Computer Management --> Reliability and performance
> --> Monitoring Tools --> Performance Monitor
> Once you FIND performance Monitor then click the Green + to get into 
> the "add counters"
> Click the Checkbox to Show the Counter description then click arround 
> till you find what you need Look for Disks for Drive stuff,
>
> Tony (Author of NC_NEt)
>
> On Tue, Jun 9, 2009 at 9:24 AM, Andreas Ericsson  wrote:
>>
>> dave stern - e-mail.pluribus.unum wrote:
>> > Anyone know of a plug-in or mechanism to log local disk I/O on windows?
>> >
>> > My nagios server is currently using check_nt to connect to windows 
>> > hosts via nsclient++. I was hoping perhaps COUNTER has something 
>> > buried within it to pull down this info.
>> >
>>
>> There are indeed counters for that, but due to Microsoft's stupidity 
>> the counter-names are different depending on which base-language 
>> you've used for your windows servers.
>>
>> I don't know what they're named for english platforms (or any other 
>> for that matter), but you should be able to view them with that thing 
>> you can pop up when pressing ctrl-alt-del (task manager or whatever 
>> it's called).
>>
>> --
>> Andreas Ericsson                   andreas.erics...@op5.se
>> OP5 AB                             www.op5.se
>> Tel: +46 8-230225                  Fax: +46 8-230231
>>
>> Considering the successes of the wars on alcohol, poverty, drugs and 
>> terror, I think we should give some serious thought to declaring war 
>> on peace.
>>
>>
>> -
>> - Crystal Reports - New Free Runtime and 30 Day Trial Check 
>> out the new simplified licensing option that enables unlimited 
>> royalty-free distribution of the report engine for externally facing 
>> server and web deployment.
>> http://p.sf.net/sfu/businessobjects
>> ___
>> Nagios-users mailing list
>> Nagios-users@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/nagios-users
>> ::: Please include Nagios version, plugin version (-v) and OS when 
>> reporting any issue.
>> ::: Messages without supporting info will risk being sent to 
>> /dev/null
>
>
> --
>  Crystal Reports - New Free Runtime and 30 Day Trial Check out 
> the new simplified licensing option that enables unlimited 
> royalty-free distribution of the report engine for externally facing 
> server and web deployment.
> http://p.sf.net/sfu/businessobjects
> _

[Nagios-users] Assigning contacts to hosts confusion

2009-05-13 Thread Frater, Greg J
Hi All, 

Can someone help me understand the best way to assign contacts to hosts
en masse?  In version 1.x, we would create a hostgroup put the hosts
that we needed in it and then assign a contact group to that hostgroup,
done deal.  Now it appears that under Nagios 3 to accomplish the same
thing I have to create the hostgroup as before and then go to each host
and assign a contact or contact group to it.  That's much more tedious
than the old way, am I missing something?  Please tell me there's an
easier way that I'm overlooking.

I found this in the what's new in Nagios 2 doc's
(http://nagios.sourceforge.net/docs/2_0/whatsnew.html):

Contact group changes - The contact_groups directive has been moved from
hostgroup definitions to host definitions  . This was
done in order to maintain consistency with the way service contacts are
specified. Make sure to update your config files! 

Thanks, 

-greg
--
The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your
production scanning environment may not be a perfect world - but thanks to
Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700
Series Scanner you'll get full speed at 300 dpi even with all image 
processing features enabled. http://p.sf.net/sfu/kodak-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Im confused about scheduled downtime and notifications

2009-05-07 Thread Frater, Greg J
>> Hi All,
>>
>> We've just migrated from Nagios 1 to 3.0.6.  I have a host that was in 
>> scheduled downtime and turned off.  I turned it back on, while still 
>> in scheduled downtime, and got a HOST UP alert for it.  Is this 
>> expected behavior?  Could I have a misconfiguration somewhere, where should 
>> I look?
>> Has anyone else seen this?

>I've occasionally seen notifications for hosts in scheduled downtime, but only 
>in the few seconds after restarting the Nagios daemon.  I don't think it's 
>>expected behaviour, no.

In this case the daemon had been running for 3 days.

>I've not bothered to find a fix for it, as it happens only rarely and so long 
>as I don't restart the daemon at three in the morning, it won't bother anyone.

This was on a host that had just been retired, I turned it back on to get a 
file off of it and that's when the alert was sent, it had been offline for a 
while.  The interesting thing was that Nagios only sent a UP alert when I shut 
it down again (a few minutes later) the alerts for down were not sent (as would 
be expected).  It does not appear to be a major problem (no one else 
responded), I'm not going to worry about it too much.  Thanks for your feedback.

-greg

--
The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your
production scanning environment may not be a perfect world - but thanks to
Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700
Series Scanner you'll get full speed at 300 dpi even with all image 
processing features enabled. http://p.sf.net/sfu/kodak-com
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Im confused about scheduled downtime and notifications

2009-05-05 Thread Frater, Greg J
Hi All, 

We've just migrated from Nagios 1 to 3.0.6.  I have a host that was in
scheduled downtime and turned off.  I turned it back on, while still in
scheduled downtime, and got a HOST UP alert for it.  Is this expected
behavior?  Could I have a misconfiguration somewhere, where should I
look?  Has anyone else seen this?

Thanks, 

-greg


--
The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your
production scanning environment may not be a perfect world - but thanks to
Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700
Series Scanner you'll get full speed at 300 dpi even with all image 
processing features enabled. http://p.sf.net/sfu/kodak-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] stopping and starting NDO daemon

2009-04-20 Thread Frater, Greg J
 > Hello Greg,
 
 > here it is, simply took one of RHEL's init-scripts and modified it:
 
 > [r...@atvies00nx libexec]# cat /etc/init.d/ndo2db
 > #! /bin/sh
...
 > echo $"Usage: $0
{start|stop|restart|try-restart|status}"
 >RETVAL=3
 > esac
 
 > exit $RETVAL
 
Thanks Christian, I'll give that a try.
--
Stay on top of everything new and different, both inside and 
around Java (TM) technology - register by April 22, and save
$200 on the JavaOne (SM) conference, June 2-5, 2009, San Francisco.
300 plus technical and hands-on sessions. Register today. 
Use priority code J9JMT32. http://p.sf.net/sfu/p___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] stopping and starting NDO daemon

2009-04-20 Thread Frater, Greg J
Hi All, 

We've just setup nagios 3 using NDO (to support Nagvis) and I realized
that NDO does not come with an init script.  I found one post to the
list
(http://article.gmane.org/gmane.network.nagios.user/52477/match=ndo+init
) from about a year ago, someone was asking for help incorporating the
NDO daemon into the Nagios init script, but there were no responses.
I'm wondering how others are handling the stopping and starting of the
NDO daemon?  Would it be better to incorporate it into the Nagios init
script or should it have it's own, has somebody already done this that
is willing to share with the list?

Thanks, 

-greg



--
Stay on top of everything new and different, both inside and 
around Java (TM) technology - register by April 22, and save
$200 on the JavaOne (SM) conference, June 2-5, 2009, San Francisco.
300 plus technical and hands-on sessions. Register today. 
Use priority code J9JMT32. http://p.sf.net/sfu/p___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Some hostgroups NOT displaying (weirdness) SOLVED

2009-04-07 Thread Frater, Greg J
Okay, I apparently have not learned the lesson to check the archives
first.  I found the answer there
(http://article.gmane.org/gmane.network.nagios.user/45395/match=hostgrou
p+not+display), I did not check there until after I posted.  Anyways for
future reference our problem was due to NOT having the web ui users as a
contact of a hosts in the offending hostgroups.
 
Regards, 
 
-greg



From: Frater, Greg J 
Sent: Tuesday, April 07, 2009 1:57 PM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Some hostgroups NOT displaying (weirdness)



Greetings All, 

I'm pretty lost and could use some help.  We are moving a Nagios 1.x
setup to Nagios 3.0.6.  We have about 550 hosts and some 5000 services.
About 3/4 of the way through our migration of config files we noticed
that some of our hostgroups are not showing up in the web ui,  but I
have yet to figure out why.  There are certain hostgroups (but not all)
that are not displaying and if they are listed in the
'hostgroup_members' directive of another hostgroup it will cause the
second hostgroup not to display either.  I've tried every combination of
member and hostgroup_members that I can think of but so far the only way
I can duplicate the problem is use one of the known problem groups.
Does anyone know what the logic or rules are that are used to determine
if a hostgroup will display in the web ui or not?  Any other suggestions
on what to try to track this down?  Any help is appreciated.

Thanks, 

-greg 

--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Some hostgroups NOT displaying (weirdness)

2009-04-07 Thread Frater, Greg J
Greetings All, 

I'm pretty lost and could use some help.  We are moving a Nagios 1.x
setup to Nagios 3.0.6.  We have about 550 hosts and some 5000 services.
About 3/4 of the way through our migration of config files we noticed
that some of our hostgroups are not showing up in the web ui,  but I
have yet to figure out why.  There are certain hostgroups (but not all)
that are not displaying and if they are listed in the
'hostgroup_members' directive of another hostgroup it will cause the
second hostgroup not to display either.  I've tried every combination of
member and hostgroup_members that I can think of but so far the only way
I can duplicate the problem is use one of the known problem groups.
Does anyone know what the logic or rules are that are used to determine
if a hostgroup will display in the web ui or not?  Any other suggestions
on what to try to track this down?  Any help is appreciated.

Thanks, 

-greg
--
This SF.net email is sponsored by:
High Quality Requirements in a Collaborative Environment.
Download a free trial of Rational Requirements Composer Now!
http://p.sf.net/sfu/www-ibm-com___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] nsclient 0.3.6-RC4-x64 issues

2009-02-17 Thread Frater, Greg J
Have you checked the firewall settings in 2008?  If they have been
enabled then they will block the nsclient++ exe from communicating on
the network unless you tell the firewall to let nsclient++ traffic
through.  That's the next thing I would check.

Good luck, 

-greg 

-Original Message-
From: Joseph L. Casale [mailto:jcas...@activenetwerx.com] 
Sent: Saturday, February 14, 2009 12:54 PM
To: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] nsclient 0.3.6-RC4-x64 issues

>Normally Windows monitoring use NC_Net or NSClient++. Can you give more

>details on the check you're doing (the real command, not
check_command)?

Well, just started with Nagios, and I am using one of the windows.cfg
templates with NSClient++ installed on the Windows Server 2008x64 boxes.

>If you're using check_nt or one of its substitute (NC_Net has 
>check_nc_net, NSClient++ may have one too) make sure the proper service

>is installed and running on the Windows server. Try restarting the 
>service if it's already running. Make sure the configuration is good 
>(especially port and password).

I am using check_nt and the port/pass is correct for WinXP machines, but
the same ini on the 2008 machines doesn't work. Given Nagios suggests a
socket error, I suspect it is probably some bug in the x64 client on the
2008 machines...

In the meantime, since I am just starting out, I have been reading up on
snmp as that is how I am preferring to do my monitoring rather than have
to install any applications.

jlc




--
Open Source Business Conference (OSBC), March 24-25, 2009, San
Francisco, CA -OSBC tackles the biggest issue in open source: Open
Sourcing the Enterprise -Strategies to boost innovation and cut costs
with open source participation -Receive a $600 discount off the
registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when
reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Using a modem with Nagios as backup alert path

2009-02-12 Thread Frater, Greg J
>-Original Message-
>From: Russell Adams [mailto:rlad...@adamsinfoserv.com] 
>Sent: Thursday, February 12, 2009 4:12 PM
>To: nagios-users@lists.sourceforge.net
>Subject: Re: [Nagios-users] Using a modem with Nagios as backup alert
path

>On Thu, Feb 12, 2009 at 03:05:54PM -0800, Frater, Greg J wrote:
>> Greetings All,
>> 
>> We are setting up a new Nagios instance running version 3.  With it
we 
>> are adding a modem so that alerts can be sent via the phone system 
>> even if the data network goes down.  In the testing I've done
messages 
>> sent via the modem take quite a bit longer to be received than alerts

>> sent via SMTP, this is because the modem goes through the dial up 
>> connection process and then sends the messages to each recipient in 
>> separate sessions, so I want to keep SMTP as the primary method and 
>> use the modem as a backup path.  What is the best way to do this, can

>> we do it without a bunch of work to implement?  Suggestions and
advice gladly accepted.
>> 
>> Thanks,
>> 
>> -greg

>What are you sending via modem, SMS, text pages? Are you connecting to
an ISP via PPP to fire off one SMTP message?

The modem is sending sms messages.

>I have successfully used Sendpage with Nagios to send to IXO/TAP text
pagers, and it supports piggybacking to send out multiple pages in a
single phone call. I think it supports sending SMS too.

I guess I need to have a closer look at the sendpage config, there's
probably a way to do that I just did take the time to dig into it.

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Using a modem with Nagios as backup alert path

2009-02-12 Thread Frater, Greg J
 > From: Gareth Fletcher [mailto:gareth.fletc...@onenet.co.nz] 
 > Sent: Thursday, February 12, 2009 3:44 PM
 > To: Frater, Greg J
 > Subject: RE: Using a modem with Nagios as backup alert path



 > Hi Greg,

 

 > Just something to get you started before someone with more knowledge
answers - what about monitoring the data network in Nagios, create   an
event handler so if it goes down change bring the modem up and start
sending messages that way. When it comes back up you can keep SMTP
primary. 

 

I'm not sure how to effectively check the "network" to see if SMTP is
available, on one hand it seems straight forward but when I think about
details it does not seem so obvious.  My SMTP path looks something like
this:
 
Nagios -> local sendmail daemon -> smtp smarthost -> corporate smtp
gateway -> corporate internet connection -> internet
 
Maybe it's overkill to worry about all of those pieces but in my
paranoid (i.e. mad boss wanting to know why we didn't know about a major
outage) mind I don't want to leave anything out :-)
 
Honestly I'm not checking all of those things now, but I would like to
be a bit more thorough on the new setup, some of those things I don't
have access to be able to check them.

 

 

 > How come you have to dial up each time to send alerts?   

 

That's a good question, that's just the way it worked when I first set
it up and I did not check for another or better way. 

 

 

--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] Using a modem with Nagios as backup alert path

2009-02-12 Thread Frater, Greg J
Greetings All, 

We are setting up a new Nagios instance running version 3.  With it we
are adding a modem so that alerts can be sent via the phone system even
if the data network goes down.  In the testing I've done messages sent
via the modem take quite a bit longer to be received than alerts sent
via SMTP, this is because the modem goes through the dial up connection
process and then sends the messages to each recipient in separate
sessions, so I want to keep SMTP as the primary method and use the modem
as a backup path.  What is the best way to do this, can we do it without
a bunch of work to implement?  Suggestions and advice gladly accepted.

Thanks, 

-greg
--
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Monitoring Windows Eventviewer

2008-06-03 Thread Frater, Greg J

 

 > Thanks for the info,

 

 > Great that it can be done with the NSClient++ as I have it installed
on all our servers.

 

 > What do I need to define in the NSClient++ agent and where ?

 > Is it in the .ini file or elsewhere ? 

 

The only thing you need to do on the Windows server is enable the
CheckEventLog.dll by removing the semicolon from that line in the
nsc.ini file.

 

Regards, 

 

-greg   

 

 

From: Frater, Greg J [mailto:[EMAIL PROTECTED] 
Sent: maandag 2 juni 2008 21:41
To: Tim Van Caeyzeele
Cc: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Monitoring Windows Eventviewer

 

>Dear All, 

>Would anyone have experience in checking the windows eventviewer for
certain events, or turning nagios red in case of ERRORs ?

>What script are you using ? preferably something that can simply
interact with NSClient 

We do this using the NSClient++ agent (www.nsclient.org
 ).  It checks the event logs and filters
them based on criteria you define, alerting when the number of hits you
specify is reached (i.e. when the system log has 1 or more events with
an ID of  within the last 10 minutes send alerts).  Here is an
example we use to monitor for a specific Oracle error.  In the example
we check the "application" log of the server every "60" minutes for
events with an ID of "20" with event type of "Error" containing a string
in the text of the message "Can not allocate log", check turns critical
after 1 matching event is found that is time stamped within the last
"65" minutes.

 

Checkcommands.cfg: 
define command{ 
command_namecheck_eventlogs 
command_line$USER1$/check_nrpe -H $HOSTADDRESS$ -p 5666 -c
checkEventLog -a filter=new $ARG1$ MaxWarn=$ARG2$ MaxCrit=$ARG3$
filter-generated=\$ARG4$ $ARG5$ truncate=$ARG6$

# Desc: 
#   $ARG1$ = event logs to check (i.e. file=system file=application)

#   $ARG2$ = Warning level (i.e. number of hits to generate a
warning response) 
#   $ARG3$ = Critical level (i.e. number of hits to generate a
critcal response) 
#   $ARG4$ = Time period (i.e. 1 day is '1d' 30 hours is '>30h') 
#   $ARG5$ = Filters (i.e. filter-eventID==9009
filter-eventSource=Tcpip) see
http://www.nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog
<http://www.nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog>  for
detailed info

#   $ARG6$ = Amount of data to return in characters (i.e.
truncate=150) 
#   Example: check_nrpe -H server_name_here -p 5666 -c checkEventLog
-a filter=new file=system MaxWarn=1 MaxCrit=1 filter-generated=\>30h
filter+eventID==10002 descriptions truncate=138

} 

Services.cfg: 
define service{ 
use standard-srv 
service_description eventlog: Oracle archive log
errors 
check_command
check_eventlogs!file=application!1!1!>65m!filter+eventID==20
filter+eventType==error filter+message=substr:"Can not allocate log"!100

normal_check_interval   60 
notification_optionsw,c 
contact_groups  apps 
host_name   server1, server2 
} 

HTH, 

-greg 

 

-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Monitoring Windows Eventviewer

2008-06-02 Thread Frater, Greg J
>Dear All,

>Would anyone have experience in checking the windows eventviewer for
certain events, or turning nagios red in case of ERRORs ?
>What script are you using ? preferably something that can simply
interact with NSClient

We do this using the NSClient++ agent (www.nsclient.org).  It checks the
event logs and filters them based on criteria you define, alerting when
the number of hits you specify is reached (i.e. when the system log has
1 or more events with an ID of  within the last 10 minutes send
alerts).  Here is an example we use to monitor for a specific Oracle
error.  In the example we check the "application" log of the server
every "60" minutes for events with an ID of "20" with event type of
"Error" containing a string in the text of the message "Can not allocate
log", check turns critical after 1 matching event is found that is time
stamped within the last "65" minutes.


Checkcommands.cfg:
define command{
command_namecheck_eventlogs
command_line$USER1$/check_nrpe -H $HOSTADDRESS$ -p 5666 -c
checkEventLog -a filter=new $ARG1$ MaxWarn=$ARG2$ MaxCrit=$ARG3$
filter-generated=\$ARG4$ $ARG5$ truncate=$ARG6$
# Desc:
#   $ARG1$ = event logs to check (i.e. file=system file=application)
#   $ARG2$ = Warning level (i.e. number of hits to generate a
warning response)
#   $ARG3$ = Critical level (i.e. number of hits to generate a
critcal response)
#   $ARG4$ = Time period (i.e. 1 day is '1d' 30 hours is '>30h')
#   $ARG5$ = Filters (i.e. filter-eventID==9009
filter-eventSource=Tcpip) see
http://www.nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog for
detailed info
#   $ARG6$ = Amount of data to return in characters (i.e.
truncate=150)
#   Example: check_nrpe -H server_name_here -p 5666 -c checkEventLog
-a filter=new file=system MaxWarn=1 MaxCrit=1 filter-generated=\>30h
filter+eventID==10002 descriptions truncate=138
}

Services.cfg:
define service{
use standard-srv
service_description eventlog: Oracle archive log
errors
check_command
check_eventlogs!file=application!1!1!>65m!filter+eventID==20
filter+eventType==error filter+message=substr:"Can not allocate log"!100
normal_check_interval   60
notification_optionsw,c
contact_groups  apps
host_name   server1, server2
}

HTH, 

-greg



-
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] SMS and ATT with Nagios

2008-05-12 Thread Frater, Greg J
Correct, it is good to have at least two ways out, modem and network. 

-Original Message-
From: Tom Brown [mailto:[EMAIL PROTECTED] 
Sent: Monday, May 12, 2008 8:04 AM
To: Frater, Greg J
Cc: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] SMS and ATT with Nagios


>
> We use SMS for our alerts, it is our primary mechanism.  We send them 
> out using the same Nagios command as the email alerts.  To send alerts

> to ATT SMS addresses you send them as email from Nagios to 
> [EMAIL PROTECTED]
>
>   

providing your network is still abot to connect to the outside world


-
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
Don't miss this year's exciting event. There's still time to save $100. 
Use priority code J8TL2D2. 
http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] SMS and ATT with Nagios

2008-05-12 Thread Frater, Greg J
>Hi,

>Luis Fernando Lacayo <[EMAIL PROTECTED]> wrote on 09.05.08 16:38:

>> Good Morning all,

>I have to change my NAGIOS platform to a Dell Blade on RHEL
>> 5.  I
>currently use a modem and qpage to send out notifications.  Since
> there
>is no way to attach a modem to a blade, I am thinking on sending
>> the
>alerts via SMS.  Our carrier is ATT, is there anyone out there
>> currently
>doing this? 

>Can you share how you are doing this?

>Thanks,

>Luis

>Multitech has Modems with Ethernet Interfaces. You just connect them to
the Network and send a text oder sms via Telnet commands:
>http://www.multitech.com/PRODUCTS/Categories/Device_Networking/


We use SMS for our alerts, it is our primary mechanism.  We send them
out using the same Nagios command as the email alerts.  To send alerts
to ATT SMS addresses you send them as email from Nagios to
[EMAIL PROTECTED]

HTH 

-greg

-
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
Don't miss this year's exciting event. There's still time to save $100. 
Use priority code J8TL2D2. 
http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] nagios reboot XP

2008-05-08 Thread Frater, Greg J
 

>Palle L Jensen wrote:
>>
>> I guess I should have been more specific, sorry about that.
>>
>> I am running Nagios 3.0.1 on openSUSE 10.3, and I am only monitoring 
>> windows hosts. On the windows hosts I am running NSClient++. That 
>> seems to work good and I would prefer not to change to any other
client.
>>
>> I have read about the event handlers in the Nagios documentation, but

>> the samples there only refers to nrpe and scripts on the server. I 
>> understand how to define the service in Nagios, but I am not sure how

>> to create the script that executes the batch file on the windows
host? 
>> Maybe I am not getting the point and understanding on the
documentation.
>>
>> Lets say I created a batch file that will start or restart a service,

>> this batch file is located in c:\NSclient++\scripts folder. And then 
>> defined the service and command in Nagios:
>>
>> define service{
>> host_name  somehost
>> service_descriptionsql-service
>> max_check_attempts 4
>> event_handler  restart-sqlagent
>> }
>>
>> define command{
>> command_name   restart-sqlagent
>> command_line   what do I add here?? It has to be something
that can be sent to the Nsclient++/windows host and execute the batch
script in c:\NSclient++\scripts folder
>> }
>>
>> The command line on the eventhandler documentation shows a path to a 
>> local script and local command ?/etc/rc.d/init.d/httpd restart?
>>
>> So creating and event handler for services or others on the box where

>> Nagios is running, I understand. But not how to do it remotely to a 
>> windows host.
>>
>> If you could shed some more light over this I would very much 
>> appreciate it.
>>
>> Thanks,
>>
>> PJ
>>
>You need a Unix-side script to execute and handle the condition 
>appropriately, for example to only call the restart when the service is

>in hard failure. A sample Bash script is given in the docs to get you 
>started.

>If using Bash (which I recommend) the Bash script should check the
state 
>and perform the logic control, calling the batch or script on the 
>windows side though an NRPE call to nsclient++ to execute the script on

>the windows host.

The NSClient++ data collector's primary mechanism for actions (getting
it to do stuff) is via NRPE.  It will respond to the old nsclient
command as well but the majority of it's functions are utilized through
NRPE.  This is the case with your situation as well.  On the Nagios side
you would define a check like you have above and in the NSC.ini file you
define a handler for the command.  Below is an example of one we use to
check the CPU utilization of any process on the server.

On the Nagios server define your service as above and use something like
this for the command_line line.
Command_line$USER1$/check_nrpe -H $HOSTADDRESS$ -t 60 -c
wmi_process_perf -a $ARG1$   

On the Windows server in the NSC.ini file under the NRPE Handlers
section put something like this:
wmi_process_perf=c:\windows\system32\cscript.exe //nologo "c:\program
files\nsclient++\scripts\wmi_cpu_perf.vbs" /process:$ARG1$

In this example the $ARG1$ value is the name of a running process on the
server.  The check returns the CPU and memory utilization of the
process.

HTH, 

-greg

-
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
Don't miss this year's exciting event. There's still time to save $100. 
Use priority code J8TL2D2. 
http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] (no subject)

2008-04-29 Thread Frater, Greg J
>> -Original Message-
>> From: [EMAIL PROTECTED]
[mailto:nagios-users- 
>> [EMAIL PROTECTED] On Behalf Of Frater, Greg J
>> Sent: Monday, April 28, 2008 11:20 AM
>> To: nagios-users@lists.sourceforge.net
>> Subject: [Nagios-users] A bit OT: sending MMS messages from Linux
>> 
>> Greetings All,
>> 
>> My apologies if this is to much off topic.
>> 


>> Linux server.  The common link between all the scenarios we have
tried is the
>> Linux server and the sendmail transfer agent.  Does anyone on the
list send

>Sounds like it's just a normal e-mail to a specific address, correct?

Yes, as far as Linux/Nagios is concerned they are standard messages to
SMTP addresses.


>Have you verified that sendmail is behaving itself as a normal mail
server on the Internet? It has valid public reverse DNS? Forward DNS
matches reverse DNS? HELO name is a valid FQDN, preferably matching
reverse DNS for the IP? What do your sendmail logs show? Is there a
bounce generated?

We use the same Nagios command to send successfully to the SMS addresses
([EMAIL PROTECTED] vs [EMAIL PROTECTED] for MMS) of the same phones (i.e.
using the same SMTP servers to get out).  The sendmail logs show the
messages as being delivered Okay.  I did some testing, I'm trying
different things to try and discover where the problem lies.

Test: Send 10 messages from old RH/sendmail box to MMS address of cell
phone, Send 10 messages from new RH/sendmail box to MMS address of cell
phone, Send 10 messages from windows box (with sendmail port) to the MMS
address of cell phone.  All messages are sent one after another from the
command line.

Results: Usually all 10 of the messages from the Windows box come
through, though not always.  Some of the Linux generated messages come
through I've gotten as many as 4 and as few as none.  The new versus old
versions of Linux and Sendmail do not seem to make a difference.  The
messages will come through on occasion but not consistently.  Not good
for a monitoring solution.


I should rephrase my original questions:

Is anyone else using MMS to receive Nagios alerts?
Do you ever notice any alerts not getting to your phone?
What provider are you using?

Thanks, 

-greg





-
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
Don't miss this year's exciting event. There's still time to save $100. 
Use priority code J8TL2D2. 
http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

[Nagios-users] A bit OT: sending MMS messages from Linux

2008-04-28 Thread Frater, Greg J
Greetings All, 

My apologies if this is to much off topic.

We are having trouble sending alerts from our Nagios server to the MMS
address of our cell phones.  We have been sending SMS messages for years
with great success, however MMS messages are never received.  We would
like to use MMS because SMS messages are limited to 160 characters, MMS
has a much higher limit.

The problem is not with Nagios but instead (I think) with the sendmail
mechanism it uses when sending messages.  I'm just hoping someone on the
list has dealt with this already and can help me figure it out.  We use
Sendmail to send massages from the Nagios server to an internal SMTP
gateway that forwards the messages on to the Internet.  We can
successfully send MMS messages from Outlook clients on the desktop which
use the same SMTP gateways as the Nagios server, the phones receive
these without any trouble.  We have tried both ATT and Verizon systems
and neither will deliver the MMS messages from the Nagios server.  We
have also tested sending the messages from the command line on the Linux
server.  The common link between all the scenarios we have tried is the
Linux server and the sendmail transfer agent.  Does anyone on the list
send messages to MMS addresses?  If so did you have any trouble getting
it to work, if not what OS and mail transfer agent are you using?

We have tested this from Red Hat 7.3/Sendmail 8.11 and Red Hat
Enterprise Linux 5/Sendmail 8.13

Thanks,

-greg
-
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
Don't miss this year's exciting event. There's still time to save $100. 
Use priority code J8TL2D2. 
http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Re: [Nagios-users] Monitoring Windows Servers - [New domain name for nsclient++]

2008-04-08 Thread Frater, Greg J
 

-Original Message-
From: Michael Medin [mailto:[EMAIL PROTECTED] 
Sent: Monday, April 07, 2008 9:59 PM
To: Frater, Greg J
Cc: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Monitoring Windows Servers - [New domain
name for nsclient++]


>> (found here: http://trac.nakednuns.org/nscp - don't ask me about the 
>> URL I don't know why it is what it is - it seems safe though)
>>   
>LOL, everyone seems to mention this lately, well, here is my official
apology to anyone who has taken offense for the domain name *SORRY*
>Anyways, try: http://NSCLIENT.ORG, http://nsclient.com or
http://nsclient.net hopefully these are "better" (if anyone want to know
the >reason for using the old one there is a post on the forum about it
http://nsclient.org/nscp/discussion/2/120 )

All I know is I got a visit from our security guy asking for an
explanation of my visits to that site.

>The preferable domain is nsclient.ORG but since go-daddy was so damn
cheep (anyone know why? I pay 10 times as much for my "previous
>domains") I got the others when I was at it...

Excellent, I will use the new domain (though I must admit nakednuns is
pretty memorable).

>And if any apache2 configure gurus knows how to combine location (I use
a location with mod python inside it to "call trac") and mod_rewrite let
me know cause I would have preferred to drop the "trailing /nscp" but
alas apache was giving me attitude...

I would not be of much help here, I'm not an Apache guy.

-greg

-
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
Register now and save $200. Hurry, offer ends at 11:59 p.m., 
Monday, April 7! Use priority code J8TLD2. 
http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Monitoring Windows Servers - Comparing Options

2008-04-07 Thread Frater, Greg J
>No input from anyone? One of the selling points to move away from
proprietary solutions and towards OSS was the helpfulness of the
community and mailing list. I know I don't have a 'problem' listed
below that can easily be solved, but surely there are some opinions out
there?

Alright, I'll respond.  Most of the time I usually just lurk
here, most questions are usually answered quite well by others in a
relatively short time.  This is my 2 cents.  See response below amongst
your first post questions. 


>So, my question is, what is the best option for monitoring Windows
Servers and why?
>An important factor to consider is that I will probably not have
Administrative access to any of the Windows Servers that I will be
>monitoring.

>I'm currently considering three options for the reasons shown below (in
order of preference):
>1) WMI checks
>Pros
>-Complete control of NRPE service on 'my' Windows Server
>(Can this be distributed over two boxes?)
>-Complete control of check commands on Nagios Server

WMI provides a very thorough list of things you can check
It's built into the OS, i.e. no install or configuration
required

>Cons
>-NRPE service must be run from an account with access to all
Windows Servers.

WMI can fail.  From time to time we have a server with WMI
problems, we've been able to fix WMI at times, other times we've had to
rebuild the OS, in those cases you would not have any monitoring.
WMI is used by other things as well, and is susceptible to
getting corrupted or reconfigured by something (or in our case someone)
else.  I think we've resolved most of our issues with WMI, however I
learned that if it's broken, it does not matter why you can't use it to
monitor anything until it's fixed.
WMI requires common MS technology such as common MS network
ports and Windows (AD) user accounts for security.  This is not a
problem for the most part but increases the risk when monitoring a DMZ
for example and means your monitoring is dependant upon AD and thus
anything AD needs (like DNS), etc.  If AD or DNS goes down you've got
bigger problems than whether or not your Nagios agents are working,
however it that is how you watch everything then when they fail will you
know it?

>2) Agent checks - NRPE-NT or some other current Windows Agent
>Pros

Your not relying on a MS technology.  Obviously the OS has to be
available but your not dependant upon WMI or .NET or MDAC, etc.  This
may sound ironic but having your monitoring system as independent of the
system it's watching as possible the better (IMO).
Flexibility/Extensibility.  For example the agent we use (nscp)
(found here: http://trac.nakednuns.org/nscp - don't ask me about the URL
I don't know why it is what it is - it seems safe though) is very
flexible supporting both the original NSClient on the Nagios server side
or NRPE and can check pretty much anything (including, WMI, perfmon,
event logs, plus some built in checks and custom scripts).  The
documentation is not one of it's strong points, but it is very capable
and reliable.  I've struggled at times getting a particular type of
check working because I could not understand the docs.  I have not run
into any problems with it in terms of crashes or memory leaks, etc.
After about two years of using it (various versions) I can only think of
twice when it has either generated an error or stopped running and that
covers over 200 Windows boxes.  It can be installed and uninstalled
without server reboots.  It just works, I like it.  Better docs would be
nice though.  I'm not trying to sell you on this particular agent or
even agents in general, this is my experience.

>Cons
>-Requires an agent setup and running on every Windows Server
>-No direct control of agents/check commands

Some checks via an agent (like custom scripts) are not as quick
as SNMP (and probably WMI).

>3) SNMP based checks
>Pros
>-Complete control over check commands on Nagios Server

SNMP checks are very fast
Once configured no changes are required on the server (i.e. SNMP
does not require version upgrades, etc.)

>Cons
>-No direct control of SNMP Community Strings/ACL allowing
access from Nagios
>-No direct control of SNMP service

The values exposed via SNMP on Windows servers is limited.  You
can do basic monitoring disks, ram, cpu, etc.  But not advanced things
such as CPU user mode vs. kernel mode usage.  You can get an SNMP
extension such as SNMP-Informant (http://www.snmp-informant.com/) to
resolve this issue.
SNMP can fail.  We have a small portion of our servers both 2000
and 2003 where SNMP service won't stay running.  We do some SNMP
monitoring but not a lot, I've not solved this one yet.


>How do you do it? Are there any other Pros & Cons that I might be
missing? Do yo