Re: [Nagios-users] sms notifications
Hi Frank,

We just bought the MultiTech MultiModem iCell and need to interface it to Nagios. Care to share what you're using (home-grown script, SMS gateway, etc.) to send alerts from Nagios through your iSMS?

Thanks,
-greg

From: Frank Bulk [mailto:frnk...@iname.com]
Sent: Wednesday, February 01, 2012 6:55 PM
To: NAGIOS
Subject: Re: [Nagios-users] sms notifications

We use the MultiTech MultiModem iSMS - works great.

Frank

From: Marco Borsani [mailto:m.bors...@it.net]
Sent: Wednesday, February 01, 2012 9:04 AM
To: NAGIOS
Subject: [Nagios-users] sms notifications

Hi all,

To send SMS notifications, I have been using since 2003 a couple of radio modems connected via serial to my server, together with the smstools software. Now I need something "stronger" than that configuration, because for specific checks I have to send a large number of SMS messages at the same time. What can I use?

Regards,
Marco

___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] qpage - OT
Thanks for the response Benny, I appreciate any help I can get.

>> qpage error:
>> <502 MESSAGE REJECTED - STX OR EOT EXPECTED>

> It would have been nice to see your qpage.cf file... ;)

That seems obvious; see below.

> Be sure you have 'parity=even' in your config. When you run a test with verbose and interactive flags set, do you fail five or six times before you get that message?

I've never tried the interactive flag; I will do so. As for the failures: when I had the retry set to 20, it would fail 5 times in a row and then reset the modem or something (I can't fully interpret the logs), and then retry again, possibly 20 times, as in 20 sets of 5.

The interactive (-i) option seems to require a page to be sent right now. So far I have been unable to get a failure when sending a page manually, but I have only sent a small number of pages (10-20) that way. The only times it has failed so far are when it's running in daemon mode.

Do you guys use USB modems with qpage? These problems got much worse after switching to a USB modem.

qpage.cf:

#
# QuickPage configuration file
#
administrator=xxx...@xxx.xxx
queuedir=/var/spool/qpage
pidfile=/var/spool/qpage/qpage.pid

modem=modem1
        device=/dev/modem

service=att
        device=modem1
        phone=9,18009094602
        baudrate=9600
        parity=even
        allowpid=yes
        #maxtries=6
        maxtries=3
        msgprefix=false
        #maxmsgsize=250
        maxmsgsize=500
[Nagios-users] qpage - OT
Cheers All,

I know a number of people use qpage for sending alerts via modem, as we do. I'm hoping someone can help me with a qpage problem we're having; there does not seem to be a qpage mailing list, and so far the developer has not responded to my email.

We are getting *random* alert pages that fail to send with the error below. I say random because I have not figured out any pattern or commonality between them (other than the obvious: they are all from Nagios, etc.). Roughly once a day a page will fail to send, but others before and after it will send fine, and they all use the same send script. We've always had some problems, but they recently got worse after we replaced a failed PCI modem with a USB modem from US Robotics.

Is anyone else using USB modems, and if so, what brand/model? Has anyone seen problems like this? Is there a better forum or mailing list I could look to for help? Sorry for the off-topic post.

qpage error:
<502 MESSAGE REJECTED - STX OR EOT EXPECTED>

Thanks,
Greg Frater
System Admin
Re: [Nagios-users] trying to fix problem with excessive latency
Hi All,

This is just a "me too" post to hopefully help others down the road!

> I have inherited maintenance of a medium-sized Nagios installation. We currently have 649 hosts and 5415 services. Our setup works nicely, with one exception: Nagios falls behind on host/service checks. Our usual latency once Nagios has been running for a while is about 190-200 seconds. Our Nagios host is reasonably powerful and isn't struggling; it seems that Nagios itself is limited somehow.

We are similar in size: 530 hosts and 3300 service checks, with a check latency of around 300 seconds for services and 350 seconds for hosts. There are no obvious performance bottlenecks on our server; the closest thing to one was average CPU utilization of around 30% across 4 processors, with 4 GB of RAM. Our current configuration did not have any latency issues running Nagios 3.1.2; the latency issues started a couple of weeks ago with an upgrade to 3.2.2.

> It looks like use_large_installation_tweaks did the trick. I could have sworn I tried that before; likely I set it to 0 instead of 1 or something foolish like that.

We've avoided this in an effort to keep our configuration as plain as possible, and because I remember reading that it was not expected to be effective unless you had something like 1000+ hosts and many more service checks. Apparently that is not correct, at least under 3.2.2.

> Anyway, I wanted to test the individual components of large_installation_tweaks to see what helped.
> * child_processes_fork_twice=0: latency drops from 149 to 18

This change took our latency from ~300 seconds to 0.228 seconds for services and from ~350 seconds to 0.458 seconds for hosts.

> * free_child_process_memory=0: latency drops from 149 to 0.6

I made this change with the 'child_processes_fork_twice=0' change still in place and saw no measurable drop in latencies.

> * disabling summary macros in environment variables doesn't seem to be
>   individually configurable, but I expect that's disabled already by
>   enable_environment_macros=0.

We did not do this; it was not deemed necessary.

> So, thanks again to everybody who helped. Latency is now 0.35% of what it used to be.
>
> -Corey

Ditto, this was a big help to us as well, thanks.

-greg
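
For reference, the tuning options discussed in this thread could be collected in nagios.cfg roughly as follows. This is only a sketch for Nagios 3.x: the directive names are the ones mentioned in the messages above, but using them together like this is an assumption, and latency should be measured after each change rather than the values copied blindly.

```cfg
# nagios.cfg (main configuration file), Nagios 3.x
# Sketch only: settings taken from the thread, not a tested recommendation.

# Turn on the whole bundle of large-installation optimizations in one step.
use_large_installation_tweaks=1

# Or set the individual components explicitly:

# Fork once instead of twice when running a check.
child_processes_fork_twice=0

# Let the OS reclaim child process memory at exit instead of freeing it by hand.
free_child_process_memory=0

# Do not export macros to checks as environment variables.
enable_environment_macros=0
```

After restarting Nagios, the before and after latency figures can be compared with the nagiostats utility that ships with Nagios (its location depends on the installation).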
Re: [Nagios-users] Nagios did not send notification to some contacts --again
Hi All,

We had an event several weeks ago, and it has happened again. I posted after the first time and am reposting to the same thread because the two events are related. Sorry if this makes it confusing. Marc, thanks for the reply to the first event; see below for my responses.

In the second event a host went down due to a power outage, but only a portion of the contacts were sent notifications (which is the same problem as in the first event). This is the second time, that I know of, that Nagios has failed to send to some of the contacts. The problem has now occurred on two different hosts. I can't explain why it's happening, which does not instill confidence in our customers. Any help or suggestions for fixing this are greatly appreciated.

This next part is from/for the first event.

--- First event ---

>> There was a routing issue on our WAN that caused this event, the SMTP server we use is across the WAN. Could the routing issue have prevented some of the SMTP notifications from being sent, wouldn't they just queue up and go once the problem was resolved?

> They would be queued by the SMTP server running on your nagios machine. Redelivery attempts would occur based on the configuration there.

Okay, makes sense.

>> I have seen messages that did not arrive at the recipients phone but I've never seen Nagios not generate notifications for contacts that are configured for that host or service. Has anyone else seen this, any suggestions on a cause or how to troubleshoot?

> - Check nagios.log for a HOST NOTIFICATION event for that group. Make sure there were no errors logged.

nagios.log only shows notifications sent to some of the contacts; these notifications were received.

> - Check your local SMTP server logs to see if the messages were received there and no errors were reported.

Not necessary; Nagios did not send the notifications.

> - Make sure that nagios has been restarted since adding this group and contacts.

Done. The contact groups in question have been in place for many months.

> - Make sure you don't have multiple nagios daemons running at the same time.

Done. Only a single instance is running.

--- End of first event ---

--- Second event, with logs and configs ---

Below are the configs for the host from the second event. If you look at the log at the bottom, you'll see that 11 of 16 contacts were sent notifications: some, but not all, from each of the configured contact groups. I'm trying to figure out why. Does anyone see a problem with my configs?

Host in question:

CONFIGS:

define host {
    host_name        Host_A
    alias            Host_A
    parents          Host_B
    use              upshost
    contact_groups   +network-email,onguard
    register         1
}

define contactgroup {
    contactgroup_name   network-email
    alias               Users who monitor the network - email only
    members             netuser1,netuser2,netuser3
}

define contactgroup {
    contactgroup_name   onguard
    alias               On Guard Admins
    members             og_user1-phone,og_user2-phone,og_user3,og_user3-home,og_user3-phone,og_user4,og_user4-phone,og_user5-phone,og_user6,og_user6-phone,og_user7,og_user7-phone,og_user8
}

define host {
    name                           upshost
    alias                          NetInfra UPS' template
    check_command                  check-host-alive
    use                            generic-pnp,generic-host
    max_check_attempts             5
    check_interval                 60
    retry_interval                 3
    active_checks_enabled          1
    passive_checks_enabled         1
    flap_detection_enabled         1
    process_perf_data              1
    retain_status_information      1
    retain_nonstatus_information   1
    contact_groups                 network
    notification_interval          60
    notification_period            24x7
    notification_options           d,u,r
    notifications_enabled          1
    register                       0
}

Excerpt from nagios.log:

[1283265540] HOST NOTIFICATION: netuser2-cell;Host_A;UNREACHABLE;alert-host-by-sms;PING CRITICAL - Packet loss = 100%
[1283265540] HOST NOTIFICATION: netuser2-pager;Host_A;UNREACHABLE;alert-host-by-modem;PING CRITICAL - Packet loss = 100%
[1283265540] HOST NOTIFICATION: netuser2;Host_A;UNREACHABLE;alert-host-by-email-long;PING CRITICAL - Packet loss = 100%
[1283265540] H
[Nagios-users] Nagios did not send notification to some contacts
Hi All,

I'm looking for troubleshooting suggestions for a problem we had. During a recent host down event, Nagios sent notifications to only 8 of the 18 configured contacts. The contacts are configured in two contact groups. All 8 contacts that were sent notifications were from the same contact group, but another 4 contacts in that group did not get notifications sent. No contacts from the second group had notifications generated.

There was a routing issue on our WAN that caused this event, and the SMTP server we use is across the WAN. Could the routing issue have prevented some of the SMTP notifications from being sent? Wouldn't they just queue up and go once the problem was resolved?

I have seen messages that did not arrive at the recipient's phone, but I've never seen Nagios fail to generate notifications for contacts that are configured for that host or service. Has anyone else seen this? Any suggestions on a cause or how to troubleshoot?

Nagios 3.2.0 on RHEL 5.5

Thanks,
-greg
Re: [Nagios-users] regularly schedule host checks?
Thanks Max! I'll read up on cached checks; a lot has changed since 1.0.

Regards,
-greg

-----Original Message-----
From: max.schub...@gmail.com [mailto:max.schub...@gmail.com] On Behalf Of Max
Sent: Thursday, July 09, 2009 9:48 AM
To: Frater, Greg J
Cc: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] regularly schedule host checks?

On Thu, Jul 9, 2009 at 11:58 AM, Frater, Greg J wrote:
> Hi All,
>
> I'm wondering if there is any consensus from Nagios users regarding
> scheduled host checks. In general is it better to have Nagios
> schedule host checks or is it better to use the old logic i.e. only
> run the host check if a service check fails. How do you turn off the
> scheduled host checks in version 3? I've been messing with some of
> the host check settings but there's no obvious 'disable scheduled host
> checks'.

The recommended method with Nagios 3 is to run regularly scheduled host checks and then use host check caching, as this will keep Nagios from re-running the host check every time a service fails :):

http://nagios.sourceforge.net/docs/3_0/cachedchecks.html

If you really do not want to use regularly scheduled host checks, set

active_checks_enabled 0

in your host definition; then a host check will only be run on demand when a service check fails.
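
The two options described in the reply above might look something like this in practice. This is a minimal sketch, not a tested configuration: the host name, address, template name, and the numeric values are hypothetical, and only cached_host_check_horizon, active_checks_enabled, and check_interval are standard Nagios 3 directives.

```cfg
# nagios.cfg: how long (in seconds) a previous host check result may be
# reused instead of running the host check again. 15 is illustrative only.
cached_host_check_horizon=15

# Object configuration: regularly scheduled host checks (the recommended way).
define host {
    use                     generic-host    ; hypothetical template name
    host_name               example-host    ; hypothetical host
    address                 192.0.2.10      ; documentation-range address
    active_checks_enabled   1
    check_interval          5               ; illustrative value
}

# Alternative: no scheduled checks. According to the reply, the host check
# is then only run on demand when a service check fails.
define host {
    use                     generic-host    ; hypothetical template name
    host_name               example-host2   ; hypothetical host
    address                 192.0.2.11      ; documentation-range address
    active_checks_enabled   0
}
```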
[Nagios-users] regularly schedule host checks?
Hi All,

I'm wondering if there is any consensus among Nagios users regarding scheduled host checks. In general, is it better to have Nagios schedule host checks, or is it better to use the old logic, i.e. only run the host check if a service check fails? How do you turn off the scheduled host checks in version 3? I've been messing with some of the host check settings, but there's no obvious 'disable scheduled host checks'.

Thanks,
-greg
Re: [Nagios-users] Checking URL on windows-client using NSClient++
Dennis,

You're almost there. You need to set your VBS script to return an exit code of 0-3, as described in the Nagios plugin documentation (http://nagiosplug.sourceforge.net/developer-guidelines.html#PLUGOUTPUT). Then set up NSClient++ to run your script as a plugin check. To do that, follow these basic steps.

1. Add the configuration to the nsc.ini file on each XP workstation you want to run this check on. It should look something like this:

[NRPE Handlers]
webpage_check=c:\windows\system32\cscript.exe //nologo "c:\program files\nsclient++\scripts\web_page_check.vbs"

2. Set up the check in Nagios.

In your commands definition file:

define command {
    command_name    webpage_check
    command_line    $USER1$/check_nrpe -H $HOSTADDRESS$ -t 60 -c webpage_check
}

In your services definition file:

define service {
    host_name             XPhost01, XPhost02
    service_description   check web page on XP host
    display_name          webpage check
    check_command         webpage_check
    use                   service-template
    .
}

That should do it, and it uses your existing script (i.e. no rework).

Good luck, hope that helps,
-greg

From: jvc_dvl...@hotmail.com [mailto:jvc_dvl...@hotmail.com]
Sent: Tuesday, June 30, 2009 4:08 AM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] Checking URL on windows-client using NSClient++

Hi,

I would like to check if a certain website is available on some of my Windows XP clients. These clients are public self-service computers used to make bookings for a railway organisation. Every now and then, some of these self-service computers show a blank screen without the default start page. I would like to check whether the page is shown and, if it's not, have the client notify Nagios, which notifies me by email.

I've tried to run an external script (VBS) which checks the availability of the site. That goes well if you run it manually, but I don't want any popups saying "OK"; it should run in the background. So I've edited the VBS file, and now it does the check and doesn't give any output anymore.

Has anyone else tried something like this before? I use NSClient++ and Nagios v3.0.6.

Kind Regards,
Dennis de Vries.
Re: [Nagios-users] weirdness in the scheduling of host checks
I figured out my problem: I had two instances of Nagios running. That would explain a lot of the scheduling weirdness, maybe all of it. :-)

> Is anyone else seeing weird things in the scheduling of checks? I don't have a good sense of what is wrong but, it's definitely not the way it was under Nagios 1.0 (or the way it should be). I've been watching the scheduling queue on our Nagios 3 box for a week or so, here's a list of what I've seen:
>
> Under Nagios 3.0.6:
> - host checks staying at the top of the queue for a long time (over an hour sometimes) even when they have a timeout set at 30 seconds
>
> Under Nagios 3.1.6
> - host check showing up unexpectedly in the scheduling queue, this morning when I looked at the queue the top event was about 15 minutes behind the current time but things were moving along okay, when I last checked there was a host check at the top of the queue with a next check time from 4 days ago.
> - We had a host go down yesterday (Sunday) but we did not get alerted. When I looked at it in Nagios I noticed the host check was in an OKAY state and the 'last check' value for it was from 12 days ago (6/17/2009)!
> - Host checks don't seem to be getting stuck in the queue like they were under 3.0.6, at least not for as long
>
> I'm going to submit a ticket to tracker.nagios.org but would like to have more empirical evidence of the problem first, all I have so far are symptoms, no good data points (logs, errors, etc.). Is anyone else seeing this type of behavior?
>
> Nagios 3.1.2 (also had trouble with 3.0.6)
> RHEL 5 64 bit
[Nagios-users] weirdness in the scheduling of host checks
Greetings All,

Is anyone else seeing weird things in the scheduling of checks? I don't have a good sense of what is wrong, but it's definitely not the way it was under Nagios 1.0 (or the way it should be). I've been watching the scheduling queue on our Nagios 3 box for a week or so; here's a list of what I've seen:

Under Nagios 3.0.6:
- Host checks staying at the top of the queue for a long time (over an hour sometimes), even when they have a timeout set at 30 seconds.

Under Nagios 3.1.6:
- Host checks showing up unexpectedly in the scheduling queue. This morning when I looked at the queue, the top event was about 15 minutes behind the current time but things were moving along okay; when I last checked, there was a host check at the top of the queue with a next check time from 4 days ago.
- We had a host go down yesterday (Sunday) but we did not get alerted. When I looked at it in Nagios, I noticed the host check was in an OKAY state and the 'last check' value for it was from 12 days ago (6/17/2009)!
- Host checks don't seem to be getting stuck in the queue like they were under 3.0.6, at least not for as long.

I'm going to submit a ticket to tracker.nagios.org but would like to have more empirical evidence of the problem first; all I have so far are symptoms, no good data points (logs, errors, etc.). Is anyone else seeing this type of behavior?

Nagios 3.1.2 (also had trouble with 3.0.6)
RHEL 5 64 bit

Regards,
-greg
Re: [Nagios-users] disk IO for windows?
Hi All,

We just started monitoring IO on our Windows servers. We measure two PerfMon counters (using the NSClient++ agent).

System\File Control Operations/sec
Description: File Control Operations/sec is the combined rate of file system operations that are neither reads nor writes, such as file system control requests and requests for information about device characteristics or status. This is the inverse of System: File Data Operations/sec and is measured in number of operations per second. This counter displays the difference between the values observed in the last two samples, divided by the duration of the sample interval.

System\File Data Operations/sec
Description: File Data Operations/sec is the combined rate of read and write operations on all logical disks on the computer. This is the inverse of System: File Control Operations/sec. This counter displays the difference between the values observed in the last two samples, divided by the duration of the sample interval.

I'm not sure what good thresholds are for these; currently we have a warning at 1 and critical at 2 on our test servers. We've had a couple of servers cross those thresholds, but I don't have a good sense yet of what they should be. I've never seen any documentation regarding what a Windows box can do in terms of IO.

From a command line they look like this:

check_nrpe -H $HOSTADDRESS$ -c CheckCounter -a "\\System\\File Data Operations/Sec" ShowAll MaxWarn=1 MaxCrit=2

Regards,
-greg

-----Original Message-----
From: Curtis LaMasters [mailto:curtislamast...@gmail.com]
Sent: Tuesday, June 09, 2009 8:15 PM
To: Anthony Montibello
Cc: Andreas Ericsson; Nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] disk IO for windows?

I use Disk Idle time as an indicator. Not an original idea :( I was told to mimic the monitoring built into Windows SBS.

Curtis LaMasters
http://www.curtis-lamasters.com
http://www.builtnetworks.com

On Tue, Jun 9, 2009 at 8:27 PM, Anthony Montibello wrote:
> That is partially right.
>
> The Ctrl-Alt-Delete thing is to get to Task Manager, but only on new
> versions of Windows does it give access to some counter names.
>
> The best place to go is Performance Monitor, since that's on all versions
> of Windows since 2000:
> Control Panel --> Administrative Tools --> Computer Management --> then Performance Counter
> On newer systems: Computer Management --> Reliability and Performance
> --> Monitoring Tools --> Performance Monitor
>
> Once you find Performance Monitor, click the green + to get into
> "Add Counters". Click the checkbox to show the counter description, then
> click around till you find what you need. Look for Disks for drive stuff.
>
> Tony (Author of NC_Net)
>
> On Tue, Jun 9, 2009 at 9:24 AM, Andreas Ericsson wrote:
>>
>> dave stern - e-mail.pluribus.unum wrote:
>> > Anyone know of a plug-in or mechanism to log local disk I/O on windows?
>> >
>> > My nagios server is currently using check_nt to connect to windows
>> > hosts via nsclient++. I was hoping perhaps COUNTER has something
>> > buried within it to pull down this info.
>> >
>>
>> There are indeed counters for that, but due to Microsoft's stupidity
>> the counter-names are different depending on which base-language
>> you've used for your windows servers.
>>
>> I don't know what they're named for english platforms (or any other
>> for that matter), but you should be able to view them with that thing
>> you can pop up when pressing ctrl-alt-del (task manager or whatever
>> it's called).
>>
>> --
>> Andreas Ericsson andreas.erics...@op5.se
>> OP5 AB www.op5.se
>> Tel: +46 8-230225 Fax: +46 8-230231
>>
>> Considering the successes of the wars on alcohol, poverty, drugs and
>> terror, I think we should give some serious thought to declaring war
>> on peace.
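
The check_nrpe command line quoted in greg's message could be wrapped in Nagios object definitions along these lines. This is a sketch under assumptions: the command name, service template, and host name are hypothetical; the thresholds 1 and 2 are simply the test values from the message; and it assumes the NSClient++ agent is configured to accept command arguments over NRPE.

```cfg
# Hypothetical command wrapping the check_nrpe call quoted in the message.
# $ARG1$ and $ARG2$ carry the warning and critical thresholds.
define command {
    command_name    check_win_file_data_ops
    command_line    $USER1$/check_nrpe -H $HOSTADDRESS$ -c CheckCounter -a "\\System\\File Data Operations/Sec" ShowAll MaxWarn=$ARG1$ MaxCrit=$ARG2$
}

# Hypothetical service using the command with the thresholds from the message.
define service {
    use                   generic-service            ; hypothetical template name
    host_name             winserver01                ; hypothetical host
    service_description   File Data Operations/sec
    check_command         check_win_file_data_ops!1!2
}
```

Passing the thresholds as arguments keeps one command definition usable for hosts that turn out to need different limits once realistic values are known.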
[Nagios-users] Assigning contacts to hosts confusion
Hi All,

Can someone help me understand the best way to assign contacts to hosts en masse? In version 1.x, we would create a hostgroup, put the hosts that we needed in it, and then assign a contact group to that hostgroup: done deal. Now it appears that under Nagios 3, to accomplish the same thing, I have to create the hostgroup as before and then go to each host and assign a contact or contact group to it. That's much more tedious than the old way. Am I missing something? Please tell me there's an easier way that I'm overlooking.

I found this in the "What's New in Nagios 2" docs (http://nagios.sourceforge.net/docs/2_0/whatsnew.html):

Contact group changes - The contact_groups directive has been moved from hostgroup definitions to host definitions. This was done in order to maintain consistency with the way service contacts are specified. Make sure to update your config files!

Thanks,
-greg
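
One common way to avoid editing every host individually is Nagios object inheritance: put contact_groups in a host template and have the hosts reference it with "use". The fragment below is a minimal sketch assuming Nagios 3; every name in it (templates, hosts, groups, addresses) is hypothetical and not taken from the thread.

```cfg
# Template that carries the contact group once for many hosts.
define host {
    name              unix-hosts-template    ; hypothetical template name
    use               generic-host           ; hypothetical base template
    contact_groups    unix-admins            ; hypothetical contact group
    register          0                      ; template only, not a real host
}

# Each host inherits contact_groups from the template.
define host {
    use          unix-hosts-template
    host_name    web01                       ; hypothetical host
    address      192.0.2.21                  ; documentation-range address
}

define host {
    use          unix-hosts-template
    host_name    web02                       ; hypothetical host
    address      192.0.2.22                  ; documentation-range address
}

# The hostgroup is still defined as before, for display and grouping.
define hostgroup {
    hostgroup_name    unix-hosts             ; hypothetical hostgroup
    alias             Unix hosts
    members           web01,web02
}
```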
Re: [Nagios-users] Im confused about scheduled downtime and notifications
>> Hi All,
>>
>> We've just migrated from Nagios 1 to 3.0.6. I have a host that was in
>> scheduled downtime and turned off. I turned it back on, while still
>> in scheduled downtime, and got a HOST UP alert for it. Is this
>> expected behavior? Could I have a misconfiguration somewhere, where should
>> I look?
>> Has anyone else seen this?

> I've occasionally seen notifications for hosts in scheduled downtime, but only
> in the few seconds after restarting the Nagios daemon. I don't think it's
> expected behaviour, no.

In this case the daemon had been running for 3 days.

> I've not bothered to find a fix for it, as it happens only rarely and so long
> as I don't restart the daemon at three in the morning, it won't bother anyone.

This was on a host that had just been retired. I turned it back on to get a file off of it, and that's when the alert was sent; it had been offline for a while. The interesting thing was that Nagios sent only an UP alert: when I shut the host down again a few minutes later, no DOWN alerts were sent (as would be expected).

It does not appear to be a major problem (no one else responded), so I'm not going to worry about it too much. Thanks for your feedback.

-greg
[Nagios-users] Im confused about scheduled downtime and notifications
Hi All,

We've just migrated from Nagios 1 to 3.0.6. I have a host that was in scheduled downtime and turned off. I turned it back on, while still in scheduled downtime, and got a HOST UP alert for it. Is this expected behavior? Could I have a misconfiguration somewhere, and if so, where should I look? Has anyone else seen this?

Thanks,
-greg
Re: [Nagios-users] stopping and starting NDO daemon
> Hello Greg,
> here it is, simply took one of RHEL's init-scripts and modified it:
> [r...@atvies00nx libexec]# cat /etc/init.d/ndo2db
> #! /bin/sh
...
>     echo $"Usage: $0 {start|stop|restart|try-restart|status}"
>     RETVAL=3
> esac
> exit $RETVAL

Thanks Christian, I'll give that a try.
[Nagios-users] stopping and starting NDO daemon
Hi All, We've just setup nagios 3 using NDO (to support Nagvis) and I realized that NDO does not come with an init script. I found one post to the list (http://article.gmane.org/gmane.network.nagios.user/52477/match=ndo+init ) from about a year ago, someone was asking for help incorporating the NDO daemon into the Nagios init script, but there were no responses. I'm wondering how others are handling the stopping and starting of the NDO daemon? Would it be better to incorporate it into the Nagios init script or should it have it's own, has somebody already done this that is willing to share with the list? Thanks, -greg -- Stay on top of everything new and different, both inside and around Java (TM) technology - register by April 22, and save $200 on the JavaOne (SM) conference, June 2-5, 2009, San Francisco. 300 plus technical and hands-on sessions. Register today. Use priority code J9JMT32. http://p.sf.net/sfu/p___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
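For anyone finding this thread later, a minimal sketch of a standalone init-style wrapper for the NDO daemon could look like the following. This is not the script Christian posted, just an illustration of the start/stop/status structure. The binary and config paths are assumptions based on a default source install under /usr/local/nagios, the function name ndo2db_ctl is made up, and pgrep/pkill are used instead of a PID file to keep it short.

```shell
#!/bin/bash
# Sketch of an init-style control script for ndo2db.
# ASSUMPTIONS: the paths below match a default source install; adjust as needed.
NDO2DB_BIN="${NDO2DB_BIN:-/usr/local/nagios/bin/ndo2db}"
NDO2DB_CFG="${NDO2DB_CFG:-/usr/local/nagios/etc/ndo2db.cfg}"

ndo2db_ctl() {
    case "$1" in
        start)
            if pgrep -x ndo2db >/dev/null; then
                echo "ndo2db is already running"
                return 0
            fi
            "$NDO2DB_BIN" -c "$NDO2DB_CFG"
            ;;
        stop)
            pkill -x ndo2db
            ;;
        restart)
            ndo2db_ctl stop
            sleep 1
            ndo2db_ctl start
            ;;
        status)
            if pgrep -x ndo2db >/dev/null; then
                echo "ndo2db is running"
            else
                echo "ndo2db is stopped"
                return 3
            fi
            ;;
        *)
            echo "Usage: $0 {start|stop|restart|status}"
            return 3
            ;;
    esac
}

# Only act when called with an argument, e.g. "/etc/init.d/ndo2db start".
if [ $# -gt 0 ]; then
    ndo2db_ctl "$1"
    exit $?
fi
```

Keeping it as its own script (rather than folding it into the Nagios init script) makes it easier to start ndo2db before Nagios and stop it afterwards, which is the order the broker module generally wants.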
Re: [Nagios-users] Some hostgroups NOT displaying (weirdness) SOLVED
Okay, I apparently have not learned the lesson to check the archives first. I found the answer there (http://article.gmane.org/gmane.network.nagios.user/45395/match=hostgrou p+not+display), I did not check there until after I posted. Anyways for future reference our problem was due to NOT having the web ui users as a contact of a hosts in the offending hostgroups. Regards, -greg From: Frater, Greg J Sent: Tuesday, April 07, 2009 1:57 PM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] Some hostgroups NOT displaying (weirdness) Greetings All, I'm pretty lost and could use some help. We are moving a Nagios 1.x setup to Nagios 3.0.6. We have about 550 hosts and some 5000 services. About 3/4 of the way through our migration of config files we noticed that some of our hostgroups are not showing up in the web ui, but I have yet to figure out why. There are certain hostgroups (but not all) that are not displaying and if they are listed in the 'hostgroup_members' directive of another hostgroup it will cause the second hostgroup not to display either. I've tried every combination of member and hostgroup_members that I can think of but so far the only way I can duplicate the problem is use one of the known problem groups. Does anyone know what the logic or rules are that are used to determine if a hostgroup will display in the web ui or not? Any other suggestions on what to try to track this down? Any help is appreciated. Thanks, -greg -- This SF.net email is sponsored by: High Quality Requirements in a Collaborative Environment. Download a free trial of Rational Requirements Composer Now! http://p.sf.net/sfu/www-ibm-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Some hostgroups NOT displaying (weirdness)
Greetings All, I'm pretty lost and could use some help. We are moving a Nagios 1.x setup to Nagios 3.0.6. We have about 550 hosts and some 5000 services. About 3/4 of the way through our migration of config files we noticed that some of our hostgroups are not showing up in the web ui, but I have yet to figure out why. There are certain hostgroups (but not all) that are not displaying and if they are listed in the 'hostgroup_members' directive of another hostgroup it will cause the second hostgroup not to display either. I've tried every combination of member and hostgroup_members that I can think of but so far the only way I can duplicate the problem is use one of the known problem groups. Does anyone know what the logic or rules are that are used to determine if a hostgroup will display in the web ui or not? Any other suggestions on what to try to track this down? Any help is appreciated. Thanks, -greg -- This SF.net email is sponsored by: High Quality Requirements in a Collaborative Environment. Download a free trial of Rational Requirements Composer Now! http://p.sf.net/sfu/www-ibm-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] nsclient 0.3.6-RC4-x64 issues
Have you checked the firewall settings in 2008? If they have been enabled then they will block the nsclient++ exe from communicating on the network unless you tell the firewall to let nsclient++ traffic through. That's the next thing I would check. Good luck, -greg -Original Message- From: Joseph L. Casale [mailto:jcas...@activenetwerx.com] Sent: Saturday, February 14, 2009 12:54 PM To: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] nsclient 0.3.6-RC4-x64 issues >Normally Windows monitoring use NC_Net or NSClient++. Can you give more >details on the check you're doing (the real command, not check_command)? Well, just started with Nagios, and I am using one of the windows.cfg templates with NSClient++ installed on the Windows Server 2008x64 boxes. >If you're using check_nt or one of its substitute (NC_Net has >check_nc_net, NSClient++ may have one too) make sure the proper service >is installed and running on the Windows server. Try restarting the >service if it's already running. Make sure the configuration is good >(especially port and password). I am using check_nt and the port/pass is correct for WinXP machines, but the same ini on the 2008 machines doesn't work. Given Nagios suggests a socket error, I suspect it is probably some bug in the x64 client on the 2008 machines... In the meantime, since I am just starting out, I have been reading up on snmp as that is how I am preferring to do my monitoring rather than have to install any applications. 
jlc -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Using a modem with Nagios as backup alert path
>-Original Message-
>From: Russell Adams [mailto:rlad...@adamsinfoserv.com]
>Sent: Thursday, February 12, 2009 4:12 PM
>To: nagios-users@lists.sourceforge.net
>Subject: Re: [Nagios-users] Using a modem with Nagios as backup alert path

>On Thu, Feb 12, 2009 at 03:05:54PM -0800, Frater, Greg J wrote:
>> Greetings All,
>>
>> We are setting up a new Nagios instance running version 3. With it we
>> are adding a modem so that alerts can be sent via the phone system
>> even if the data network goes down. In the testing I've done messages
>> sent via the modem take quite a bit longer to be received than alerts
>> sent via SMTP, this is because the modem goes through the dial up
>> connection process and then sends the messages to each recipient in
>> separate sessions, so I want to keep SMTP as the primary method and
>> use the modem as a backup path. What is the best way to do this, can
>> we do it without a bunch of work to implement? Suggestions and advice gladly accepted.
>>
>> Thanks,
>>
>> -greg

>What are you sending via modem, SMS, text pages? Are you connecting to an ISP via PPP to fire off one SMTP message?

The modem is sending SMS messages.

>I have successfully used Sendpage with Nagios to send to IXO/TAP text pagers, and it supports piggybacking to send out multiple pages in a single phone call. I think it supports sending SMS too.

I guess I need to have a closer look at the sendpage config. There's probably a way to do that, I just didn't take the time to dig into it.
Re: [Nagios-users] Using a modem with Nagios as backup alert path
> From: Gareth Fletcher [mailto:gareth.fletc...@onenet.co.nz] > Sent: Thursday, February 12, 2009 3:44 PM > To: Frater, Greg J > Subject: RE: Using a modem with Nagios as backup alert path > Hi Greg, > Just something to get you started before someone with more knowledge answers - what about monitoring the data network in Nagios, create an event handler so if it goes down change bring the modem up and start sending messages that way. When it comes back up you can keep SMTP primary. I'm not sure how to effectively check the "network" to see if SMTP is available, on one hand it seems straight forward but when I think about details it does not seem so obvious. My SMTP path looks something like this: Nagios -> local sendmail daemon -> smtp smarthost -> corporate smtp gateway -> corporate internet connection -> internet Maybe it's overkill to worry about all of those pieces but in my paranoid (i.e. mad boss wanting to know why we didn't know about a major outage) mind I don't want to leave anything out :-) Honestly I'm not checking all of those things now, but I would like to be a bit more thorough on the new setup, some of those things I don't have access to be able to check them. > How come you have to dial up each time to send alerts? That's a good question, that's just the way it worked when I first set it up and I did not check for another or better way. -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Using a modem with Nagios as backup alert path
Greetings All, We are setting up a new Nagios instance running version 3. With it we are adding a modem so that alerts can be sent via the phone system even if the data network goes down. In the testing I've done messages sent via the modem take quite a bit longer to be received than alerts sent via SMTP, this is because the modem goes through the dial up connection process and then sends the messages to each recipient in separate sessions, so I want to keep SMTP as the primary method and use the modem as a backup path. What is the best way to do this, can we do it without a bunch of work to implement? Suggestions and advice gladly accepted. Thanks, -greg -- Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA -OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise -Strategies to boost innovation and cut costs with open source participation -Receive a $600 discount off the registration fee with the source code: SFAD http://p.sf.net/sfu/XcvMzF8H___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
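One low-effort way to get "SMTP first, modem second" is a small wrapper that Nagios calls as its notification command: it tries the primary sender and only falls back to the modem if that fails. The sketch below assumes both senders read the message on stdin; the function name and the example commands in the comments are placeholders, not a tested setup. Keep in mind that a local sendmail will normally accept and queue a message even when the upstream relay is down, so in practice the primary command would need some reachability test of its own.

```shell
#!/bin/bash
# notify_with_fallback PRIMARY_CMD BACKUP_CMD MESSAGE
# Pipes MESSAGE to PRIMARY_CMD; if that exits non-zero, pipes it to BACKUP_CMD.
# Prints which path was used so it can be logged.
notify_with_fallback() {
    local primary="$1" backup="$2" message="$3"
    if printf '%s\n' "$message" | sh -c "$primary"; then
        echo "primary"
    elif printf '%s\n' "$message" | sh -c "$backup"; then
        echo "backup"
    else
        echo "failed"
        return 1
    fi
}

# Hypothetical use from a Nagios notification command (all values are placeholders):
#   notify_with_fallback \
#       "mail -s 'Nagios alert' oncall@example.com" \
#       "your-modem-sms-command" \
#       "$NAGIOS_SERVICEDESC on $NAGIOS_HOSTNAME is $NAGIOS_SERVICESTATE"
```

The wrapper keeps the existing contacts and commands as they are; only the notification command line changes, so it should not be a bunch of work to try out.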
Re: [Nagios-users] Monitoring Windows Eventviewer
> Thanks for the info, > Great that it can be done with the NSClient++ as I have it installed on all our servers. > What do I need to define in the NSClient++ agent and where ? > Is it in the .ini file or elsewhere ? The only thing you need to do on the Windows server is enable the CheckEventLog.dll by removing the semicolon from that line in the nsc.ini file. Regards, -greg From: Frater, Greg J [mailto:[EMAIL PROTECTED] Sent: maandag 2 juni 2008 21:41 To: Tim Van Caeyzeele Cc: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] Monitoring Windows Eventviewer >Dear All, >Would anyone have experience in checking the windows eventviewer for certain events, or turning nagios red in case of ERRORs ? >What script are you using ? preferably something that can simply interact with NSClient We do this using the NSClient++ agent (www.nsclient.org ). It checks the event logs and filters them based on criteria you define, alerting when the number of hits you specify is reached (i.e. when the system log has 1 or more events with an ID of within the last 10 minutes send alerts). Here is an example we use to monitor for a specific Oracle error. In the example we check the "application" log of the server every "60" minutes for events with an ID of "20" with event type of "Error" containing a string in the text of the message "Can not allocate log", check turns critical after 1 matching event is found that is time stamped within the last "65" minutes. Checkcommands.cfg: define command{ command_namecheck_eventlogs command_line$USER1$/check_nrpe -H $HOSTADDRESS$ -p 5666 -c checkEventLog -a filter=new $ARG1$ MaxWarn=$ARG2$ MaxCrit=$ARG3$ filter-generated=\$ARG4$ $ARG5$ truncate=$ARG6$ # Desc: # $ARG1$ = event logs to check (i.e. file=system file=application) # $ARG2$ = Warning level (i.e. number of hits to generate a warning response) # $ARG3$ = Critical level (i.e. number of hits to generate a critcal response) # $ARG4$ = Time period (i.e. 
1 day is '1d' 30 hours is '>30h') # $ARG5$ = Filters (i.e. filter-eventID==9009 filter-eventSource=Tcpip) see http://www.nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog <http://www.nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog> for detailed info # $ARG6$ = Amount of data to return in characters (i.e. truncate=150) # Example: check_nrpe -H server_name_here -p 5666 -c checkEventLog -a filter=new file=system MaxWarn=1 MaxCrit=1 filter-generated=\>30h filter+eventID==10002 descriptions truncate=138 } Services.cfg: define service{ use standard-srv service_description eventlog: Oracle archive log errors check_command check_eventlogs!file=application!1!1!>65m!filter+eventID==20 filter+eventType==error filter+message=substr:"Can not allocate log"!100 normal_check_interval 60 notification_optionsw,c contact_groups apps host_name server1, server2 } HTH, -greg - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring Windows Eventviewer
>Dear All,
>Would anyone have experience in checking the windows eventviewer for certain events, or turning nagios red in case of ERRORs ?
>What script are you using ? preferably something that can simply interact with NSClient

We do this using the NSClient++ agent (www.nsclient.org). It checks the event logs and filters them based on criteria you define, alerting when the number of hits you specify is reached (i.e. when the system log has 1 or more events with a given ID within the last 10 minutes, send alerts).

Here is an example we use to monitor for a specific Oracle error. In the example we check the "application" log of the server every "60" minutes for events with an ID of "20" and an event type of "Error" whose message text contains the string "Can not allocate log". The check turns critical after 1 matching event is found that is time stamped within the last "65" minutes.

Checkcommands.cfg:

define command{
    command_name    check_eventlogs
    command_line    $USER1$/check_nrpe -H $HOSTADDRESS$ -p 5666 -c checkEventLog -a filter=new $ARG1$ MaxWarn=$ARG2$ MaxCrit=$ARG3$ filter-generated=\$ARG4$ $ARG5$ truncate=$ARG6$
    # Desc:
    # $ARG1$ = event logs to check (i.e. file=system file=application)
    # $ARG2$ = Warning level (i.e. number of hits to generate a warning response)
    # $ARG3$ = Critical level (i.e. number of hits to generate a critical response)
    # $ARG4$ = Time period (i.e. 1 day is '1d' 30 hours is '>30h')
    # $ARG5$ = Filters (i.e. filter-eventID==9009 filter-eventSource=Tcpip) see http://www.nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog for detailed info
    # $ARG6$ = Amount of data to return in characters (i.e. truncate=150)
    # Example: check_nrpe -H server_name_here -p 5666 -c checkEventLog -a filter=new file=system MaxWarn=1 MaxCrit=1 filter-generated=\>30h filter+eventID==10002 descriptions truncate=138
    }

Services.cfg:

define service{
    use                     standard-srv
    service_description     eventlog: Oracle archive log errors
    check_command           check_eventlogs!file=application!1!1!>65m!filter+eventID==20 filter+eventType==error filter+message=substr:"Can not allocate log"!100
    normal_check_interval   60
    notification_options    w,c
    contact_groups          apps
    host_name               server1, server2
    }

HTH,
-greg
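To see what actually gets run on the Nagios side, it can help to expand the macros by hand. The sketch below rebuilds the check_nrpe command line from the '!'-separated arguments the same way the check_eventlogs command definition does. The function name and the plugin path are placeholders, and it only prints the command rather than running it.

```shell
#!/bin/bash
# Build the check_nrpe command line the way the check_eventlogs definition does.
# ASSUMPTION: PLUGIN_DIR stands in for $USER1$; set it to your plugin directory.
PLUGIN_DIR="${PLUGIN_DIR:-/usr/local/nagios/libexec}"

build_eventlog_cmd() {
    local host="$1" logs="$2" warn="$3" crit="$4" period="$5" filters="$6" trunc="$7"
    # The backslash before the period mirrors filter-generated=\$ARG4$, so that
    # a leading '>' is not treated as a shell redirect when the command runs.
    printf '%s/check_nrpe -H %s -p 5666 -c checkEventLog -a filter=new %s MaxWarn=%s MaxCrit=%s filter-generated=\\%s %s truncate=%s\n' \
        "$PLUGIN_DIR" "$host" "$logs" "$warn" "$crit" "$period" "$filters" "$trunc"
}

# Print the expanded command for part of the Oracle example (host name is a placeholder).
build_eventlog_cmd server1 file=application 1 1 '>65m' \
    'filter+eventID==20 filter+eventType==error' 100
```

Pasting the printed line into a shell on the Nagios server is a quick way to debug the filters before touching the service definition.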
Re: [Nagios-users] SMS and ATT with Nagios
Correct, it is good to have at least two ways out, modem and network. -Original Message- From: Tom Brown [mailto:[EMAIL PROTECTED] Sent: Monday, May 12, 2008 8:04 AM To: Frater, Greg J Cc: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] SMS and ATT with Nagios > > We use SMS for our alerts, it is our primary mechanism. We send them > out using the same Nagios command as the email alerts. To send alerts > to ATT SMS addresses you send them as email from Nagios to > [EMAIL PROTECTED] > > providing your network is still abot to connect to the outside world - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] SMS and ATT with Nagios
>Hi, >Luis Fernando Lacayo <[EMAIL PROTECTED]> wrote on 09.05.08 16:38: >> Good Morning all, >I have to change my NAGIOS platform to a Dell Blade on RHEL >> 5. I >currently use a modem and qpage to send out notifications. Since > there >is no way to attach a modem to a blade, I am thinking on sending >> the >alerts via SMS. Our carrier is ATT, is there anyone out there >> currently >doing this? >Can you share how you are doing this? >Thanks, >Luis >Multitech has Modems with Ethernet Interfaces. You just connect them to the Network and send a text oder sms via Telnet commands: >http://www.multitech.com/PRODUCTS/Categories/Device_Networking/ We use SMS for our alerts, it is our primary mechanism. We send them out using the same Nagios command as the email alerts. To send alerts to ATT SMS addresses you send them as email from Nagios to [EMAIL PROTECTED] HTH -greg - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
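Since plain SMS cuts off at around 160 characters, it is worth keeping the alert text short and putting the important parts first. Below is a rough sketch of a helper that does this before handing the text to mail. The function name is made up, and PAGER_ADDRESS is a placeholder for whatever gateway address your carrier gives you.

```shell
#!/bin/bash
# format_sms STATE HOST OUTPUT -> one line, at most 160 characters.
format_sms() {
    local msg="$1 $2: $3"
    printf '%s\n' "${msg:0:160}"
}

# Hypothetical use from a notification command (PAGER_ADDRESS is a placeholder):
#   format_sms "$NAGIOS_HOSTSTATE" "$NAGIOS_HOSTNAME" "$NAGIOS_HOSTOUTPUT" \
#       | mail -s "Nagios" "$PAGER_ADDRESS"
```

Putting the state and host name first means the part most likely to survive truncation is the part you need at three in the morning.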
Re: [Nagios-users] nagios reboot XP
>Palle L Jensen wrote:
>>
>> I guess I should have been more specific, sorry about that.
>>
>> I am running Nagios 3.0.1 on openSUSE 10.3, and I am only monitoring
>> windows hosts. On the windows hosts I am running NSClient++. That
>> seems to work good and I would prefer not to change to any other client.
>>
>> I have read about the event handlers in the Nagios documentation, but
>> the samples there only refers to nrpe and scripts on the server. I
>> understand how to define the service in Nagios, but I am not sure how
>> to create the script that executes the batch file on the windows host?
>> Maybe I am not getting the point and understanding on the documentation.
>>
>> Lets say I created a batch file that will start or restart a service,
>> this batch file is located in c:\NSclient++\scripts folder. And then
>> defined the service and command in Nagios:
>>
>> define service{
>>     host_name            somehost
>>     service_description  sql-service
>>     max_check_attempts   4
>>     event_handler        restart-sqlagent
>> }
>>
>> define command{
>>     command_name  restart-sqlagent
>>     command_line  what do I add here?? It has to be something that can be sent to the Nsclient++/windows host and execute the batch script in c:\NSclient++\scripts folder
>> }
>>
>> The command line on the eventhandler documentation shows a path to a
>> local script and local command "/etc/rc.d/init.d/httpd restart"
>>
>> So creating an event handler for services or others on the box where
>> Nagios is running, I understand. But not how to do it remotely to a
>> windows host.
>>
>> If you could shed some more light over this I would very much
>> appreciate it.
>>
>> Thanks,
>>
>> PJ
>>
>You need a Unix-side script to execute and handle the condition
>appropriately, for example to only call the restart when the service is
>in hard failure. A sample Bash script is given in the docs to get you
>started.
>If using Bash (which I recommend) the Bash script should check the state
>and perform the logic control, calling the batch or script on the
>windows side through an NRPE call to nsclient++ to execute the script on
>the windows host.

The NSClient++ data collector's primary mechanism for actions (getting it to do stuff) is via NRPE. It will respond to the old nsclient command as well, but the majority of its functions are utilized through NRPE. This is the case with your situation as well. On the Nagios side you would define a check like you have above, and in the NSC.ini file you define a handler for the command. Below is an example of one we use to check the CPU utilization of any process on the server.

On the Nagios server define your service as above and use something like this for the command_line line:

    command_line    $USER1$/check_nrpe -H $HOSTADDRESS$ -t 60 -c wmi_process_perf -a $ARG1$

On the Windows server, in the NSC.ini file under the NRPE Handlers section, put something like this:

    wmi_process_perf=c:\windows\system32\cscript.exe //nologo "c:\program files\nsclient++\scripts\wmi_cpu_perf.vbs" /process:$ARG1$

In this example the $ARG1$ value is the name of a running process on the server. The check returns the CPU and memory utilization of the process.

HTH,
-greg
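To tie this back to the original question about restarting a service, here is a rough sketch of the Nagios-side event handler script, modelled loosely on the sample in the Nagios event handler docs. It only acts once the service is in a HARD CRITICAL state and then asks NSClient++ over NRPE to run a handler. The handler name restart_sqlagent, the script name in the comment and the plugin path are all made up for illustration; the handler would need a matching line in NSC.ini pointing at the batch file.

```shell
#!/bin/bash
# Event handler sketch: restart a Windows service via NRPE on hard failure.
# Expected arguments: $SERVICESTATE$ $SERVICESTATETYPE$ $SERVICEATTEMPT$ $HOSTADDRESS$
# ASSUMPTIONS: the plugin path and the NRPE handler name are placeholders.
CHECK_NRPE="${CHECK_NRPE:-/usr/local/nagios/libexec/check_nrpe}"
NRPE_HANDLER="${NRPE_HANDLER:-restart_sqlagent}"

# Prints "restart" when the handler should act, "none" otherwise.
decide_action() {
    local state="$1" state_type="$2"
    if [ "$state" = "CRITICAL" ] && [ "$state_type" = "HARD" ]; then
        echo "restart"
    else
        echo "none"
    fi
}

handle_event() {
    local state="$1" state_type="$2" attempt="$3" host="$4"
    if [ "$(decide_action "$state" "$state_type")" = "restart" ]; then
        echo "Restarting service on $host (attempt $attempt)"
        "$CHECK_NRPE" -H "$host" -t 60 -c "$NRPE_HANDLER"
    fi
}

# Nagios passes four arguments; do nothing when run without them.
if [ $# -ge 4 ]; then
    handle_event "$@"
fi

# Hypothetical command definition to go with it:
#   command_line  $USER1$/eventhandlers/restart-sqlagent.sh $SERVICESTATE$ $SERVICESTATETYPE$ $SERVICEATTEMPT$ $HOSTADDRESS$
```

Keeping the decision in its own function makes it easy to change the policy later, for example to also act on the third SOFT attempt as the docs sample does.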
[Nagios-users] (no subject)
>> -Original Message- >> From: [EMAIL PROTECTED] [mailto:nagios-users- >> [EMAIL PROTECTED] On Behalf Of Frater, Greg J >> Sent: Monday, April 28, 2008 11:20 AM >> To: nagios-users@lists.sourceforge.net >> Subject: [Nagios-users] A bit OT: sending MMS messages from Linux >> >> Greetings All, >> >> My apologies if this is to much off topic. >> >> Linux server. The common link between all the scenarios we have tried is the >> Linux server and the sendmail transfer agent. Does anyone on the list send >Sounds like it's just a normal e-mail to a specific address, correct? Yes, as far as Linux/Nagios is concerned they are standard messages to SMTP addresses. >Have you verified that sendmail is behaving itself as a normal mail server on the Internet? It has valid public reverse DNS? Forward DNS matches reverse DNS? HELO name is a valid FQDN, preferably matching reverse DNS for the IP? What do your sendmail logs show? Is there a bounce generated? We use the same Nagios command to send successfully to the SMS addresses ([EMAIL PROTECTED] vs [EMAIL PROTECTED] for MMS) of the same phones (i.e. using the same SMTP servers to get out). The sendmail logs show the messages as being delivered Okay. I did some testing, I'm trying different things to try and discover where the problem lies. Test: Send 10 messages from old RH/sendmail box to MMS address of cell phone, Send 10 messages from new RH/sendmail box to MMS address of cell phone, Send 10 messages from windows box (with sendmail port) to the MMS address of cell phone. All messages are sent one after another from the command line. Results: Usually all 10 of the messages from the Windows box come through, though not always. Some of the Linux generated messages come through I've gotten as many as 4 and as few as none. The new versus old versions of Linux and Sendmail do not seem to make a difference. The messages will come through on occasion but not consistently. Not good for a monitoring solution. 
I should rephrase my original questions: Is anyone else using MMS to receive Nagios alerts? Do you ever notice any alerts not getting to your phone? What provider are you using? Thanks, -greg - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
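For repeating that kind of test, a small loop that numbers each message makes it easier to see which ones went missing on the phone. This is only a sketch: the function name is made up, the address in the comment is a placeholder, and MAILER defaults to the stock mail command but can be pointed at something else.

```shell
#!/bin/bash
# send_test_batch COUNT ADDRESS
# Sends COUNT numbered test messages so gaps on the phone are easy to spot.
# ASSUMPTION: MAILER accepts "-s SUBJECT ADDRESS" and reads the body on stdin.
MAILER="${MAILER:-mail}"

send_test_batch() {
    local count="$1" address="$2" i
    for ((i = 1; i <= count; i++)); do
        printf 'Test message %d of %d from %s at %s\n' \
            "$i" "$count" "$(hostname)" "$(date '+%H:%M:%S')" |
            "$MAILER" -s "MMS test $i/$count" "$address"
    done
}

# Example (placeholder address):
#   send_test_batch 10 number@mms-gateway.example
```

Running the same batch from each box and comparing the numbers that arrive with the sendmail logs should show whether the losses happen before or after the messages leave the server.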
[Nagios-users] A bit OT: sending MMS messages from Linux
Greetings All, My apologies if this is to much off topic. We are having trouble sending alerts from our Nagios server to the MMS address of our cell phones. We have been sending SMS messages for years with great success, however MMS messages are never received. We would like to use MMS because SMS messages are limited to 160 characters, MMS has a much higher limit. The problem is not with Nagios but instead (I think) with the sendmail mechanism it uses when sending messages. I'm just hoping someone on the list has dealt with this already and can help me figure it out. We use Sendmail to send massages from the Nagios server to an internal SMTP gateway that forwards the messages on to the Internet. We can successfully send MMS messages from Outlook clients on the desktop which use the same SMTP gateways as the Nagios server, the phones receive these without any trouble. We have tried both ATT and Verizon systems and neither will deliver the MMS messages from the Nagios server. We have also tested sending the messages from the command line on the Linux server. The common link between all the scenarios we have tried is the Linux server and the sendmail transfer agent. Does anyone on the list send messages to MMS addresses? If so did you have any trouble getting it to work, if not what OS and mail transfer agent are you using? We have tested this from Red Hat 7.3/Sendmail 8.11 and Red Hat Enterprise Linux 5/Sendmail 8.13 Thanks, -greg - This SF.net email is sponsored by the 2008 JavaOne(SM) Conference Don't miss this year's exciting event. There's still time to save $100. Use priority code J8TL2D2. http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring Windows Servers - [New domain name for nsclient++]
-Original Message-
From: Michael Medin [mailto:[EMAIL PROTECTED]
Sent: Monday, April 07, 2008 9:59 PM
To: Frater, Greg J
Cc: nagios-users@lists.sourceforge.net
Subject: Re: [Nagios-users] Monitoring Windows Servers - [New domain name for nsclient++]

>> (found here: http://trac.nakednuns.org/nscp - don't ask me about the
>> URL I don't know why it is what it is - it seems safe though)
>
>LOL, everyone seems to mention this lately. Well, here is my official apology to anyone who has taken offense at the domain name: *SORRY*
>Anyway, try http://NSCLIENT.ORG, http://nsclient.com or http://nsclient.net; hopefully these are "better". (If anyone wants to know the
>reason for using the old one, there is a post on the forum about it: http://nsclient.org/nscp/discussion/2/120 )

All I know is I got a visit from our security guy asking for an explanation of my visits to that site.

>The preferred domain is nsclient.ORG, but since GoDaddy was so damn cheap (anyone know why? I pay 10 times as much for my "previous
>domains") I got the others while I was at it...

Excellent, I will use the new domain (though I must admit nakednuns is pretty memorable).

>And if any apache2 configuration gurus know how to combine location (I use a location with mod python inside it to "call trac") and mod_rewrite, let me know, because I would have preferred to drop the trailing "/nscp", but alas Apache was giving me attitude...

I would not be of much help here; I'm not an Apache guy.

-greg
Re: [Nagios-users] Monitoring Windows Servers - Comparing Options
>No input from anyone? One of the selling points to move away from proprietary solutions and towards OSS was the helpfulness of the community and mailing list. I know I don't have a 'problem' listed below that can easily be solved, but surely there are some opinions out there?

Alright, I'll respond. Most of the time I just lurk here; most questions are answered quite well by others in a relatively short time. This is my 2 cents. See my responses below, amongst the questions from your first post.

>So, my question is, what is the best option for monitoring Windows Servers and why?
>An important factor to consider is that I will probably not have Administrative access to any of the Windows Servers that I will be
>monitoring.
>I'm currently considering three options for the reasons shown below (in order of preference):

>1) WMI checks
>Pros
>-Complete control of NRPE service on 'my' Windows Server
>(Can this be distributed over two boxes?)
>-Complete control of check commands on Nagios Server

WMI provides a very thorough list of things you can check.
It's built into the OS, i.e. no install or configuration required.

>Cons
>-NRPE service must be run from an account with access to all Windows Servers.

WMI can fail. From time to time we have a server with WMI problems. We've been able to fix WMI at times; other times we've had to rebuild the OS, and in those cases you would not have any monitoring. WMI is used by other things as well, and is susceptible to getting corrupted or reconfigured by something (or in our case someone) else. I think we've resolved most of our issues with WMI; however, I learned that if it's broken, it does not matter why: you can't use it to monitor anything until it's fixed.

WMI requires common MS technology such as common MS network ports and Windows (AD) user accounts for security.
This is not a problem for the most part, but it increases the risk when monitoring a DMZ, for example, and means your monitoring is dependent upon AD and thus anything AD needs (like DNS), etc. If AD or DNS goes down you've got bigger problems than whether or not your Nagios agents are working; however, if that is how you watch everything, then when they fail, will you know it?

>2) Agent checks - NRPE-NT or some other current Windows Agent
>Pros

You're not relying on an MS technology. Obviously the OS has to be available, but you're not dependent upon WMI or .NET or MDAC, etc. This may sound ironic, but the more independent your monitoring system is of the system it's watching, the better (IMO).

Flexibility/Extensibility. For example, the agent we use (nscp) (found here: http://trac.nakednuns.org/nscp - don't ask me about the URL, I don't know why it is what it is - it seems safe though) is very flexible, supporting either the original NSClient on the Nagios server side or NRPE, and can check pretty much anything (including WMI, perfmon, event logs, plus some built-in checks and custom scripts). The documentation is not one of its strong points, but it is very capable and reliable. I've struggled at times getting a particular type of check working because I could not understand the docs. I have not run into any problems with it in terms of crashes, memory leaks, etc. After about two years of using it (various versions) I can only think of two occasions when it has either generated an error or stopped running, and that covers over 200 Windows boxes. It can be installed and uninstalled without server reboots. It just works, I like it. Better docs would be nice though. I'm not trying to sell you on this particular agent or even agents in general; this is my experience.

>Cons
>-Requires an agent setup and running on every Windows Server
>-No direct control of agents/check commands

Some checks via an agent (like custom scripts) are not as quick as SNMP (and probably WMI).
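
To give a rough idea of what a "custom script" check can look like, here is a minimal sketch in Python. It is not taken from nscp or from anyone's setup in this thread. It assumes only the standard Nagios plugin conventions: exit codes 0/1/2/3 for OK/WARNING/CRITICAL/UNKNOWN, and one line of output with optional performance data after a "|". The function names and the 80/90 percent thresholds are made up for the example, and how the agent actually invokes the script depends on its configuration, which is not shown here.

```python
import shutil

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
LABELS = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL", UNKNOWN: "UNKNOWN"}


def evaluate(used_pct, warn, crit):
    """Map a usage percentage to a Nagios status code."""
    if used_pct >= crit:
        return CRITICAL
    if used_pct >= warn:
        return WARNING
    return OK


def format_output(path, used_pct, warn, crit):
    """Build the (status, output line) pair for a given usage percentage."""
    status = evaluate(used_pct, warn, crit)
    # Text before '|' is for humans; text after it is performance data
    # in the form label=value[UOM];warn;crit;min;max.
    line = (
        f"DISK {LABELS[status]} - {path} {used_pct:.1f}% used"
        f" | used_pct={used_pct:.1f}%;{warn};{crit};0;100"
    )
    return status, line


def check_disk(path, warn=80.0, crit=90.0):
    """Check how full the filesystem holding `path` is (hypothetical thresholds)."""
    try:
        usage = shutil.disk_usage(path)
    except OSError as exc:
        # If the check itself cannot run, report UNKNOWN rather than guessing.
        return UNKNOWN, f"DISK UNKNOWN - cannot read {path}: {exc}"
    if usage.total == 0:
        return UNKNOWN, f"DISK UNKNOWN - {path} reports zero total size"
    used_pct = 100.0 * usage.used / usage.total
    return format_output(path, used_pct, warn, crit)
```

A wrapper would call `check_disk()`, print the returned line, and exit with the returned code, e.g. `code, msg = check_disk("C:\\"); print(msg); sys.exit(code)`. Each run of a script like this starts a new interpreter on the monitored box, which is one reason such checks tend to be slower than an SNMP query.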
>3) SNMP based checks
>Pros
>-Complete control over check commands on Nagios Server

SNMP checks are very fast.
Once configured, no changes are required on the server (i.e. SNMP does not require version upgrades, etc.).

>Cons
>-No direct control of SNMP Community Strings/ACL allowing access from Nagios
>-No direct control of SNMP service

The values exposed via SNMP on Windows servers are limited. You can do basic monitoring of disks, RAM, CPU, etc., but not advanced things such as CPU user mode vs. kernel mode usage. You can get an SNMP extension such as SNMP-Informant (http://www.snmp-informant.com/) to resolve this issue.

SNMP can fail. We have a small portion of our servers, both 2000 and 2003, where the SNMP service won't stay running. We do some SNMP monitoring but not a lot; I've not solved this one yet.

>How do you do it? Are there any other Pros & Cons that I might be missing? Do yo