Re: [Nagios-users] Plugin exists?
On 12/21/2012 09:50 AM, RichTea wrote: > Hi, > > I wrote something similar to this a while back, it looks at data pairs in a > text file eg :. > > It should not be to hard to make it work with csv, you will have to ignore > the shoddy coding though! > > http://23.me.uk/scripts/check_jmxview.pl.text > http://23.me.uk/scripts/check_jmxview.server.cfg.text > > I might even be able to find the Nagios related config for this if needs be. > > Ritchie I'll take a look - tnx! DR -- LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial Remotely access PCs and mobile devices and provide instant support Improve your efficiency, and focus on delivering more value-add services Discover what IT Professionals Know. Rescue delivers http://p.sf.net/sfu/logmein_12329d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Plugin exists?
Hi just wondering if anyone might know of a plugin that does what I'm looking for. The idea is that it would work similar to check_mysql_query, in that it checks the numeric value return by a query against warning and critical threshold values. But instead of querying mysql, it would just cat the contents of a text file. I.e., read the first line and first column of a text file, parse it as a number, and then compare it against the threshold values. It obviously wouldn't be too hard to write a plugin like this, but I figured I'd try to see if one existed in order to not re-invent the wheel. A search through Nagios Exchange didn't turn anything up. But I thought someone on the list might know of one, or have already written one themselves. Thanks, DR -- LogMeIn Rescue: Anywhere, Anytime Remote support for IT. Free Trial Remotely access PCs and mobile devices and provide instant support Improve your efficiency, and focus on delivering more value-add services Discover what IT Professionals Know. Rescue delivers http://p.sf.net/sfu/logmein_12329d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] java / websphere monitoring
On 06/28/2011 08:11 AM, Daniel Wittenberg wrote: > I've been playing with jolokia/jmx4perl for doing java monitoring and > it's proving painful on websphere so curious what others are doing > for java and/or websphere monitoring? > > Thanks! Dan check_jmx http://code.google.com/p/jmxquery/ DR -- All of the data generated in your IT infrastructure is seriously valuable. Why? It contains a definitive record of application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-d2d-c2 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Best practices for running Nagios across multiple data centers
Thanks much for the suggestions! I'll give Merlin and Multisite some thought. Neither one of them seems to be in the CentOS (or EPEL) repos though, which makes me a little reluctant. I also read up a bit on the Nagios NSCA option too, and it may not be as bad to configure as I thought, so I might wind up going with that. Will run some experiments later today. Thanks again for the pointers! DR On 06/23/2011 02:47 AM, Pipitone Alan (External) wrote: > Hi > > To set up distributed Nagios installations, you can try Merlin: > http://www.op5.org/community/plugin-inventory/op5-projects/merlin > > In your case, I think that You should only open (on your firewall) > the port that Merlin uses to contact other Nagios servers. > > I use it, is exceptional. > > Bye > > Alan Pipitone http://www.alan-pipitone.com > > -Original Message- From: Carlos de Santa-Ana Garcia > [mailto:car...@dsag.jazztel.es] Sent: mercoledì 22 giugno 2011 21:30 > To: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] > Best practices for running Nagios across multiple data centers > > Hi > > > Look at this: > > http://mathias-kettner.de/checkmk_multisite.html > > > Greets. > > El 22/06/2011 18:34, David Rosenstrauch escribió: >> I've had a lot of experience using Nagios in my previous job, where >> we used it to monitor numerous hosts and many different types of >> services - albeit all in the same data center. >> >> In my new job, however, I need to set up more sophisticated >> monitoring, since we have servers located in multiple data >> centers, with each data center containing a bunch of servers behind >> a firewall. What are some of the best approaches for Nagios to >> handle monitoring a setup like this? >> >> I'm assuming I'd need to set up multiple Nagios instances - one >> for each data center - with each one monitoring the servers behind >> that data center's firewall. What I'm wondering then, though, is >> how best to tie that all together. Would I need to run another, >> "main" nagios instance that takes in information from the others? >> If so, how best to feed the data from the 2nd tier instances into >> the main one? Passive checks sound like they could do the trick, >> but if I understand correctly, that would mean I'd need to define >> each service check on both the main instance (as a passive check) >> and on the 2nd tier instance (as an active check), which sounds >> like a bit of a configuration headache. >> >> Anyone have any good suggested reading on how to best configure a >> setup like this? Feel free to RTFM me, though I wasn't able to >> turn up much of use myself when searching on "nagios multiple data >> centers". >> >> Thanks, >> >> DR >> >> -- >> >> Simplify data backup and recovery for your virtual >> environment with vRanger. Installation's a snap, and flexible >> recovery options mean your data is safe, secure and there when you >> need it. Data protection magic? Nope - It's vRanger. Get your free >> trial download today. http://p.sf.net/sfu/quest-sfdev2dev >> ___ Nagios-users >> mailing list Nagios-users@lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users ::: >> Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. ::: Messages without supporting info will risk >> being sent to /dev/null >> > > > -- > > Simplify data backup and recovery for your virtual environment with vRanger. > Installation's a snap, and flexible recovery options mean your data > is safe, secure and there when you need it. Data protection magic? > Nope - It's vRanger. Get your free trial download today. > http://p.sf.net/sfu/quest-sfdev2dev > ___ Nagios-users mailing > list Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please > include Nagios version, plugin version (-v) and OS when reporting any > issue. ::: Messages without supporting info will risk being sent to > /dev/null > ** > > CONFIDENTIALITY NOTICE > This e-mail and any attachment are confidential and may be privileged > or otherwise protected from disclosure. It is solely intended for the > person(s) named above. If
[Nagios-users] Best practices for running Nagios across multiple data centers
I've had a lot of experience using Nagios in my previous job, where we used it to monitor numerous hosts and many different types of services - albeit all in the same data center. In my new job, however, I need to set up more sophisticated monitoring, since we have servers located in multiple data centers, with each data center containing a bunch of servers behind a firewall. What are some of the best approaches for Nagios to handle monitoring a setup like this? I'm assuming I'd need to set up multiple Nagios instances - one for each data center - with each one monitoring the servers behind that data center's firewall. What I'm wondering then, though, is how best to tie that all together. Would I need to run another, "main" nagios instance that takes in information from the others? If so, how best to feed the data from the 2nd tier instances into the main one? Passive checks sound like they could do the trick, but if I understand correctly, that would mean I'd need to define each service check on both the main instance (as a passive check) and on the 2nd tier instance (as an active check), which sounds like a bit of a configuration headache. Anyone have any good suggested reading on how to best configure a setup like this? Feel free to RTFM me, though I wasn't able to turn up much of use myself when searching on "nagios multiple data centers". Thanks, DR -- Simplify data backup and recovery for your virtual environment with vRanger. Installation's a snap, and flexible recovery options mean your data is safe, secure and there when you need it. Data protection magic? Nope - It's vRanger. Get your free trial download today. http://p.sf.net/sfu/quest-sfdev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] problem with check_url
On 10/27/2009 03:58 PM, Eddy Ruiz Azcuy wrote: > > Hi everyone, here's the problem, I have installed a monitoring server > running nagios3 in my company and we must monitor a webpage since the > clients work with it 24x7,but once I've downloaded the check_url.pl > Perl-written Nagios plugin and I have placed it in the > folder /usr/lib/nagios/plugin/ I run it from the command line > > perl /usr/lib/nagios/plugin/check_url.pl http://info..cu > > and it returns this. > > 200 .OK > > but when I try to run it from the plugin says: > > could,nt register the host info..cu here are the conf > files.Any help will be deeply apreciated,Thanks in advance. Why not use the check_http plugin that ships with nagios, instead of check_url.pl? DR -- Come build with us! The BlackBerry(R) Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9 - 12, 2009. Register now! http://p.sf.net/sfu/devconference ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Adding a link to error messages
On 10/07/2009 03:28 PM, Greg Lindstrom wrote: > Hello, > > I'm running Nagios on a Gentoo system and would like to know if I can > add a link to my custom monitors so users could click to our > department wiki for a description of the problem as well as how to > resolve them. That is, I'd like to have my custom module return a > link in the "error" text and have that link be "clickable" from the > Nagios screens. > > Thanks, > --greg We've done this. Here's how: 1) have your plugin output HTML 2) edit your /etc/nagios/cgi.cfg file, and set escape_html_tags=0 to prevent Nagios from stripping out the HTML that your plugin is generating HTH, DR -- Come build with us! The BlackBerry(R) Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9 - 12, 2009. Register now! http://p.sf.net/sfu/devconference ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How to make 1 exception for 1 host when mapping services to hostgroups?
On 09/08/2009 08:42 PM, Janet Sullivan wrote: One way to do this is use the custom host and service variables that Nagios 3.x provides in it's configuration DSL. We use this extensively. http://nagios.sourceforge.net/docs/3_0/customobjectvars.html If you use custom host variables in your service definitions, you > can then define thresholds for services at the host level, which means > you can use inheritance to have a common base set of thresholds but then override those as needed. >>> Hi Max. That's indeed an interesting approach. >>> Although I will initially go with the proposal of Thomas >>> Thanks for the tip, very inspiring. >> I've used Max's approach as well. Works nicely! > > How will using custom macros for services like this affect performance > and memory usage once it's used 10,000+ times? Does it scale well? Not sure I see the issue ... In my case, I define an "additional detail text" macro on our service checks. It's often hard to determine from the output of a service check exactly what the problem is, since the output is often very specific to the functionality of the service check. Including a plain English "additional detail" message on the alert emails helps make clear exactly what has failed and why. So the way I have it working is that by default "additional detail" is set to blank. But specific services can override this with a small string of text. I probably do this on 30-40% of the (~160) services I have defined. So that brings me back to I'm not sure I see what the performance issue is here. Even if I had 16,000 services defined, we're not talking about that much extra memory required to hold this text. Can you clarify what you see as the issue? Because if I'm doing something that could potentially cause problems, I'd like to understand that. Thanks, DR -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] How to make 1 exception for 1 host when mapping services to hostgroups?
On 09/08/2009 07:41 AM, Jelle Smet wrote: >> One way to do this is use the custom host and service variables that >> Nagios 3.x provides in it's configuration DSL. We use this >> extensively. >> >> http://nagios.sourceforge.net/docs/3_0/customobjectvars.html >> >> If you use custom host variables in your service definitions, you can >> then define thresholds for services at the host level, which means you >> can use inheritance to have a common base set of thresholds but then >> override those as needed. > > Hi Max. That's indeed an interesting approach. > Although I will initially go with the proposal of Thomas > Thanks for the tip, very inspiring. I've used Max's approach as well. Works nicely! DR -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Quick and easy way to monitor Nagios itself?
On 09/04/2009 02:50 PM, Jonathan Call wrote: > Since I have a large Nagios distributed system the possibility of a > Nagios process going AWOL on one of my many servers is a serious > concern. Has anyone come up with a sure way to confirm (i.e. a cron job) > that Nagios is processing checks properly? > > For example, I had one OCP_daemon process die, as a result the Nagios > process hung for quite some time before it was discovered. Freshness > checking is not an option because many hosts are behind firewalls or on > private networks and so the central server has active checks disabled > globally. > > Jonathan See the check_nagios plugin. One of the parms you can specify is to have it check how long ago it wrote something to its log file. We recently had a problem where our Nagios box went down and we never got any notifictions. So I set up another small Nagios instance whose sole purpose is to monitor the primary instance. It has just one service check: check_nrpe calling check_nagios on the main Nagios box to verify that it's up and running. HTH, DR -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Why are there "commands"?
On 08/28/2009 01:18 PM, David Dyer-Bennet wrote: > I don't really understand the purpose / utility of the "command" level of > abstraction in Nagios configuration. (2.10; we're still on Centos 4.7). > > To define a new service to check particular Windows web services we've > written, I define a service, and then it has to refer to a command, and > over in the command I have to hard-code the parameters needed to test this > specific service -- so in fact I need a separate command for each service. > This seems, to me, to just introduce confusion, and separate bits of > information that belong together. > > Is this just a historical artifact that in fact doesn't make much sense, > or are there lots of cases where it's useful and makes it easier or > clearer to do what you want? > > (I'm fine with "that's the way it works, but it doesn't really make much > sense as it turns out", I've got plenty of that in my own code; I'm just > looking for more understanding, in case it makes more sense than I've so > far figured out.) I can't speak to Nagios v2.x, but at least on Nagios 3.x the command definitions have variable, not hard-coded, parameters. So the command definition essentially winds up being the general definition of what you want to check (e.g., check that a web server is up, using the check_http plugin), while the service definition provides specific details of which machine(s) should be checked using the command and how. (e.g., check that the web servers on boxes 1, 2, and 3 are up, that page /news.html doesn't return a 404, and that the page contains the text "News".) > As a broader question, are there documents that give more of a logical > overview of Nagios, explaining how and why things are broken up and how > they work together? Again, I'm responding for Nagios 3.x, but the docs are here: http://nagios.sourceforge.net/docs/3_0/ Perhaps there's also docs on the site for v2.x. HTH, DR -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios Startup Failure
On 08/27/2009 10:27 AM, VINCENT BOB-DMHG67 wrote: > Hi, > > I am having a strange problem with my Nagios installation. > > Nagios has been running for about two years on our site. > > It was updated to 3.0.6 in January of this year. > > We just discovered that it stopped running and monitoring our servers. > > We restarted the system and it immediately fails. > > Hoping to resolve the issue, I updated the system to 3.2, but it too > fails within seconds of starting. > > I have been unable to find any log messages regarding the failure. > > Any suggestions or tips would be appreciated. > > Bob I'd guess the new version is barfing on something in your config file(s) that it no longer understands. Try verifying your config files: nagios -v e.g., nagios -v /etc/nagios/nagios.cfg That should give you some idea what the error is. HTH, DR -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Understanding Check Load
On 08/20/2009 12:33 PM, cr...@hooters-uk.com wrote: > Hi Group, I hope you can give me a better > understanding of the check_load command as I'm unsure > what my settings should be aligned to, if you take a > look at the below command line I get Warning and > Critical alerts all the time and I'm not sure what to > alter the figure to as I do not understand them at > the minute. > > command[check_load]=/usr/local/nagios/libexec/check_load > -w 15,10,5 -c 30,25,20 > > The above line give: WARNING - load average: 3.99, > 4.56, 7.60 > > Looking at the warning and given the fact that the > server guys says that his server is running OK, what > would be the ideal -w and -c be to use. > > I will try and read up on the above but just looking > for help now on this one server. > > Many thanks > > Craig It's not an exact science here - more of a black art. I found this page to be a good primer on the topic: -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Understanding Check Load
On 08/20/2009 12:33 PM, cr...@hooters-uk.com wrote: > Hi Group, I hope you can give me a better > understanding of the check_load command as I'm unsure > what my settings should be aligned to, if you take a > look at the below command line I get Warning and > Critical alerts all the time and I'm not sure what to > alter the figure to as I do not understand them at > the minute. > > command[check_load]=/usr/local/nagios/libexec/check_load > -w 15,10,5 -c 30,25,20 > > The above line give: WARNING - load average: 3.99, > 4.56, 7.60 > > Looking at the warning and given the fact that the > server guys says that his server is running OK, what > would be the ideal -w and -c be to use. > > I will try and read up on the above but just looking > for help now on this one server. > > Many thanks > > Craig It's not an exact science here - more of a black art. I found this page to be a good primer on the topic: http://hissohathair.blogspot.com/2008/07/tuning-nagios-load-checks.html DR -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] http_check & passwords
On 08/18/2009 09:06 AM, Marc Powell wrote: > On Aug 18, 2009, at 7:16 AM, Alex Huth wrote: > >> Hi! >> >> I have to implement websites, which have basic auth (develepmont & >> testing >> stges) Even when they are only for dev and testing, i don' like to >> put the >> password cleartext in the service.cfg to query them. >> >> Any other possibillity? > > You can put them in resource.cfg and use $USERx$ macros to reference > them. They'll still be visible in the process list as the plugin is > being executed. I think it's generally expected that your nagios host > is on a trusted network with trusted access... > > -- > Marc For more info: http://nagios.sourceforge.net/docs/3_0/config.html http://nagios.sourceforge.net/docs/3_0/configmain.html#resource_file HTH, DR -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] searching "howto use snmp"
On 08/14/2009 03:26 PM, irado furioso com tudo wrote: > hi, all > > I found (google) lots of crypt information on how to query remote hosts > about disk usage but none I can understand :( > > also, there are no search engine for previouos posts on the subject in > older lists, so: > > can anyone point me WHERE can I found a non-cryptographic information > on "how to get info on disk usage on remote hosts"? > > Nagios 2.12 on FreeBSD 7.2 > > TIA Easiest options are either: 1) run NRPE on the remote box, and have the main Nagios machine communicate with the remote NRPE daemon (using the check_nrpe plugin) to perform a disk usage check, or 2) same as #1, but using SSH instead of NRPE (using the check_by_ssh plugin) You can check the --help for both of those plugins for more info, and there's also a whole document about NRPE up at the Nagios web site. HTH, DR -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] HTML e-mail plugin
On 08/13/2009 06:40 PM, Marc Powell wrote: >> We have a need to embed HTML links into Nagios service alert messages, >> and it looks like this plugin might handle that. > > > Just to take a different tack... How are they special in that you > can't just include them in the default notifications? If you have a > modern mail client that is able to display HTML mail, it should > properly interpret links in text/plain messages as well. Just bringing > it up as it may be simpler. > > -- > Marc Because Nagios is stripping HTML characters out of the $SERVICEOUTPUT$ macro for security reasons (and rightly so, since the notify-service-by-email command is directly echoing its contents to a command line string). This is resulting in the HTML link that we've embedded in the $SERVICEOUTPUT$ from a custom plugin getting broken in our email alerts. I figured that by replacing the command line string with a call to the (slightly customized) HTML e-mail plugin I might be able to get the HTML links working again. DR -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] HTML e-mail plugin
Just wondering if anyone here has used the HTML e-mail plugin (http://www.monitoringexchange.org/cgi-bin/page.cgi?g=Detailed%2F2002.html;d=1) and can vouch for it. We have a need to embed HTML links into Nagios service alert messages, and it looks like this plugin might handle that. Thanks, DR -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Check_dns Error
On 08/12/2009 05:01 PM, Dei Bertine wrote: > Hi All, > > I got this error today but wasnt sure on what is this about. > > No changes again were made, it just decided to appear out of no where... > > DNS CRITICAL - '/usr/bin/nslookup -sil' msg parsing exited with no address > > Please kindly advise... > > Cheers! > DB Sounds like a DNS failure. i.e., exactly like it says - it tried to lookup a host in DNS and failed. (i.e., dns lookup "exited with no address") So sounds like your DNS went down briefly. DR -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NDOUtils mysql question?
On 08/07/2009 03:03 PM, Andrew Davis wrote: > Our Nagios server also has Cacti on it. I'm pretty sure that Cacti uses > an InnoDB database, while NDOUtils uses myISAM. The two are backed up > differently (example: mysqlhotcopy doesn't working on InnoDB databases). > Does anyone know what mysql command you run to determine which type of > DB is used for various databases, especially considering you can run > multiple types at once? I know I can use "mysqlshow" or the "show > databases;" options to show the databases themselves, but it doesn't > list the DB type. Using mysqlshow with a -t and a DB name shows the > table types, but again, not the database types. I'm sort of stuck on > this. I want to make sure I know what DB types I'm dealing with so I can > ensure I'm backing them up properly. Google searches aren't helping... > the results all go back to the "mysql versus innodb" debate... I think the engine type is set on a table by table basis, not database-wide. You can see this as follows: mysql> show create table mysql.time_zone\G *** 1. row *** Table: time_zone Create Table: CREATE TABLE `time_zone` ( `Time_zone_id` int(10) unsigned NOT NULL auto_increment, `Use_leap_seconds` enum('Y','N') NOT NULL default 'N', PRIMARY KEY (`Time_zone_id`) ) ENGINE=MyISAM DEFAULT CHARSET=utf8 COMMENT='Time zones' 1 row in set (0.00 sec) Note the "ENGINE=MyISAM" at the end. HTH, DR -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] recognizing string from HTTP checks?
Ray Kiddy wrote: > Why is this not recognizing that "www.domain2.com" is not returning > the string I want it to return? Even if the domain is being > redirected, it is being redirected wrong. At no point is that URL > returning the string I want it to see. So, "www.domain2.com" should be > red. But it is green. > > Any suggestions on how to make this work? > > thanx - ray Couple of suggestions: 1) use -f follow in your check_http call to follow redirects. (See check_http -help for details.) 2) I've found that HTTP servers using virtual hosting work a bit differently than other HTTP servers. i.e., GET /foo.html HTTP/1.0 vs. GET /foo.html HTTP/1.0 Host: myvirtualdomain.com So I use 2 different check_http calls for this: define command { command_namecheck_http command_line$USER1$/check_http -I $HOSTADDRESS$ -f follow $ARG1$ $ARG2$ $ARG3$ } define command { command_namecheck_http_virtual command_line$USER1$/check_http -H $HOSTADDRESS$ -f follow $ARG1$ $ARG2$ $ARG3$ } HTH, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Dynamically generated service checks
Marc Powell wrote: > On Jul 15, 2009, at 11:13 AM, David Rosenstrauch wrote: > >> IMO, the ideal solution here would be if I could just submit passive >> check results for services that aren't explicitly configured in >> Nagios. >> But alas, that's not allowed and it fails with messages like >> "Warning: >> Passive check result was received for service 'foo' on host >> 'mysql-dev', but the service could not be found!" > > That's correct but this would be a lot like submitting SNMP traps to > nagios. You might glance over that documentation to get ideas. You > could create a generic service, set it as passive and volatile and > then just start submitting passive results to it with descriptive/ > useful output. Each non-ok result would generate an alert so you > wouldn't miss anything but if you don't build in some protections, you > could repeat notify about the same problem. > > -- > Marc Hmmm ... Sounds like it might be what we'd want. I'm a bit unclear about how this would work in practice though. What I'd want is that when I go check the Nagios "Service Problems" page in the GUI, I'd want to see multiple lines for the service - one line for each of the tags/files that we're having problems with. But if this is configured as just a single (passive, volatile) service, then wouldn't the GUI show only one line for the service, containing details for only the most recent tag/file that failed the check? Thanks, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Dynamically generated service checks
Marc Powell wrote: > On Jul 15, 2009, at 10:19 AM, David Rosenstrauch wrote: > >> Matthew Jurgens wrote: > >> 2) If I understand correctly, Nagios loads its config files at startup >> time and does not re-read them after that. So if I'm regenerating a >> config file, then that means I'd need to restart the Nagios service >> afterwards, which is a bit of an onerous imposition. > > Using the init script, restart stops and starts the daemon, reload > sends a HUP signal to the running process to tell it to re-read it's > config files. The init script verifies config before doing either. You > could manually HUP the running process if you're sure the config files > are syntactically correct. > >> And although >> again, I could in theory do this in a cron job, I'm not sure I'm >> comfortable with that. There's the potential for the Nagios service >> to >> not start up again successfully, and I don't like taking the risk that >> this dynamic update procedure could potentially bring down the entire >> Nagios system. > > I've been doing this exact thing (nagios reload) hourly for several > years quite successfully. If you wanted to be paranoid about it, you > could script a run of '/path/to/nagios -v /path/to/nagios.cfg' and > only reload if that exits 0 else send yourself an e-mail with the bad > output. > > -- > Marc Thanks much for these pointers (and the quick response). Didn't know about the SIGHUP thing. Well, it's good to know that this is a viable option. Still, re-writing a config file seems a bit of a kludgey way to handle this. I'm mulling over another idea, which I'm calling a "rotating" service check (for lack of a better word). Basic idea is: I set up one single service (rather than a service for each tag/file), and each time the service runs it automatically "rotates" to check the next tag/file. If the check fails, then it stops rotating and continues checking that tag/file until the problem gets fixed. Still working out the details, though, and not sure if this is a viable solution for us or not. On the one hand, assuming we run the check every 5 minutes, then the service check would rotate through and thereby check all of the tags within some reasonable period of time. On the other hand, if it hits a tag that fails the check, we'd only get an alert on that particular tag and wouldn't know if other tags were failing the check too. IMO, the ideal solution here would be if I could just submit passive check results for services that aren't explicitly configured in Nagios. But alas, that's not allowed and it fails with messages like "Warning: Passive check result was received for service 'foo' on host 'mysql-dev', but the service could not be found!" I'll have to mull this over some more ... Thanks, -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Dynamically generated service checks
Matthew Jurgens wrote: > I do a similar thing where I dynamically change the service check > definitions configured into Nagios. You've almost got to this solution > but say "and couldn't just ". Not sure why you say that but I'll briefly > cover what I did. You will need some programming/scripting. > > First up in the nagios.cfg file configure either a specific > configuration file or directory where you will define your dynamic > service checks > eg cfg_dir=/etc/nagios/dynamic_services or > cfg_file=/etc/nagios/dynamic_services.cfg > > Using a file/directory will be dependent on exactly how you need to > generate your checks. > > Then at the point where you know what the service checks will be > generate the appropriate nagios config file based on what is in the > database into either your single config file or into one file each in > the dynamic directory. > > You will then need to reload the nagios config and make sure it syntaxes > ok. > > We do this quite sucessfully. When customers add/delete servers we just > regenerate the config to match and reload Nagios. Thanks much for the response Matthew. Very helpful! I understand what you're suggesting here. Sounds like a couple of potential issues with it though. By doing this via regenerating a config file, this means that: 1) We would need to manually kick off a "config file regenerate" process each time a user adds/changes a tag/file in our system. It would be much better if there were some way to have nagios automatically pick up the change without any manual intervention. (Though I suppose we could work around this by using a cron job that runs periodically, checks for updates, and then regenerates the config if needed.) 2) If I understand correctly, Nagios loads its config files at startup time and does not re-read them after that. So if I'm regenerating a config file, then that means I'd need to restart the Nagios service afterwards, which is a bit of an onerous imposition. And although again, I could in theory do this in a cron job, I'm not sure I'm comfortable with that. There's the potential for the Nagios service to not start up again successfully, and I don't like taking the risk that this dynamic update procedure could potentially bring down the entire Nagios system. H there's got to be *some* other way to accomplish this, no? Thanks, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Dynamically generated service checks
Scratching my head as to how it might be possible to accomplish a monitoring task I need to do: We have a database table of tags. For each tag we have a process that generates a Javascript file and pushes it out to a CDN. I've successfully created a service check to monitor that the process runs successfully (by using check_mysql_query to verify that a "pushed" date gets updated on the database table). But we also need to monitor that each of the files exists - and in the correct version - over at the CDN, and here's where I'm hitting a wall. What I'm envisioning is a way whereby I could dynamically generate a whole bunch of service checks - one for each tag in the database table. Each service check would then go and download its corresponding file from the CDN, parse it, and verify the version number. In this way, if any particular file was missing or the wrong version, we would get a Nagios alert on that tag/file. However, I don't see any clear way to approach this with Nagios. Near as I can tell there doesn't seem to be any way to dynamically generate service checks. From my reading of the docs, every service need to be explicitly listed in the config files beforehand. I was thinking that perhaps having a cron job that generates passive checks might be a solution here, but again it looks like I would still need to have the service checks defined beforehand in the nagios config files, and couldn't just generate them on-the-fly based on the contents of the database table. Anyone have any ideas on how I could accomplish this file monitoring in a dynamic fashion? (Note that alerting on a file-by-file basis is needed here; it would not be sufficient to have a single service check that monitors and alerts on all of the files.) TIA, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check ping explanation required
shadih rahman wrote: > David, >Thanks for your reply. I am ensuring one icmp packet by using -p option > in my command definition. Please advise on this. Thanks > > On Tue, Jul 14, 2009 at 2:40 PM, David Rosenstrauch wrote: >> The plugin issues 5 pings by default: $USER1$/check_ping -H $HOSTADDRESS$ t 45 -4 -w 3000.0,100% -c 3000.0,100% -p 1 Hmmm ... Well, assuming that your command definition got pasted correctly, it looks like there's a typo in your command. It should be "-t 45", not "t 45". So I'd guess that perhaps the plugin is getting messed up by the typo and never parsing the "-p 1" parm, and so still issuing 5 pings? Just a guess though. HTH, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] check ping explanation required
shadih rahman wrote: > Nagios uses check_ping for the host check of most hosts. > > The check is configured to send only one ping. > > So if Nagios receives a response, it is 0% packet loss. > > If no response, it is 100% packet loss. > How is it possible for Nagios to lose 85% of one ping? The plugin issues 5 pings by default: [r...@webapps nagios]# /usr/lib64/nagios/plugins/check_ping -help check_ping v1991 (nagios-plugins 1.4.13) Copyright (c) 1999 Ethan Galstad Copyright (c) 2000-2007 Nagios Plugin Development Team Use ping to check connection statistics for a remote host. Usage:check_ping -H -w ,% -c ,% [-p packets] [-t timeout] [-4|-6] Options: -h, --help Print detailed help screen -V, --version Print version information -4, --use-ipv4 Use IPv4 connection -6, --use-ipv6 Use IPv6 connection -H, --hostname=HOST host to ping -w, --warning=THRESHOLD warning threshold pair -c, --critical=THRESHOLD critical threshold pair -p, --packets=INTEGER number of ICMP ECHO packets to send (Default: 5) ... DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] service definitions & use
Natalie Aloi wrote: > does anyone have any documentation about the USE parameter listed below - I > have been unable to locate this option in the nagios documentation > > define service{ > name sensor-service > use generic-service > check_period 24x7 > max_check_attempts 4 > normal_check_interval 5 > retry_check_interval 1 > contact_groups sensor-admins > notification_options w,u,c,r > notification_interval 60 > notification_period 24x7 > register 0 > } It specifies a parent template to inherit from. See: http://nagios.sourceforge.net/docs/3_0/objectinheritance.html HTH, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Monitoring disk usage
Juki wrote: > Hello, > > I have installed nrpe v2.11 on a monitored host running Solaris 10. An > excerpt from nrpe.cfg on the monitored host is as follows; > > *command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c 15% -p / > > command[check_disk]=/usr/local/nagios/libexec/check_disk -w 30% -c 20% -p > /var/opt/BGw/Server1 -p /var/opt/mediation/ora > command[check_swap]= /usr/local/nagios/libexec/check_swap -a -w 10% -c 5% > command[check_procs]=/usr/local/nagios/libexec/check_procs -w 300 -c 400 > command[check_ssh]=/usr/local/nagios/libexec/check_ssh -4 -t 10 -p 22 > localhost > command[check_ftp]=/usr/local/nagios/libexec/check_ftp -t 5* > > My intention is to monitor disk usage on the partitions /var/opt/BGw/Server1 > and /var/opt/mediation/ora. This is not working yet. When I log into the > Nagios web interface on the monitoring server, I do not see any details > (alerts and warnings) to do with the above mentioned partitions. However, I > do see other details like ssh, processes, swap and even those for the / > partition. > > If I run on the monitored host, I get some info; > > *bash-3.00# /usr/local/nagios/libexec/check_disk -w 30% -c 20% -p > /var/opt/BGw/Server1 -p /var/opt/mediation/ora > DISK OK - free space: /var/opt/BGw/Server1 16937 MB (31% inode=94%); > /var/opt/mediation/ora 9738 MB (64% inode=99%);| > /var/opt/BGw/Server1=36502MB;37990;43417;0;54272 > /var/opt/mediation/ora=5323MB;10997;12568;0;15711 > * >>From my understanding, the output above shows that the check_disk plugin > actually works, however on logging into the web interface, there are no > details to this effect and also no alerts/warnings at all. How do I get to > see the details of these partitions in the web interface including warnings > and alerts. > > > Thanks in advance, > Juki Did you add a service on the main nagios host to call the nrpe command you just set up? i.e., define service { service_description Check remote disk space host_name the_remote_host check_command check_nrpe!check_disk } HTH, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios plugin for monitoring files in a directory?
Israel Brewster wrote: > How about using check_file_age on the directory itself? I think the > timestamp of the directory is updated whenever a new file is added. Or > are there other processes going in in the directory, such as deletions > or modifications, that might make this value unreliable? Thanks much for the suggestion. Hmmm ... in theory that'd work. Something about it makes me uncomfortable though. Just feels like monitoring a specific count of files will wind up being more precise and correct than the directory mod time. Anyway, I've been taking a look at the "check_file" plugin on Monitoring Exchange, and it looks like that might fit the bill. Thanks, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] shoot not sure what is happening - cant start nagios (account unavailable)
Natalie Aloi wrote: > All I had a sample version (out of the box) of nagios running, I made > my config changes with my devices, etc, verified configuration...now > nagios won't start! Here is my output any ideas? > [na...@dns bin]$ sudo service nagios > restart [sudo] password for root: Running configuration check...done. > Stopping nagios: done. Starting nagios:su: warning: cannot change > directory to /var/spool/nagios: No such file or directory This > account is currently not available. done. > [r...@dns > bin]# cd /var/spool [r...@dns spool]# ls -lat total 52 drwxrwx--- 2 > smmsp smmsp 4096 2009-07-10 05:00 clientmqueue drwxr-xr-x 13 root > root 4096 2009-07-08 13:52 . drwxrwxr-x 2 root mail 4096 2009-07-07 > 04:10 mail drwx-- 2 root mail 4096 2009-07-07 04:10 mqueue > drwxr-xr-x 2 root root 4096 2009-06-30 15:00 anacron drwxr-xr-x 20 > root root 4096 2009-06-30 14:37 .. drwx--x--- 3 root lp 4096 > 2009-06-30 14:35 cups drwx-- 3 daemon daemon 4096 2009-06-30 > 14:26 at drwxr-xr-x 2 root root 4096 2008-12-12 13:02 repackage > drwxr-xr-x 2 root root 4096 2008-04-07 17:44 lpd drwxr-x--- 2 squid > squid 4096 2008-04-03 07:30 squid drwxrwxrwt 2 root root 4096 > 2008-03-03 12:51 vbox drwx-- 2 root root 4096 2008-02-12 10:18 > cron Sounds like ... exactly what it says: /var/spool/nagios: No such file or directory There is no directory called /var/spool/nagios, which apparently something in your nagios.cfg file is pointing to. Fix would be to either create the directory (and make sure you have its permissions and ownership set correctly) or change your config file to point somewhere else. BTW, on our system (CentOS 5.3) there is no /var/spool/nagios, but rather a parent /var/nagios directory which contains a "spool" sub-dir. So maybe /var/spool/nagios is not the right dir name for your distro? Where did you get your nagios.cfg file from? It'd generally best, I'd think, to take the one that comes shipped with the nagios release you installed and then customize it to your needs so that you don't wind up using a config file from another distro that's incompatible with the layout of your distro's file system. (This is also why I'd think it's generally a good idea to install from a package for your distro, as opposed to building from source, as you indicated you did.) HTH, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] notification command errors
Natalie Aloi wrote: > I am getting this error: But I don't know where or how to define the service > notification or host notification - any body know where to point me? > > Checking contacts... > Error: Service notification command 'notify-by-email' specified for contact > 'grover' is not defined anywhere! > Error: Host notification command 'host-notify-by-email' specified for contact > 'grover' is not defined anywhere! > Error: Service notification command 'notify-by-email' specified for contact > 'nagiosadmin' is not defined anywhere! > Error: Host notification command 'host-notify-by-email' specified for contact > 'nagiosadmin' is not defined anywhere! > Error: Service notification command 'notify-by-email' specified for contact > 'nats_cell' is not defined anywhere! > Error: Host notification command 'host-notify-by-email' specified for contact > 'nats_cell' is not defined anywhere! In my nagios installation those are defined in /etc/nagios/objects/commands.cfg, and the /etc/nagios/nagios.cfg file contains a line "cfg_file=/etc/nagios/objects/commands.cfg" which causes nagios to load that file as part of your configuration. You've probably inadvertently omitted that reference from the config file. HTH, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Nagios plugin for monitoring files in a directory?
Just wondering if anyone might have some pointers on a monitoring puzzle I've run into. We have a couple of automated processes - one of them an automated FTP upload, the other an automated log file gatherer - which wind up causing essentially the same result: new files regularly getting added to a disk directory. I need to set up monitoring on this, since if we're not getting new files showing up then something has crashed and our system won't be getting updated properly. Not quite sure how to do this, though - i.e., which nagios plugin would help me get the job done. I've used the check_file_age plugin successfully in a couple of spots. E.g., to verify that a backup process has been run successfully I perform a unix "touch" on a "last-backup-date" file at the end of the backup, and then have the check_file_age plugin check the age of that file. I don't think I can use that plugin & technique here, however, since: a) Each uploaded file has a different (timestamped) name, which I don't know ahead of time. So I have no way to tell the check_file_age plugin which file to monitor. b) Unlike my backup process, which I wrote and was able to alter to touch the last-backup-date file, I don't have control over the external processes creating/uploading these files. I saw there were some rudimentary directory-monitoring plugins over at the Monitoring Exchange site, but they seem like they can only do things like monitor the number of files in a directory vs. a threshold minimum, which is not quite what I want. (I don't really care about how many files there are in the directory.) Anyone have any suggestions on an existing plugin (either part of the standard Nagios release, or available from Monitoring Exchange) that can help here? I'm not afraid to roll my own plugin if needed, but I'd obviously prefer to save myself some work and use an existing one if possible. Thanks, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] basic nagios/linux info needed
Natalie Aloi wrote: > Hello all, I am attempting to install nagios on a new server...I wanted to > use nag-3.0.6 but there is no rpm for it There's a nagios RPM available through the rpmforge repos: How to configure to use RPMforge ? http://dag.wieers.com/rpm/FAQ.php#B2 HTH, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitoring unreachable hosts
Frost, Mark {PBG} wrote: > >> -Original Message- >> From: David Rosenstrauch [mailto:dar...@darose.net] >> Sent: Wednesday, July 08, 2009 12:39 PM >> To: nagios-users@lists.sourceforge.net >> Subject: Re: [Nagios-users] monitoring unreachable hosts >> >> Frost, Mark {PBG} wrote: >>> This topic of unreachable hosts that's come up recently has got me >>> thinking about an issue we have. We have a few hosts that are behind >>> proxies and as such are impossible to ping. They have a single >> service >>> which we can check through the proxy successfully. >>> >>> I'm a little stuck on what to do with the host checks. >> Might a passive check be the solution here? I.e., instead of having > the >> nagios box do an active check and try to connect out to the external >> hosts, perhaps there's a way to have the external host connect in to > the >> nagios box periodically and say "I'm still here". >> >> HTH, >> >> DR > > In my case, this is a host that we have no administrative access to. > Imagine that you're monitoring yahoo.com :-) > > It just seems like while the host check is an integral part of the > process (in terms of the way that Nagios looks at things), it's also > completely pointless and actually gets in the way here. > > Mark Hmmm OK, well couldn't you just remove the check_command for those hosts? From the "define host" docs: "check_command: This directive is used to specify the short name of the command that should be used to check if the host is up or down. Typically, this command would try and ping the host to see if it is "alive". ... If you leave this argument blank, the host will not be actively checked. Thus, Nagios will likely always assume the host is up (it may show up as being in a "PENDING" state in the web interface). This is useful if you are monitoring printers or other devices that are frequently turned off. ..." DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] hostgroups causing hairloss
DAve wrote: > Everything looks fine, but when I start Nagios and look at View > Config->Object Type->Host Groups, I see this! > > Group NameDescription Host Members > CACHE_DNS Caching DNS service CDC-Dnscache , CDC-Ns1 , CDC-Ns1auth , > IDC-Ns2 , IDC-Ns2auth , SDC-Ns3 , SDC-Ns3auth > > The CACHE_DNS host group should only contain CDC-Ns1, IDC-Ns2, and > SDC-Ns3. Probably not a coincidence that CDC-Ns1auth starts with CDC-Ns1. Sounds like it's using CDC-Ns1 as a prefix to match hostnames, rather than as an actual hostname. Looks like there's config setting in 3.0 called use_regexp_matching which might control that. Perhaps in your old config file that was somehow turned on. (Or perhaps not defined at all, and on is the default.) Who knows what other old settings you're inheriting too. It's probably best for you to scrap your old config, and start fresh with a new 3.0 config file, then adopt it to your needs (i.e., point it to whatever custom definitions/definition-files you've created). HTH, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] monitoring unreachable hosts
Frost, Mark {PBG} wrote: > This topic of unreachable hosts that's come up recently has got me > thinking about an issue we have. We have a few hosts that are behind > proxies and as such are impossible to ping. They have a single service > which we can check through the proxy successfully. > > I'm a little stuck on what to do with the host checks. Might a passive check be the solution here? I.e., instead of having the nagios box do an active check and try to connect out to the external hosts, perhaps there's a way to have the external host connect in to the nagios box periodically and say "I'm still here". HTH, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] a special http check
shadih rahman wrote: > All, >I need to do a special kind of http check to see if our authentication > service is working or not. These are the sequence of event need to take > place. > > I need to hit the url https://myserver.com/login > then I need do additional query service=myservice > then I need to additinal query to validattion url > https://myserver.com/validate > here I will add additional query with my username and password. > then I want to check the http response and search for the string "yes" > > > Can this be done using check_http or do I need a custom check? Please > advise on this. Thanks I'm currently doing a login check using check_http, but it's a much simpler, single-step check. It basically just checks that we can successfully login with a special nagios username and password by checking for the presence of the word "Logout" on the resulting page. (That word only appears if the login is successful.) define service { service_description portal login use our-service servicegroups http-services host_name our-host check_command check_http!-S!-s Logout!-u '/portal/j_spring_security_check?j_username=$USER3$&j_password=$USER4$' _ADDL_DETAILThis service verifies that we can log in to the portal (via: if the login is successful then the string Logout will be found on the page) } Perhaps you might be to accomplish what you want using only check_http by breaking this up into several different service checks. i.e.: 1) check that you can log in successfully, 2) check that you can (login and) hit the service=myservice page, etc. HTH, DR -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/blackberry ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Event Handlers, start a java program, nohup issues.
Perhaps instead of using a java daemon to do JMX queries, you could use the check_jmx nagios plugin. (Available at the monitoring exchange site.) I've been using it in our Nagios system, and it's been working nicely (after some enhancements). I'm in the process of adopting and enhancing the code (neither of the previous 2 authors wanted to maintain it) and setting up a proper home for it on sourceforge (http://sourceforge.net/projects/nagioscheckjmx/). (Don't download it yet until I get the 1.0 release uploaded.) HTH, DR Marco Tirado wrote: > Hello Users: > > I have a problem with an event handler of mine. The handler starts a java > daemon-like program which loops forever waiting for connections and performs > JMX queries against our java applications. > > The problem is that the handler times out when it is run by nagios. This is > what I see in the logs: > > [01-07-2009 18:45:36] SERVICE EVENT HANDLER: > bj-mon-01;JMX_Server_Running;(null);(null);(null);start_jmx_server > [01-07-2009 18:46:07] Warning: Service event handler command > '/usr/local/nagios/libexec/eventhandlers/start_jmx_server CRITICAL SOFT 1' > timed out after 30 seconds > > The event handler should start my JMXServer both in hard and soft states. I > have run the command from the console as the "nagios" user and it works, so > the problem has nothing to do with user rights for nagios. > > The problem is that the handler hangs when I run "nohup" followed by my > command for starting the server (see the red text below). > > My event handler looks like this: > > ### > # PROPERTIES > ### > > PORT="" > ECHO_CMD="/bin/echo" > JAVA_CMD="/usr/bin/java" > CLASSPATH="MyClasspath" > JVM_OPTIONS="MyOptions" > > ### > > # What state is the JMXServer in? > case "$1" in > > OK) > ;; > > WARNING) > ;; > > UNKNOWN) > ;; > > CRITICAL) > > case "$2" in > > SOFT) > > `$ECHO_CMD "TRYING restart" >> /tmp/test` > nohup $JAVA_CMD -cp $CLASSPATH $JVM_OPTIONS JMXServer $PORT > &1 >> $LOG_FILE& > `$ECHO_CMD "TRYING restart" >> /tmp/test` > > ;; > > HARD) > > `$ECHO_CMD "TRYING restart" >> /tmp/test` > nohup $JAVA_CMD -cp $CLASSPATH $JVM_OPTIONS JMXServer $PORT > &1 >> $LOG_FILE& > `$ECHO_CMD "FINISHED trying" >> /tmp/test` > > ;; > > esac > > ;; > > esac > > exit 0 > > > > Any help, hint or recommendation is deeply appreciated. > > //Marco -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] RPM for Nagios 3.1.2 Fedora 9?
Natalie Aloi wrote: > Does anyone know of a location for rpm for Nagios 3.1.2 on fedora 9? > I located http://dag.wieers.com/rpm/packages/nagios/ but 2.11 is the > latest rpm > > thanks! Natalie If you use the rpmforge repositories and install it from there, it'll give you v3.0.6. How to configure to use RPMforge ? http://dag.wieers.com/rpm/FAQ.php#B2 HTH, DR -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null