Re: [Nagios-users] Monitor Cisco Uptime

2012-10-20 Thread Jörg Linge
https://github.com/mludvig/nagios-plugins/blob/268693c0afa2775c719584212433164cd846fe23/check_snmp_uptime.pl

Joerg

Am 20.10.2012 um 20:14 schrieb "Will Bashlor" :

> The thought in my mind for querying uptime via snmp had 2 purposes, 1.) to 
> (possibly) alert on no snmp response, and 2.) To display uptime in a list 
> format on Nagios for my network engineers/techs to view.
>  
> I like the idea of alerting on uptime < 1 day. But wouldn’t the reply need to 
> be in a numbered time format only? I guess my mind is in Cacti threshold 
> plugin mode. For example:
>  
> If the snmp respond replies with 4579200 in seconds (53 days), and I want to 
> alert on uptime less than 1 day then my alert value would be <86400, correct? 
> If this is true how can this be done when the response is:
>  
> “iso.3.6.1.2.1.1.3.0 = Timeticks: (447923619) 51 days, 20:13:56.19”
>  
> Or can I alert based on 447923619 only? It appears the 447923619 is seconds 
> with the last 2 digits being hundredths. Secondly, how can I simply display 
> “51 days, 20:13:56.19” In Nagios? I know these are basic questions, I have a 
> lot to learn and will continue to research every day.
>  
> We have ssh only enabled for obvious security reasons, so that would work as 
> well for monitoring device vitals.
>  
> I’m still interested in what other Network Managers as myself are using, 
> services wise to monitor critical core equipment vitals other than ping, if 
> anyone else would like to share.
>  
> Today we are using an older version of whatsup with a little over 1000 
> devices/services (includes CPE) with ping only. I’m looking to Nagios as a 
> possible future replacement. Love it so far, just a bit of a learning curve. 
> NagiosQL helps me a lot with config files.
>  
> Thanks for the response.
>  
> -Will
>  
> From: Fernando Feijo [mailto:ffe...@ffeijo.com] 
> Sent: Saturday, October 20, 2012 12:19 PM
> To: Nagios Users List
> Subject: Re: [Nagios-users] Monitor Cisco Uptime
>  
> >>>What other services do you recommend monitoring for up/down status besides 
> >>>ping?
> You could also do ssh or even telnet, if you have them enabled.
>  
> You did not ask, but a comment on uptime. You can use it to detect router 
> bounces, detecting rogue reloads or perhaps system crashes. I have it to go 
> to critical whenever the uptime is below X minutes, and email out. Network 
> tests such as ping, snmp or ssh availability might miss a quick box's trip 
> down, but uptime will always catch a reload.
>  
> Fernando
>  
> **
> On Oct 20, 2012, at 11:09 AM, Will Bashlor  wrote:
> 
> 
> Hi list,
>  
> Well I’m not sure which of the steps below worked but it started working in 
> Nagios and the check_snmp command works from cli. I’d still be interested in 
> a guide or the correct steps I should have taken. I’m on a test vm now but 
> will be reinstalling on a production vm later.
>  
> root@atcdeb01:~# /usr/lib/nagios/plugins/check_snmp -H x.x.x.x -C public -o 
> sysUpTime.0
> SNMP OK - Timeticks: (448261008) 51 days, 21:10:10.08 |
> root@atcdeb01:~#
>  
> Now I’ve got to figure out a couple more things. Feel free to respond if you 
> can help.
>  
> 1.)Even though ping and uptime show ok, the test Cisco host status went 
> to “PENDING” in Nagios…?
> 2.)Not sure how to get rid of part of the response I don’t need for 
> uptime. I only need “51 days, 21:10:10.08” under Status Information in Nagios
>  
> Another thing, I’m looking for advice. For monitoring up/down status’ for 
> core Cisco routers/switches my idea is to monitor snmp response and ping, but 
> I don’t want to get 2 alerts from 1 device if it were to go down.
>  
> So can I get ping or snmp alerts if they go down individually and only one 
> alert if the device is down hard? What other services do you recommend 
> monitoring for up/down status besides ping?
>  
> Or is ping sufficient enough in your opinion? I may configure text alerts 
> only on ping and maybe just email alerts for snmp, or no alerts for snmp. I 
> like seeing uptime in Nagios though, it makes it easy to see if any devices 
> have just rebooted after an event.
>  
> I continue to research every day but I am interested in your thoughts and 
> opinions if you’d like to share.
>  
> Oh, can I integrate Nagios with my Cacti? Sorry for all the questions but 
> will be very grateful for any responses.
>  
> Thanks all very much,
>  
> -Will
>  
> From: Will Bashlor [mailto:wbash...@atcnetworks.net] 
> Sent: Saturday, October 20, 2012 11:23 AM
> To: Nagios Users List
> Subject: [Nagios-users] Monitor Cisco Uptime
>  
> Hi list,
>  
> I’ve searched to no avail, this is a basic question with hopefully a simple 
> answer.
>  
> I’m a long time cacti user (windows), I know basic Linux knowledge, new to 
> Nagios. I have a fresh install of Debian stable (squeeze) with apt-get 
> installed Webmin, ssh, apache2, and Nagios 3.2.1, nagiosql, and whatever 
> dependencies. I am trying to setup monitoring of our core Cisco router

Re: [Nagios-users] check_http and other response codes

2011-10-13 Thread Jörg Linge

Am 13.10.2011 um 09:18 schrieb Morty:

> On Wed, Oct 12, 2011 at 11:16:17PM -0500, Frank Bulk wrote:
>> Isn't there some regex matching?
> 
> There is.  But it didn't help me in either case.  check_http
> apparently does an implicit test to make sure it gets a valid response
> code such as 200.  And the regex checking is in content, not headers
> or response code.  So check_http -H $host -S -r 401 still returns a
> warning with a server that requires auth, and check_http -H $host -S
> -j TRACE -r 405 still returns a warning on a server with TRACE
> disabled.
> 
> While reading a different thread on this mailing list, I found Mark
> Thomas's mention of "negate".  That actually did workaround my HTTP
> TRACE problem -- TRACE will cause check_http to return a warning when
> it's disabled and ok when it's enabled, so the following command
> definition will test for HTTP TRACE:
> 
> define command{
>command_namecheck_http_trace
>command_line$USER1$/negate -sw OK -o CRITICAL -c OK -- 
> $USER1$/check_http -j TRACE -f sticky -H $HOSTADDRESS$ -p $ARG1$ $ARG2$
>}
> 
> But IMHO, that's something of a hack.  And it doesn't deal with the 401 issue.

http://nagiosplugins.org/man/check_http

Option -e

-e, --expect=STRING
Comma-delimited list of strings, at least one of them is expected in
the first (status) line of the server response (default: HTTP/1.)
If specified skips all other status line logic (ex: 3xx, 4xx, 5xx 
processing)
--
All the data continuously generated in your IT infrastructure contains a
definitive record of customers, application performance, security
threats, fraudulent activity and more. Splunk takes this data and makes
sense of it. Business sense. IT sense. Common sense.
http://p.sf.net/sfu/splunk-d2d-oct
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Average Check latency and execution time growth - 3.2.3

2011-10-08 Thread Jörg Linge

Am 07.10.2011 um 20:52 schrieb Jim Avery:

> On 3 October 2011 04:36, Stuart Browne  
> wrote:
>> Hi,
>> 
>> I know this topic has been covered many times, but I've tried those tweaks 
>> and I have the remaining issue.
>> 
>> After a few days, the latency on checks explodes.  It goes along quite 
>> happily with small values, then after (about) 3 days, the values rise quite 
>> sharply.  I've recently been graphing performance statistics (nagiostats, 
>> mrtg) and as you can see by the two attachments (day, week), it's rather 
>> surprising.
> 
> 
> I'm sorry I can't shed much light on it, but I've seen the same
> behaviour myself, albeit on my system the service check latency
> wouldn't start increasing until after a week or two but you're right -
> the rate of increase when it starts is quite alarming.  I've recently
> culled a lot of checks from the system which has ameliorated the issue
> for the time being, but it would be good to get it fixed properly.


Increasing latency  is mostly an indicator of memory leaks.

The nagios core is well tested but the embedded perl interpreter in combination 
with badly written per plugin might cause a memory leak.
Also some eventbrokers module lead to memory leaks. For example mk_livestatus 
should never be used with environment macros enabled.

You have to monitor more then just the latency to get a feeling whats going on.

Joerg  
--
All of the data generated in your IT infrastructure is seriously valuable.
Why? It contains a definitive record of application performance, security
threats, fraudulent activity, and more. Splunk takes this data and makes
sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-d2dcopy2
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Question about notifications

2011-09-19 Thread Jörg Linge

Am 18.09.2011 um 15:08 schrieb Kirill Bychkov:

> Hello,
> 
> Nagios monitored host and services on this host. When host is down, can 
> notify me about only host? I.e. don't notify about service on this host?

Thats the normal behavior!  

I it dose´t work, your host is not marked as "DOWN/HARD" and you have to 
check/fix your host check_command.

Joerg
--
BlackBerry® DevCon Americas, Oct. 18-20, San Francisco, CA
Learn about the latest advances in developing for the 
BlackBerry® mobile platform with sessions, labs & more.
See new tools and technologies. Register for BlackBerry® DevCon today!
http://p.sf.net/sfu/rim-devcon-copy1 
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Perfdata not showing up

2011-09-14 Thread Jörg Linge

Am 14.09.2011 um 17:13 schrieb Jonathan Gazeley:

> Hi all,
> 
> I'm using nagios with pnp4nagios to graph my performance data. When a 
> plugin returns performance data, it automatically get graphed. However, 
> one of my plugins isn't getting graphed.
> 
> I'm aware that this is quite a large amount of perfdata being returned 
> (pool use for lots of pools on a DHCP server) but is there anything 
> wrong with this output?
> 
> [jg4461@dhcp1 ~]$ /usr/lib64/nagios/plugins/check_dhcpd_pools
> OK - all pools less than 80% full | resnet-wireless-652=0.983%;80%;90%, 
> resnet-wireless-653=1.377%;80%;90%, resnet-wireless-654=1.180%;80%;90%, 
> resnet-wireless-655=1.180%;80%;90%, resnet-wireless-656=0.983%;80%;90%, 
> resnet-wireless-657=0.000%;80%;90%, resnet-wireless-658=0.000%;80%;90%, 
> resnet-wireless-659=0.000%;80%;90%, 
> resnet-wireless-ratelimited-660=0.000%;80%;90%, 
> resnet-wireless-onlinepayment-661=0.098%;80%;90%, 
> resnet-wireless-onlinepayment-662=0.000%;80%;90%, 
> resnet-wireless-onlinepayment-663=0.000%;80%;90%, 
> resnet-wireless-consoles-665=0.000%;80%;90%, 
> resnet-wireless-message-666=0.000%;80%;90%, 
> resnet-wireless-instructions-667=0.367%;80%;90%
> 
> As I understand it, it should be in a format like:
> 
> title1=value1;warning1;critical1, title2=value2;warning2;critical2
> 
> I've tried using semicolons and commas to separate the different pieces 
> of data being returned, neither worked. This plugins works normally 
> aside from not graphing - Nagios shows the status and the return code.
> 
> Any ideas?
> 
> Cheers,
> Jonathan

Hi Jonathan,
the perfdata your plugin provides is invalid.

see -> http://docs.pnp4nagios.org/pnp-0.6/about#system_requirements
and -> http://nagiosplug.sourceforge.net/developer-guidelines.html#AEN201

Hint: 

8. value, min and max in class [-0-9.]. Must all be the same UOM

Joerg

--
BlackBerry® DevCon Americas, Oct. 18-20, San Francisco, CA
Learn about the latest advances in developing for the 
BlackBerry® mobile platform with sessions, labs & more.
See new tools and technologies. Register for BlackBerry® DevCon today!
http://p.sf.net/sfu/rim-devcon-copy1 
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Observing latency

2011-09-05 Thread Jörg Linge

Am 05.09.2011 um 13:53 schrieb Daniel Wittenberg:

> We also had to do some tuning with our pnp4nagios for latency.  I ended up 
> using a ramdisk for the perfdata folder (that it uses while processing data, 
> not the rrd's) and modified it to run with a nice of 20 to make sure that 
> Nagios always had enough priority to get its work done (which is already 
> running with a nice of -15) and sometimes pnp lags a little but it's still 
> acceptable.

OT: using the rrdcached might be a better solution.

Joerg
--
Special Offer -- Download ArcSight Logger for FREE!
Finally, a world-class log management solution at an even better 
price-free! And you'll get a free "Love Thy Logs" t-shirt when you
download Logger. Secure your free ArcSight Logger TODAY!
http://p.sf.net/sfu/arcsisghtdev2dev
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Off to the Nagios Conference

2006-09-19 Thread Jörg Linge
Am Dienstag, 19. September 2006 22:00 schrieb Thomas Slutyer:
> Hey lot!
>
> In about twelve hours I'll jump in my car and drive for eight hours to
> Nürnberg, Germania for the first annual Nagios Conference
> (http://www.nagiosconference.de). I know Ton and Ethan and a bunch of
> the other, more famous members of the community are going. But I've no
> clue who else will turn up.

Hi Thomas,
nice to meet you !


Jörg

-
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys -- and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null