Re: [Nagios-users] What kind of checks/minute numbers are you getting for single host / non-distributed setups?

2009-08-29 Thread Max
Hi Ryan,

On Fri, Aug 28, 2009 at 11:31 PM, Ryan Bowlby wrote:
> Those are impressive numbers for a single Nagios instance. You may be able to 
> tweak out some additional time but you leave the Nagios daemon little room 
> for leeway. What I mean is if two dozen hosts start reporting critical and 
> Nagios starts performing checks at the more aggressive retry_check_interval 
> instead of the normal check_interval, then your check latency is going to go 
> through the roof.

Yes, we have had to balance that already and good point, right now the
majority of our service checks have a 5 normal / 3 retry with 2
retries which has been fine for our users.

> That being said here are some ideas that you may already be trying, but if 
> not may by you some time.
> - switch from check_ping to check_icmp as it's 9x faster in some instances.

We did that, and yes, definitely a noticable improvement there.

> - If any of the client-side nrpe checks are perl, python, etc you may see a 
> decrease in check-time by compiling them. Same for the Nagios server if you 
> aren't already (built-in perl, etc).

All of our checks are either C plugins or ePN-based perl checks :).

> - Often NRPE checks such as those monitoring hardware don't need to be 
> performed as often as say a check_tcp, but since people use templates NRPE 
> frequently gets configured with the same aggressive check_interval as other 
> checks. Scaling back on these will greatly increase the amount of checks the 
> server can do.

Our retry is pretty conservative (60% of normal), good point though,
should see if anyone has configured their checks to have a more
aggressive retry rate (we work on a self-service model where users of
our system can modify their own configs).

> At my work we have 4 remote Nagios instances performing approximately 9400+ 
> checks to our Central Nagios server via nsca. This leaves room for a 400% 
> increase in checks as more departments begin utilizing the monitoring system. 
> Our configs are built by a custom script from our custom dbase and pushed out 
> to the servers via a custom script that keeps everything in cvs. It all works 
> great but took forever to configure. If I had to do it again I would take a 
> serious look at two other options:

Very nice.

> http://dnx.sourceforge.net/ - Crap ton of checks ONE nagios instance!

We are going to try this first for our poller tier and keep
notifications, trending, and trap persistence on a separate tier.
Only concern here is that the remote pollers will not have ePN and I
know how much CPU and load will increase running thousands of checks a
minute in perl without ePN

We have been playing (at the whiteboard level) with the idea of a
persistent script execution proxy/daemon that would allow us to
develop scripts in VHLL languages and the scripts would then be called
with a check command like check_nrpe that just knows how to talk to
the persistent script daemon over a socket .. so, for example, we
could write something based on a JVM that embedded jruby and jython as
well, allowing developers to write checks in java, ruby, or python.
This would also eliminate the problem of having to always do full
restarts of Nagios when ePN is on .. we realize this would not be an
easy thing to implement, but it sounds theoretically very attractive
as a way to allow for decent check performance and abstract the
persistent language daemon outside of Nagios.

> http://www.opsview.org/ - Multiple Nagios instances without writing a slew of 
> custom scripts to do it!

That is definitely well known for performing well and it's stability :).

Thanks a lot for responding, Ryan, appreciate it!

- Max

--
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with 
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] check_http confusion / problem

2009-08-29 Thread David Dyer-Bennet
Jon Angliss wrote:
> On Fri, 28 Aug 2009 14:01:44 -0500, "David Dyer-Bennet"
>  wrote:
>
>   
>> [check_http questions]
>> 
>
> --help usually gives a whole bunch of extra information...
>
>   

Yes, that's where I got the information I had; it's what got me confused 
in the first place.  In particular, help shows a long-form as well as a 
short-form option for -e, and it was while discovering that the 
long-form option given in help doesn't appear to be recognized that I 
ended up with the "-e=400" syntax which is wrong (but which IS accepted).


>> [...@prcapp00 dev]$ /usr/lib/nagios/plugins/check_http
>> --IP-address=192.168.5.3 -p 8075 --no-body -f critical -v -v -v -e="Bad
>> Request"
>> GET / HTTP/1.0
>> User-Agent: check_http/1.99 (nagios-plugins 1.4.6)
>>
>>
>> http://192.168.5.3:8075/ is 168 characters
>> STATUS: HTTP/1.1 400 Bad Request
>>  HEADER 
>> Content-Type: text/html
>> Date: Fri, 28 Aug 2009 18:33:44 GMT
>> Connection: close
>> Content-Length: 39
>>  CONTENT 
>>  [[ skipped ]]
>> Invalid HTTP response received from host on port 8075
>> [...@prcapp00 dev]$ echo $?
>> 2
>> 
>
> Command syntax is incorrect.
>
> # ./check_http -I 192.168.5.3 -p 8075 --no-body -f critical -vvv -e \
>  "Bad Request"
>   

That was the base problem in some sense, thanks.


> I tried against one of my servers without any issues.  Albeit I got a
> critical failure because my server didn't return bad request.
>
> I do notice you're using an old version of the plugins package. 1.4.13
> is the current version, can you download and compile in a different
> directory and see if you still end upw with the same issue?
>   

This is a Centos 4.7 install, and I'm trying to stick to the packaged 
distributions that match each other, rather than going around upgrading 
things at random and hoping they work together.

-- 
David Dyer-Bennet, d...@dd-b.net; http://dd-b.net/
Snapshots: http://dd-b.net/dd-b/SnapshotAlbum/data/
Photos: http://dd-b.net/photography/gallery/
Dragaera: http://dragaera.info


--
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with 
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] hostgroup expansion and regex

2009-08-29 Thread Terry
On Sat, Aug 29, 2009 at 4:22 AM, Jim Avery wrote:
> 2009/8/29 Terry :
>> I am overlooking something simple.
>> Nagios Core 3.2.0
>>
>> use_regexp_matching=1
>> use_true_regexp_matching=0
>>
>> define hostgroup {
>>        hostgroup_name  allhosts
>>        members         *
>>
>> Error: Could not expand members specified in hostgroup (config file
>> '/etc/nagios/objects/hostgroups.cfg', starting on line 89)
>>
>> What in the world am I missing here?
>
>
> "*" is a wildcard, not a regular expression.  To use simple wildcards
> like this you need:
>
>  use_regexp_matching=0
>
>
> If you still want to use a regular expression though, the one you need
> which would be equivalent to a "*" wildcard is:
>
>  .*
>
> The "." means "match any character" and the "*" in a regular
> expression means "match zero or more repetitions of the previous
> element.".
>
>
> hth,
>
> Jim
>
> --

I tried .* with regex and true_regex but I clearly didnt try * without
enable_regex.   Thanks for helping!

--
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with 
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] Redundant Nagios setups...

2009-08-29 Thread Ciro Iriarte
2009/8/26 Janet Sullivan :
> Other than the methods described in the official docs
> (http://nagios.sourceforge.net/docs/3_0/redundancy.html) and Merlin
> (http://www.op5.org/community/projects/merlin) are there any other suggested
> solutions for setting up redundant Nagios servers?
>


For HA we use a Xen domU that can jump between 2 physical hosts...

Regards,

-- 
Ciro Iriarte
http://cyruspy.wordpress.com
--

--
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with 
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] hostgroup expansion and regex

2009-08-29 Thread Jim Avery
2009/8/29 Terry :
> I am overlooking something simple.
> Nagios Core 3.2.0
>
> use_regexp_matching=1
> use_true_regexp_matching=0
>
> define hostgroup {
>        hostgroup_name  allhosts
>        members         *
>
> Error: Could not expand members specified in hostgroup (config file
> '/etc/nagios/objects/hostgroups.cfg', starting on line 89)
>
> What in the world am I missing here?


"*" is a wildcard, not a regular expression.  To use simple wildcards
like this you need:

  use_regexp_matching=0


If you still want to use a regular expression though, the one you need
which would be equivalent to a "*" wildcard is:

  .*

The "." means "match any character" and the "*" in a regular
expression means "match zero or more repetitions of the previous
element.".


hth,

Jim

--
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with 
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null