Re: [Nagios-users] Notification period inheritance problem

2009-12-15 Thread Martin Melin
I don't understand why this is confusing.

You have a service template that defines a timeperiod. All services
that use that template will inherit the notification_period value from
the template, if they don't specify a notification_period themselves.

By setting "notification_period null" in your service definition, you
are explicitly overriding all forms of inheritance and setting the
notification_period to null, which happens to be a special
notification_period.

So, the issue you are running into is due to you expecting implied
inheritance to take precedence over values defined in the service
definition itself. This would break the documented and expected
behavior of inheritance, and a lot of people's configurations.

Now, I understand how you could run into this issue. But take a look
at the documentation again:

>>> Quoting from the documentation:
>>>
>>> The following table lists the object variables that will be implicitly 
>>> inherited from related objects if you don't explicitly specify their value 
>>> in your object definition or inherit them from a template

The implied inheritance variables will only be inherited from related
objects, if they are not specified in either the object definition or
template. In your case, notification_period is defined in your
template, so the implied inheritance is already overridden. You can't
bring that back by also explicitly defining a notification_period in
your service definition, even if that happens to be null, as null is
also a valid value for notification_period.

Something that definitely would work is for you to define another
service template without a notification_period value, and use that for
checks you want implied inheritance from.
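
A rough way to picture the lookup order described above is the following Python sketch. It is only a simplified model of the behavior discussed in this thread, not Nagios code. The resolve() helper and the template and timeperiod names (generic-service, no-period-service, 24x7, workhours) are made up for illustration.

```python
def resolve(service, templates, host):
    """Return the effective notification_period for a service dict.

    Order modeled here: the service definition, then its template chain
    (followed through "use"), and only if nothing specified the directive,
    the implied value from the host.
    """
    obj = service
    while obj is not None:
        if "notification_period" in obj:
            value = obj["notification_period"]
            # An explicit "null" counts as specified: the lookup stops here
            # and the host value is never consulted.
            return None if value == "null" else value
        obj = templates.get(obj.get("use"))
    return host.get("notification_period")


# Hypothetical configuration, expressed as plain dicts.
templates = {
    "generic-service": {"notification_period": "24x7"},
    "no-period-service": {},  # second template, directive left out
}
host = {"notification_period": "workhours"}

print(resolve({"use": "generic-service"}, templates, host))
print(resolve({"use": "generic-service", "notification_period": "null"}, templates, host))
print(resolve({"use": "no-period-service"}, templates, host))
```

In this model only the service that uses the template without a notification_period ends up with the host's value.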

Something that might work, but unfortunately I can't try it as I've
already spent too much time on this email :-), is to set
notification_period to null in the template, which could mean that the
service definition will behave as though the notification_period value
is unset in the template. This would allow you to let the "null
notification period" template inherit from your normal template.

Best regards,
Martin Melin

On Wed, Dec 16, 2009 at 12:26 AM, Gordon Messmer  wrote:
> On 12/15/2009 01:38 PM, Marc Powell wrote:
>> On Dec 15, 2009, at 2:04 PM, Gordon Messmer wrote:
>>
>>> Quoting from the documentation:
>>>
>>> The following table lists the object variables that will be implicitly 
>>> inherited from related objects if you don't explicitly specify their value 
>>> in your object definition or inherit them from a template
>>>
>>> ...so I need to not inherit the value from my template.  How do I do that?
>>>
>> Don't specify the directive in the template.
>>
>
> Which means that I have to expand the template where I null-ify the
> notification period with the values from any of the templates it
> "uses".  That seems like a minor inconvenience, but it's still uglier
> than it ought to be.  When I do so, I still get the warning from "nagios
> -v".  I'll test it to make sure that it's working.
>
>>> In some cases you may not want your host, service, or contact definitions 
>>> to inherit values of string variables from the templates they reference. If 
>>> this is the case, you can specify "null" (without quotes) as the value of 
>>> the variable that you do not want to inherit.
>>>
>> You're mixing sections of the documentation to try to make a connection that 
>> doesn't exist.
>
> If that's so, then at the very least the documentation should be updated
> to note that this won't work.
>
>> Again, specified with a value of 'null' does not equate to unspecified. 
>> Looking at the code, if you specify the value in any way, including a value 
>> of 'null', it breaks the implied inheritance.
>>
>> xodtemplate.h --
>>
>> #define XODTEMPLATE_NULL                  "null"
>>
>> xdata/xodtemplate.c --
>>
>>          else if(!strcmp(variable,"notification_period")){
>>              if(strcmp(value,XODTEMPLATE_NULL)){
>>                  if((temp_service->notification_period=(char 
>> *)strdup(value))==NULL)
>>                      result=ERROR;
>>                      }
>>              temp_service->have_notification_period=TRUE;
>>                  }
>>
>
> That sure looks like a bug to me.  Even the indentation looks weird to
> me, is that the normal style?  I can't imagine why "null" would be
> considered a valid notification period.  If
> have_notification_period=TRUE were in the preceding block, the
> documentation would make more sense, and implied inheritance would be a
> lot easier to use.  Do you think a patch to modify this behavior would
> be accepted?
>
> Thanks for the explanation.  I appreciate your help.

Re: [Nagios-users] Problem installing MRTG

2009-12-15 Thread Morris, Patrick
VIKRAM MS wrote:
> Hi,
>
>  1) I don't have anything plugged into port 1. So I cannot 
> monitor any port unless it is plugged into something.
>
>  2) Now I have this trouble while installing MRTG. I followed 
> the steps given in 
> http://oss.oetiker.ch/mrtg/doc/mrtg-unix-guide.en.html.
> Should I create any of those folders?
>
> [r...@localhost bin]# pwd
> /usr/local/src/mrtg-2.16.2/bin
> [r...@localhost bin]# ls
> cfgmaker  cfgmaker~  indexmaker  indexmaker~  mrtg  mrtg~  
> mrtg-traffic-sum  rateup  rateup.o
> [r...@localhost bin]# cfgmaker --global 'WorkDir: /home/httpd/mrtg' 
> --global 'Options[_]: bits,growright' --output /home/mrtg/cfg/mrtg.cfg 
> x...@12.345.67.890
> ERROR: creating /home/mrtg/cfg/mrtg.cfg: No such file or directory
> [r...@localhost bin]# 


Yes, if you're going to tell cfgmaker to write your config to a specific 
path with the --output option, the directory has to exist first.
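
As a minimal sketch of that step, assuming you want to script it rather than run mkdir -p by hand, the Python below creates the parent directory of the output file before cfgmaker is invoked. The ensure_parent_dir() helper is hypothetical and not part of MRTG.

```python
import os


def ensure_parent_dir(path):
    """Create the directory that will hold `path` if it is missing.

    Returns the parent directory so the caller can log or inspect it.
    """
    parent = os.path.dirname(path)
    if parent:
        # exist_ok=True makes this safe to call repeatedly.
        os.makedirs(parent, exist_ok=True)
    return parent


# Example call (not executed here), using the path from the error message:
# ensure_parent_dir("/home/mrtg/cfg/mrtg.cfg")
```

After the directory exists, the cfgmaker command from the original post can be run unchanged.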

And, again, this really isn't the right place to look for help on 
setting up MRTG.  You may want to try http://oss.oetiker.ch/mrtg/support/

--
This SF.Net email is sponsored by the Verizon Developer Community
Take advantage of Verizon's best-in-class app development support
A streamlined, 14 day to market process makes app distribution fast and easy
Join now and get one step closer to millions of Verizon customers
http://p.sf.net/sfu/verizon-dev2dev 
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] Problem installing MRTG

2009-12-15 Thread VIKRAM MS
Hi,

 1) I don't have anything plugged into port 1. So I cannot monitor any
port unless it is plugged into something.

 2) Now I have this trouble while installing MRTG. I followed the
steps given in http://oss.oetiker.ch/mrtg/doc/mrtg-unix-guide.en.html.
Should I create any of those folders?

[r...@localhost bin]# pwd
/usr/local/src/mrtg-2.16.2/bin
[r...@localhost bin]# ls
cfgmaker  cfgmaker~  indexmaker  indexmaker~  mrtg  mrtg~  mrtg-traffic-sum
rateup  rateup.o
[r...@localhost bin]# cfgmaker --global 'WorkDir: /home/httpd/mrtg' --global
'Options[_]: bits,growright' --output /home/mrtg/cfg/mrtg.cfg
x...@12.345.67.890
ERROR: creating /home/mrtg/cfg/mrtg.cfg: No such file or directory
[r...@localhost bin]#

Thanks
Viki


On Sun, Dec 13, 2009 at 7:44 PM, Greg Pangrazio  wrote:

> For the first error, do you have something plugged into port 1 on the
> switch and is the port in the Up/UP state?
>
>
> For the check_mrtg plugin you must have mrtg installed and graphing
> utilization before you can configure the alerts via nagios.  You
> should check out the MRTG site for more information on that.
>
>
> Greg Pangrazio
> pangr...@gmail.com
>
>
>
>
>
> On Sat, Dec 12, 2009 at 12:33 AM, VIKRAM MS  wrote:
> > Hello,
> >
> >  I am trying to monitor Cisco switches and routers. I am able to
> > ping and check the uptime of the switch.
> > 1) But for the port 1 link status, I am getting "SNMP CRITICAL -
> > *down(2)*" error.
> >
> > 2) For port 1 bandwidth usage, the error is "check_mrtgtraf: Unable to
> > open MRTG log file"
> > I have installed MRTG, but I am not able to go any further. Also
> > for the http://localhost/mrtg/10.111.100.102_43.html check, I am getting
> > the following error
> > 404 Not found
> > The requested URL /mrtg/10.111.100.102_43.html was not found on this
> > server.
> >
> > I have checked for assistance in
> > http://www.mail-archive.com/nagios-users@lists.sourceforge.net/ but in
> > vain.
> > I am using Nagios Core version 3.2.0.
> > Thanks in advance
> >
> > Regards
> > Vikram
> >

[Nagios-users] NRPE refused connect

2009-12-15 Thread shacky
Hi.

I installed the check_nrpe plugin on the Nagios server, and NRPE runs
as a daemon on the host I have to monitor. Both were compiled with
SSL support, and the dh.h file was created and saved in the include/
directory on the NRPE host.

[r...@monitored-host ~]# /opt/nrpe/bin/nrpe
NRPE - Nagios Remote Plugin Executor
Copyright (c) 1999-2008 Ethan Galstad (nag...@nagios.org)
Version: 2.12
Last Modified: 03-10-2008
License: GPL v2 with exemptions (-l for more info)
SSL/TLS Available: Anonymous DH Mode, OpenSSL 0.9.6 or higher required
TCP Wrappers Available

nagios-server:/# /usr/local/nagios/libexec/check_nrpe
Incorrect command line arguments supplied
NRPE Plugin for Nagios
Copyright (c) 1999-2008 Ethan Galstad (nag...@nagios.org)
Version: 2.12
Last Modified: 03-10-2008
License: GPL v2 with exemptions (-l for more info)
SSL/TLS Available: Anonymous DH Mode, OpenSSL 0.9.6 or higher required

The IP address of the Nagios server is specified in the
"allowed_hosts" declaration in the nrpe.cfg file:

[r...@monitored-host ~]# grep allowed_hosts /opt/nrpe/etc/nrpe.cfg
allowed_hosts=192.168.10.250

The problem is that if I try to execute the check_nrpe command to test
the NRPE daemon on the monitored host, I receive this error:

nagios-server:/# /usr/local/nagios/libexec/check_nrpe -H 192.168.10.18
CHECK_NRPE: Error - Could not complete SSL handshake.

On the monitored host I see that the IP address of the Nagios server
is allowed and then refused:

[r...@monitored-host ~]# tail /var/log/messages
Dec 16 01:24:27 monitored-host nrpe[25047]: INFO: SSL/TLS initialized. All network traffic will be encrypted.
Dec 16 01:24:27 monitored-host nrpe[25048]: Starting up daemon
Dec 16 01:24:27 monitored-host nrpe[25048]: Warning: Daemon is configured to accept command arguments from clients!
Dec 16 01:24:27 monitored-host nrpe[25048]: Listening for connections on port 5666
Dec 16 01:24:27 monitored-host nrpe[25048]: Allowing connections from: 192.168.10.250
Dec 16 01:27:01 monitored-host nrpe[25063]: refused connect from 192.168.10.250 (192.168.10.250)

What is the problem?
It is not a firewall problem because the connection is established, and it
does not seem to be an SSL-related problem because it fails even when
I run the check command with SSL disabled (-n) and the NRPE daemon
is also run with -n. It is also quite curious to see two contradictory log
messages.

Could you help me please? I spent the whole afternoon trying to get it
to work, without success.

Thank you very much!!
Bye.



Re: [Nagios-users] Notification period inheritance problem

2009-12-15 Thread Gordon Messmer
On 12/15/2009 01:38 PM, Marc Powell wrote:
> On Dec 15, 2009, at 2:04 PM, Gordon Messmer wrote:
>
>> Quoting from the documentation:
>>
>> The following table lists the object variables that will be implicitly 
>> inherited from related objects if you don't explicitly specify their value 
>> in your object definition or inherit them from a template
>>
>> ...so I need to not inherit the value from my template.  How do I do that?
>>  
> Don't specify the directive in the template.
>

Which means that I have to expand the template where I null-ify the 
notification period with the values from any of the templates it 
"uses".  That seems like a minor inconvenience, but it's still uglier 
than it ought to be.  When I do so, I still get the warning from "nagios 
-v".  I'll test it to make sure that it's working.

>> In some cases you may not want your host, service, or contact definitions to 
>> inherit values of string variables from the templates they reference. If 
>> this is the case, you can specify "null" (without quotes) as the value of 
>> the variable that you do not want to inherit.
>>  
> You're mixing sections of the documentation to try to make a connection that 
> doesn't exist.

If that's so, then at the very least the documentation should be updated 
to note that this won't work.

> Again, specified with a value of 'null' does not equate to unspecified. 
> Looking at the code, if you specify the value in any way, including a value 
> of 'null', it breaks the implied inheritance.
>
> xodtemplate.h --
>
> #define XODTEMPLATE_NULL  "null"
>
> xdata/xodtemplate.c --
>
>  else if(!strcmp(variable,"notification_period")){
>  if(strcmp(value,XODTEMPLATE_NULL)){
>  if((temp_service->notification_period=(char 
> *)strdup(value))==NULL)
>  result=ERROR;
>  }
>  temp_service->have_notification_period=TRUE;
>  }
>

That sure looks like a bug to me.  Even the indentation looks weird to 
me, is that the normal style?  I can't imagine why "null" would be 
considered a valid notification period.  If 
have_notification_period=TRUE were in the preceding block, the 
documentation would make more sense, and implied inheritance would be a 
lot easier to use.  Do you think a patch to modify this behavior would 
be accepted?

Thanks for the explanation.  I appreciate your help.




Re: [Nagios-users] Nagios 30 seconds initial delay

2009-12-15 Thread Max
On Tue, Dec 15, 2009 at 5:57 PM, Brandino Andreas  wrote:
> When I click a page I get the error "Error: Could not read host and
> service status information!" . After 30 seconds, all pages appear
> again!!
>
> - I don't have duplicated nagios service running
> - This delay didn't show up in older versions
> - My linux is Debian 5.0.3 (stable)
> - This error appears every time I restart nagios

We typically see about the same startup time, on both 3.0.3 and 3.2.0,
on servers with 1-2k hosts and ~10k services, where the retention.dat
files are weeks or months old and we use regular
expressions and service -> hostgroup mappings extensively.

- Max



[Nagios-users] Question - Scripting to stop monitor

2009-12-15 Thread Edwin Zoeller
Is it possible to stop a monitoring check for a certain time period and then 
restart it, and if so, can someone share a script that does this? Putting the 
service in scheduled downtime still lets the check run, and I don't want the 
monitored service to be accessed during that time.
 
Thanks,
 
Ed

[Nagios-users] Nagios 30 seconds initial delay

2009-12-15 Thread Brandino Andreas
Hi all,
I am using Nagios 3.2.0 (I just upgraded from an early 3.0 release).
Every time I restart Nagios I face a 30-second delay before the
various pages appear for the first time (not before the checks start;
30 seconds before the pages display!)

When I click a page I get the error "Error: Could not read host and
service status information!" . After 30 seconds, all pages appear
again!!

- I don't have duplicated nagios service running
- This delay didn't show up in older versions
- My linux is Debian 5.0.3 (stable)
- This error appears every time I restart nagios

Any idea what can cause this delay??
Thank you


<> ---  ---  --- <> 
Brandino Andreas
ampra...@gmail.com
<> ---  ---  --- <> 




[Nagios-users] Nagios 3.2 and max_concurrent_checks=0

2009-12-15 Thread Cary Petterborg
I'm doing some testing for migrating our installation to Nagios 3.2. The test 
server is running 3.2 on an 8 CPU box with 34,000 active service checks and 
3,000 active host checks. The initial configuration file had 
max_concurrent_checks=0, but latency was about 9,000 seconds. I changed it to 
max_concurrent_checks=200 and the latency went down to about 7,000 seconds. I 
then set it to 2,000 and the latency dropped to about 200 seconds. I currently 
have it set to 100,000 and latency has not changed from about 200 seconds.

From all the documentation I have seen, if max_concurrent_checks is set to 
zero, there should be no limit on the number of concurrent checks, but this 
doesn't appear to be the case. Is there some other part of the configuration 
that I'm missing which would make max_concurrent_checks=0 be limited instead 
of unlimited?


Cary Petterborg
ICS Monitoring
The Church of Jesus Christ of Latter-day Saints
Office Phone: 801-240-8267
Email:  petterbor...@ldschurch.org







Re: [Nagios-users] Notification period inheritance problem

2009-12-15 Thread Marc Powell

On Dec 15, 2009, at 2:04 PM, Gordon Messmer wrote:

> On 12/15/2009 10:49 AM, Marc Powell wrote:
>> 
>> No, in order to do that, the notification_period needs to be unspecified. 
>> null != unspecified. Because you're explicitly setting it to null, you break 
>> the implied inheritance.
>> 
>> To take advantage of the implied inheritance, you need to leave out the 
>> directive entirely as indicated above.
>>   
>> 
> 
> Quoting from the documentation:
> 
> The following table lists the object variables that will be implicitly 
> inherited from related objects if you don't explicitly specify their value in 
> your object definition or inherit them from a template
> 
> ...so I need to not inherit the value from my template.  How do I do that?

Don't specify the directive in the template.

> In some cases you may not want your host, service, or contact definitions to 
> inherit values of string variables from the templates they reference. If this 
> is the case, you can specify "null" (without quotes) as the value of the 
> variable that you do not want to inherit.

You're mixing sections of the documentation to try to make a connection that 
doesn't exist. Again, specified with a value of 'null' does not equate to 
unspecified. Looking at the code, if you specify the value in any way, 
including a value of 'null', it breaks the implied inheritance.

xodtemplate.h --

#define XODTEMPLATE_NULL  "null"

xdata/xodtemplate.c -- 

else if(!strcmp(variable,"notification_period")){
    if(strcmp(value,XODTEMPLATE_NULL)){
        if((temp_service->notification_period=(char *)strdup(value))==NULL)
            result=ERROR;
    }
    temp_service->have_notification_period=TRUE;
}

"If the variable is "notification_period" and the value is not the word "null", 
save a copy of the value (flagging an error if the copy fails). Whether the 
value is the word "null" or something else, set the service 
have_notification_period flag to TRUE."

Further down --

/* services inherit notification period from host if not already specified */
if(temp_service->have_notification_period==FALSE && 
   temp_host->have_notification_period==TRUE && 
   temp_host->notification_period!=NULL){
    temp_service->notification_period=(char *)strdup(temp_host->notification_period);
    temp_service->have_notification_period=TRUE;
}

"If the service 'have_notification_period' flag is not TRUE and the host has a 
notification period, use that for the service."

As you can see, you fail that first condition.

> If I don't specify a value, it'll be inherited.  Right?  How will that help 
> me not specify a value for the service?

If you don't specify the value in the template, there's nothing to inherit from 
the template. If the directive is also not specified in the service{} 
definition, the template engine will look to the value in the host{} definition.
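
The two snippets quoted above can be mimicked in a few lines of Python to show why an explicit "null" blocks the host value. This is a minimal sketch that only models the flag handling shown in those snippets; the Obj class, the function names, and the "workhours" timeperiod are hypothetical.

```python
XODTEMPLATE_NULL = "null"


class Obj:
    """Holds the two fields the quoted C code touches."""

    def __init__(self):
        self.notification_period = None
        self.have_notification_period = False


def parse_notification_period(obj, value):
    """Model of the parsing branch: any value, even "null", sets the flag."""
    if value != XODTEMPLATE_NULL:
        obj.notification_period = value
    obj.have_notification_period = True


def inherit_from_host(service, host):
    """Model of the implied-inheritance step for services."""
    if (not service.have_notification_period
            and host.have_notification_period
            and host.notification_period is not None):
        service.notification_period = host.notification_period
        service.have_notification_period = True


host = Obj()
parse_notification_period(host, "workhours")

unspecified = Obj()  # directive left out entirely
inherit_from_host(unspecified, host)

explicit_null = Obj()  # "notification_period null"
parse_notification_period(explicit_null, "null")
inherit_from_host(explicit_null, host)

print(unspecified.notification_period)    # workhours
print(explicit_null.notification_period)  # None
```

The service that never mentioned the directive picks up the host's period, while the one set to "null" keeps an empty period because its flag was already TRUE.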

--
Marc






Re: [Nagios-users] Notification period inheritance problem

2009-12-15 Thread Gordon Messmer

On 12/15/2009 10:49 AM, Marc Powell wrote:
> No, in order to do that, the notification_period needs to be unspecified.
> null != unspecified. Because you're explicitly setting it to null, you
> break the implied inheritance.
>
> To take advantage of the implied inheritance, you need to leave out the
> directive entirely as indicated above.


Quoting from the documentation:

   The following table lists the object variables that will be
   implicitly inherited from related objects if you don't explicitly
   specify their value in your object definition or inherit them from a
   template


...so I need to not inherit the value from my template.  How do I do that?

   In some cases you may not want your host, service, or contact
   definitions to inherit values of string variables from the templates
   they reference. If this is the case, you can specify "null"
   (without quotes) as the value of the variable that you do not want
   to inherit.


If I don't specify a value, it'll be inherited.  Right?  How will that 
help me not specify a value for the service?




Re: [Nagios-users] Notification period inheritance problem

2009-12-15 Thread Marc Powell

On Dec 15, 2009, at 11:33 AM, Gordon Messmer wrote:

> On 12/15/2009 12:52 AM, Martin Melin wrote:
>> Have you tried leaving out the notification_period line from the
>> service definition? I think, but don't have time to verify, that by
>> saying "notification_period null" you are actually overriding all
>> inherited values for notification_period and setting it to null
>> instead.
>> 
> 
> That's kind of the point.  I want the service to inherit its 
> notification period from the hosts it checks.  In order to do that, its 
> own notification period has to be null.

No, in order to do that, the notification_period needs to be unspecified. null 
!= unspecified. Because you're explicitly setting it to null, you break the 
implied inheritance.

To take advantage of the implied inheritance, you need to leave out the 
directive entirely as indicated above.

--
Marc




Re: [Nagios-users] Notification period inheritance problem

2009-12-15 Thread Gordon Messmer
On 12/15/2009 12:52 AM, Martin Melin wrote:
> Have you tried leaving out the notification_period line from the
> service definition? I think, but don't have time to verify, that by
> saying "notification_period null" you are actually overriding all
> inherited values for notification_period and setting it to null
> instead.
>

That's kind of the point.  I want the service to inherit its 
notification period from the hosts it checks.  In order to do that, its 
own notification period has to be null.




Re: [Nagios-users] Interface for control room

2009-12-15 Thread Romain Le Merlus
On Tue, Dec 15, 2009 at 4:34 PM, shacky  wrote:

> The complication I forgot to tell you in my first post is that I need
> to monitor at least three different Nagios servers.
> How can I solve this requirement?
>

You can administer your different Nagios pollers through the Centreon frontend.
Communication between them is based on SSH.
Each host definition is linked to a poller, so the configuration is dispatched
automatically.
-- 
Romain LE MERLUS
--
Return on Information:
Google Enterprise Search pays you back
Get the facts.
http://p.sf.net/sfu/google-dev2dev

Re: [Nagios-users] Interface for control room

2009-12-15 Thread Andreas Ericsson
On 12/15/2009 04:34 PM, shacky wrote:
>> I do this with NagVis. Quite simple and powerful. You can create new maps
>> and add your own iconset (big blinking animated gif!)
>> http://www.nagvis.org/
> 
> Thank you very much.
> I will try Centreon and Nagvis.
> 
> The complication I forgot to tell you in my first post is that I need
> to monitor at least three different Nagios servers.
> How can I solve this requirement?
> 

Get three different monitors and hook up with virtual desktops to one
computer.

-- 
Andreas Ericsson   andreas.erics...@op5.se
OP5 AB www.op5.se
Tel: +46 8-230225  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.



Re: [Nagios-users] Interface for control room

2009-12-15 Thread shacky
> I do this with NagVis. Quite simple and powerful. You can create new maps
> and add your own iconset (big blinking animated gif!)
> http://www.nagvis.org/

Thank you very much.
I will try Centreon and Nagvis.

The complication I forgot to tell you in my first post is that I need
to monitor at least three different Nagios servers.
How can I solve this requirement?



Re: [Nagios-users] benchmark question

2009-12-15 Thread Dario B. Bestetti (OpServices)


- "David Dumortier" wrote:

> Hi Patrick, hi all,
>
> Dario B. Bestetti (OpServices) wrote:
> >
> >
> > - "Patrick Morris" wrote:
> >
> >> shadih rahman wrote:
> >> > List,
> >> > Is there any information as to what is the maximum number of
> >> > services one can run on a single box with a single instance of Nagios?
> >> >
> >> > I did not find any concrete data on this.
> >> >
> >> > I am running a single instance of Nagios on a quad-core 2.5 GHz
> >> > machine with 4 GB of RAM.
> >> >
> >> > I have a total of 7359 service checks running on this box.  I also have
> >> > ndoutils running on the same box as backend.
> >> >
> >> > My total service checks are going to increase fivefold very soon with
> >> > a lot of nrpe checks.
> >> >
> >> > Now, my question is: should I run multiple instances of Nagios on the
> >> > same box, or will a single instance be able to handle about 3
> >> > service checks?
> >>
> >> In my experience, you'll run into issues right around 8,000 checks or
> >> so, depending on output verbosity, on a fairly stock setup. This is due
> >> to the size of the pipe used to temporarily store check results, at
> >> least on Linux (and you may be seeing it already with your number of
> >> checks). Around that point, even with quite a bit of tuning, the check
> >> results will fill the pipe in less than a second, which is the minimum
> >> amount of time I've been able to configure Nagios to flush it. When
> >> that happens, latencies go through the roof.
> >>
> >> Distributing the checks doesn't solve the problem if you're still
> >> sending the results to a centralized Nagios instance, since one
> >> machine still needs to process all of them.
> >>
> >> I'm working through this situation now, and it's looking like it may
> >> take a custom kernel with a larger pipe size to handle it.
> >>
> >>
> > Shadih, we have a customer running over 20.000 service checks on a
> > single box.
> >
> > The box is 2x Intel dual-core Xeon 3 GHz with 8 GB RAM. The latency is
> > around 0.7s. The checks are performed every 3 to 5 minutes. They have
> > around 40 simultaneous users.
> >
> > _
> > Dario B. Bestetti OpServices
>
> I had to install a bunch of Nagios instances for 40 polls. We ran some
> tests on a dual Xeon dual-core box with 4 GB of RAM. It seems the limit of
> Nagios 3 is around 25000 services per server, but we didn't use the web
> interface, and recording to the database is done with Perl scripts of our
> own. Taking 2 services as a limit seems to be the right thing to do,
> perhaps fewer, around 15000, with NDO.
> 
> Regards,
> -- 
> David Dumortier
> LINAGORA
> Service Management Monitoring
> 
> 
> 

I forgot to mention that we ran some stress tests at this customer with up to
30.000 services, and Nagios handled them very well. We also use our own broker
to store data in a MySQL database.

Regards,
Dario



Re: [Nagios-users] benchmark question

2009-12-15 Thread David Dumortier
Hi Patrick, hi all,

Dario B. Bestetti (OpServices) wrote:
>
>
> - "Patrick Morris" wrote:
>
>> shadih rahman wrote:
>> > List,
>> > Is there any information as to what is the maximum number of
>> > services one can run on a single box with a single instance of Nagios?
>> >
>> > I did not find any concrete data on this.
>> >
>> > I am running a single instance of Nagios on a quad-core 2.5 GHz
>> > machine with 4 GB of RAM.
>> >
>> > I have a total of 7359 service checks running on this box.  I also have
>> > ndoutils running on the same box as backend.
>> >
>> > My total service checks are going to increase fivefold very soon with
>> > a lot of nrpe checks.
>> >
>> > Now, my question is: should I run multiple instances of Nagios on the
>> > same box, or will a single instance be able to handle about 3
>> > service checks?
>>
>> In my experience, you'll run into issues right around 8,000 checks or
>> so, depending on output verbosity, on a fairly stock setup. This is due
>> to the size of the pipe used to temporarily store check results, at
>> least on Linux (and you may be seeing it already with your number of
>> checks). Around that point, even with quite a bit of tuning, the check
>> results will fill the pipe in less than a second, which is the minimum
>> amount of time I've been able to configure Nagios to flush it. When
>> that happens, latencies go through the roof.
>>
>> Distributing the checks doesn't solve the problem if you're still
>> sending the results to a centralized Nagios instance, since one
>> machine still needs to process all of them.
>>
>> I'm working through this situation now, and it's looking like it may
>> take a custom kernel with a larger pipe size to handle it.
>>
>>
> Shadih, we have a customer running over 20.000 service checks on a
> single box.
>
> The box is 2x Intel dual-core Xeon 3 GHz with 8 GB RAM. The latency is
> around 0.7s. The checks are performed every 3 to 5 minutes. They have
> around 40 simultaneous users.
>
> _
> Dario B. Bestetti OpServices

I had to install a bunch of Nagios instances for 40 polls. We ran some tests
on a dual Xeon dual-core box with 4 GB of RAM. It seems the limit of Nagios 3
is around 25000 services per server, but we didn't use the web interface, and
recording to the database is done with Perl scripts of our own. Taking 2
services as a limit seems to be the right thing to do, perhaps fewer, around
15000, with NDO.

Regards,
-- 
David Dumortier
LINAGORA
Service Management Monitoring





Re: [Nagios-users] Notification period inheritance problem

2009-12-15 Thread Martin Melin
Have you tried leaving out the notification_period line from the
service definition? I think, but don't have time to verify, that by
saying "notification_period null" you are actually overriding all
inherited values for notification_period and setting it to null
instead.
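
For illustration, here is a sketch of the service template with that line
dropped, reusing the names from the configuration quoted below. Whether the
host's period then applies is an assumption: it only holds if generic-service
(or any template above it) does not set notification_period itself, since a
value inherited from a template takes precedence over implied inheritance.

```
define service{
        use                             generic-service    ; assumed not to set notification_period itself
        name                            my-generic-service ; template name from the original post
        register                        0                  ; template only; notification_period deliberately omitted
        }
```

Running "nagios -v" against the changed configuration should show whether the
"no notification time period defined" warning is still reported.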

Best regards,
Martin Melin

On Tue, Dec 15, 2009 at 1:56 AM, Gordon Messmer  wrote:
> The documentation on "Implied Inheritance"[1] indicates that a service
> with no notification period will use the notification period of the
> associated host definition.  This doesn't seem to work when a service is
> applied to a hostgroup.
>
> In my configuration files, I define a generic service template of my
> own.  It has no notification period.  I also define a host template for
> each customer which defines the hostgroup and notification period which
> is appropriate for each host.  Finally, I define a service which will
> apply to all hosts within that hostgroup.
>
> When I run "nagios -v", I get warnings that no notification period is given:
> Warning: Service 'root partition' on host 'customer1-samba' has no
> notification time period defined!
>
> I've received some pages from hosts when checks timed out during a
> period when no notifications should be sent.
>
> Is this a bug, or am I making some mistake?  Shouldn't the service be
> inheriting the value from the host?  Where would I start looking if it
> were a bug and I wanted to fix it?
>
>
>
> define service{
>         use                             generic-service
>         name                            my-generic-service ; The 'name' of this service template
>         notification_period             null            ; Use the period from the host
>         register                        0
>         }
>
> define host{
>         name            customer1-linux-server
>         use             my-generic-server
>         hostgroups      my-linux-servers
>         notification_period     my-workhours
>         register        0
>         }
>
> define  service{
>         use                     my-generic-service
>         hostgroup_name          my-linux-servers
>         service_description     root partition
>         check_command           ssh_disk!20%!10%!/
>         }
>
>
> 1: http://nagios.sourceforge.net/docs/3_0/objectinheritance.html
>
