Re: [Nagios-users] Notification period inheritance problem
I don't understand why this is confusing. You have a service template that defines a timeperiod. All services that use that template will inherit the notification_period value from the template, if they don't specify a notification_period themselves. By setting "notification_period null" in your service definition, you are explicitly overriding all forms of inheritance and setting the notification_period to null, which happens to be a special notification_period. So, the issue you are running into is due to you expecting implied inheritance to take precedence over values defined in the service definition itself. This would break the documented and expected behavior of inheritance, and a lot of people's configurations. Now, I understand how you could run into this issue. But take a look at the documentation again: >>> Quoting from the documentation: >>> >>> The following table lists the object variables that will be implicitly >>> inherited from related objects if you don't explicitly specify their value >>> in your object definition or inherit them from a template The implied inheritance variables will only be inherited from related objects, if they are not specified in either the object definition or template. In your case, notification_period is defined in your template, so the implied inheritance is already overridden. You can't bring that back by also explicitly defining a notification_period in your service definition, even if that happens to be null, as null is also a valid value for notification_period. Something that definitely would work is for you to define another service template without a notification_period value, and use that for checks you want implied inheritance from. Something that might work, but unfortunately I can't try it as I've already spent too much time on this email :-), is to set notification_period to null in the template, which could mean that the service definition will behave as though the notification_period value is unset in the template. This would allow you to let the "null notification period" template inherit from your normal template. Best regards, Martin Melin On Wed, Dec 16, 2009 at 12:26 AM, Gordon Messmer wrote: > On 12/15/2009 01:38 PM, Marc Powell wrote: >> On Dec 15, 2009, at 2:04 PM, Gordon Messmer wrote: >> >>> Quoting from the documentation: >>> >>> The following table lists the object variables that will be implicitly >>> inherited from related objects if you don't explicitly specify their value >>> in your object definition or inherit them from a template >>> >>> ...so I need to not inherit the value from my template. How do I do that? >>> >> Don't specify the directive in the template. >> > > Which means that I have to expand the template where I null-ify the > notification period with the values from any of the templates it > "uses". That seems like a minor inconvenience, but it's still uglier > than it ought to be. When I do so, I still get the warning from "nagios > -v". I'll test it to make sure that it's working. > >>> In some cases you may not want your host, service, or contact definitions >>> to inherit values of string variables from the templates they reference. If >>> this is the case, you can specify "null" (without quotes) as the value of >>> the variable that you do not want to inherit. >>> >> You're mixing sections of the documentation to try to make a connection that >> doesn't exist. > > If that's so, then at the very least the documentation should be updated > to note that this won't work. > >> Again, specified with a value of 'null' does not equate to unspecified. >> Looking at the code, if you specify the value in any way, including a value >> of 'null', it breaks the implied inheritance. >> >> xodtemplate.h -- >> >> #define XODTEMPLATE_NULL "null" >> >> xdata/xodtemplate.c -- >> >> else if(!strcmp(variable,"notification_period")){ >> if(strcmp(value,XODTEMPLATE_NULL)){ >> if((temp_service->notification_period=(char >> *)strdup(value))==NULL) >> result=ERROR; >> } >> temp_service->have_notification_period=TRUE; >> } >> > > That sure looks like a bug to me. Even the indentation looks weird to > me, is that the normal style? I can't imagine why "null" would be > considered a valid notification period. If > have_notification_period=TRUE were in the preceding block, the > documentation would make more sense, and implied inheritance would be a > lot easier to use. Do you think a patch to modify this behavior would > be accepted? > > Thanks for the explanation. I appreciate your help. > > > -- > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Joi
Re: [Nagios-users] Problem installing MRTG
VIKRAM MS wrote: > Hi, > > 1) I dont have anything plugged to port 1. So I cannot > monitor any port unless it is plugged to something. > > 2) Now I have this trouble while installing MRTG. I followed > the steps given in > http://oss.oetiker.ch/mrtg/doc/mrtg-unix-guide.en.html. > Should I create any of those folders? > > [r...@localhost bin]# pwd > /usr/local/src/mrtg-2.16.2/bin > [r...@localhost bin]# ls > cfgmaker cfgmaker~ indexmaker indexmaker~ mrtg mrtg~ > mrtg-traffic-sum rateup rateup.o > [r...@localhost bin]# cfgmaker --global 'WorkDir: /home/httpd/mrtg' > --global 'Options[_]: bits,growright' --output /home/mrtg/cfg/mrtg.cfg > x...@12.345.67.890 > ERROR: creating /home/mrtg/cfg/mrtg.cfg: No such file or directory > [r...@localhost bin]# Yes, if you're going to tell mrtg to place your config in a specific directory with the --output directive, that directory should exist. And, again, this really isn't the right place to look for help on setting up MRTG. You may want to try http://oss.oetiker.ch/mrtg/support/ -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Problem installing MRTG
Hi, 1) I dont have anything plugged to port 1. So I cannot monitor any port unless it is plugged to something. 2) Now I have this trouble while installing MRTG. I followed the steps given in http://oss.oetiker.ch/mrtg/doc/mrtg-unix-guide.en.html. Should I create any of those folders? [r...@localhost bin]# pwd /usr/local/src/mrtg-2.16.2/bin [r...@localhost bin]# ls cfgmaker cfgmaker~ indexmaker indexmaker~ mrtg mrtg~ mrtg-traffic-sum rateup rateup.o [r...@localhost bin]# cfgmaker --global 'WorkDir: /home/httpd/mrtg' --global 'Options[_]: bits,growright' --output /home/mrtg/cfg/mrtg.cfg x...@12.345.67.890 ERROR: creating /home/mrtg/cfg/mrtg.cfg: No such file or directory [r...@localhost bin]# Thanks Viki On Sun, Dec 13, 2009 at 7:44 PM, Greg Pangrazio wrote: > For the first error, do you have something plugged into port 1 on the > switch and is the port in the Up/UP state? > > > For the check_mrtg plugin you must have mrtg installed and graphing > utilization before you can configure the alterts via nagios. You > should check out the MRTG site for more information on that. > > > Greg Pangrazio > pangr...@gmail.com > > > > > > On Sat, Dec 12, 2009 at 12:33 AM, VIKRAM MS wrote: > > Hello, > > > > I am trying to monitor Cisco switches and routers. I am able > to > > ping and check the uptime of the switch. > > 1) But for the port 1 link status, I am getting "SNMP CRITICAL - > *down(2)*" > > error. > > > > 2) For port 1 bandwidth usage, the error is "check_mrtgtraf: Unable to > open > > MRTG log file" > > I have installed MRTG, but I am not able to go any further. > Also > > for the http://localhost/mrtg/10.111.100.102_43.html check, I am getting > the > > following error > > 404 Not found > > The requested URL /mrtg/10.111.100.102_43.html was not found on this > server. > > > > I have checked for assistance in > > http://www.mail-archive.com/nagios-users@lists.sourceforge.net/ but in > vain. > > I am using Nagios Core version 3.2.0. > > Thanks in advance > > > > Regards > > Vikram > > > > > > > -- > > Return on Information: > > Google Enterprise Search pays you back > > Get the facts. > > http://p.sf.net/sfu/google-dev2dev > > > > ___ > > Nagios-users mailing list > > Nagios-users@lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NRPE refused connect
Hi. I installed the check_nrpe plugin on the Nagios server and NRPE running as daemon on the host I have to monitor, both compiled with SSL support and the dh.h file created and saved in the include/ directory on the NRPE host. [r...@monitored-host ~]# /opt/nrpe/bin/nrpe NRPE - Nagios Remote Plugin Executor Copyright (c) 1999-2008 Ethan Galstad (nag...@nagios.org) Version: 2.12 Last Modified: 03-10-2008 License: GPL v2 with exemptions (-l for more info) SSL/TLS Available: Anonymous DH Mode, OpenSSL 0.9.6 or higher required TCP Wrappers Available nagios-server:/# /usr/local/nagios/libexec/check_nrpe Incorrect command line arguments supplied NRPE Plugin for Nagios Copyright (c) 1999-2008 Ethan Galstad (nag...@nagios.org) Version: 2.12 Last Modified: 03-10-2008 License: GPL v2 with exemptions (-l for more info) SSL/TLS Available: Anonymous DH Mode, OpenSSL 0.9.6 or higher required The IP address of the Nagios server is specified in the "allowed_hosts" declaration in the nrpe.cfg file: [r...@monitored-host ~]# grep allowed_hosts /opt/nrpe/etc/nrpe.cfg allowed_hosts=192.168.10.250 The problem is that if I try to execute the check_nrpe command to test the NRPE daemon on the monitored host, I receive this error: nagios-server:/# /usr/local/nagios/libexec/check_nrpe -H 192.168.10.18 CHECK_NRPE: Error - Could not complete SSL handshake. On the monitored host I see that the IP address of the Nagios server is allowed and then refused: [r...@monitored-host ~]# tail /var/log/messages Dec 16 01:24:27 monitored-host nrpe[25047]: INFO: SSL/TLS initialized. All network traffic will be encrypted. Dec 16 01:24:27 monitored-host nrpe[25048]: Starting up daemon Dec 16 01:24:27 monitored-host nrpe[25048]: Warning: Daemon is configured to accept command arguments from clients! Dec 16 01:24:27 monitored-host nrpe[25048]: Listening for connections on port 5666 Dec 16 01:24:27 monitored-host nrpe[25048]: Allowing connections from: 192.168.10.250 Dec 16 01:27:01 monitored-host nrpe[25063]: refused connect from 192.168.10.250 (192.168.10.250) What is the problem? It is not a firewall problem because the connection works, and it does not seems to be a SSL related problem because it does not work even if I try the check command disabling SSL with -n (and the NRPE daemon runned with -n also), and it is quite curious to have two opposite log messages. Could you help me please? I worked all the afternoon trying to let it work, but it does not work... Thank you very much!! Bye. -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Notification period inheritance problem
On 12/15/2009 01:38 PM, Marc Powell wrote: > On Dec 15, 2009, at 2:04 PM, Gordon Messmer wrote: > >> Quoting from the documentation: >> >> The following table lists the object variables that will be implicitly >> inherited from related objects if you don't explicitly specify their value >> in your object definition or inherit them from a template >> >> ...so I need to not inherit the value from my template. How do I do that? >> > Don't specify the directive in the template. > Which means that I have to expand the template where I null-ify the notification period with the values from any of the templates it "uses". That seems like a minor inconvenience, but it's still uglier than it ought to be. When I do so, I still get the warning from "nagios -v". I'll test it to make sure that it's working. >> In some cases you may not want your host, service, or contact definitions to >> inherit values of string variables from the templates they reference. If >> this is the case, you can specify "null" (without quotes) as the value of >> the variable that you do not want to inherit. >> > You're mixing sections of the documentation to try to make a connection that > doesn't exist. If that's so, then at the very least the documentation should be updated to note that this won't work. > Again, specified with a value of 'null' does not equate to unspecified. > Looking at the code, if you specify the value in any way, including a value > of 'null', it breaks the implied inheritance. > > xodtemplate.h -- > > #define XODTEMPLATE_NULL "null" > > xdata/xodtemplate.c -- > > else if(!strcmp(variable,"notification_period")){ > if(strcmp(value,XODTEMPLATE_NULL)){ > if((temp_service->notification_period=(char > *)strdup(value))==NULL) > result=ERROR; > } > temp_service->have_notification_period=TRUE; > } > That sure looks like a bug to me. Even the indentation looks weird to me, is that the normal style? I can't imagine why "null" would be considered a valid notification period. If have_notification_period=TRUE were in the preceding block, the documentation would make more sense, and implied inheritance would be a lot easier to use. Do you think a patch to modify this behavior would be accepted? Thanks for the explanation. I appreciate your help. -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios 30 seconds initial delay
On Tue, Dec 15, 2009 at 5:57 PM, Brandino Andreas wrote: > When I click a page I get the error "Error: Could not read host and > service status information!" . After 30 seconds, all pages appear > again!! > > - I don't have duplicated nagios service running > - This delay didn't show up to older versions > - My linux is Debian 5.0.3 (stable) > - This error appears every time I restart nagios We typically see about the same start up time, both on 3.0.3 and 3.2.0 on hosts with 1-2k hosts and ~10k services where we have retention.dat files that are weeks or months old and we are using regular expressions and service -> hostgroup mappings extensively. - Max -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Question - Scripting to stop monitor
Is it possible and if so can someone share with a script that will stop a monitor script for a certain time period then restart. Just putting the service in scheduled downtime still lets runs and I don't want to access the service being monitored. Thanks, Ed -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Nagios 30 seconds initial delay
Hi all, I am using Nagios 3.2.0 (I just upgrade from early 3.0 releases) Every time I restart nagios I face a 30 seconds delay before the various pages appear for first time (not before starting checks, 30 seconds before displaying pages!!!) When I click a page I get the error "Error: Could not read host and service status information!" . After 30 seconds, all pages appear again!! - I don't have duplicated nagios service running - This delay didn't show up to older versions - My linux is Debian 5.0.3 (stable) - This error appears every time I restart nagios Any idea what can cause this delay?? Thank you <> --- --- --- <> Brandino Andreas ampra...@gmail.com <> --- --- --- <> -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Nagios 3.2 and max_concurrent_checks=0
I'm doing some testing for migrating our installation to Nagios3.2. The test server is running 3.2 on an 8 CPU box with 34,000 active service checks and 3,000 active host checks. The initial configuration file had max_concurrent_checks=0, but latency was about 9,000 seconds. I changed it to max_concurrent_checks=200 and the latency went down to about 7,000 seconds. I then set it to 2,000 and the latency dropped to about 200 seconds. I currently have it set to 100,000 and latency has not changed from about 200 seconds. >From all the documentation I have seen, if max_concurrent_checks is set to >zero, there should be no limit on the number of concurrent checks, but this >doesn't appear to be the case. Is there some other part of the configuration >that I'm missing which would make max_concurrent_checks=0 be limited instead >of unlimited? Cary Petterborg ICS Monitoring The Church of Jesus Christ of Latter-day Saints Office Phone: 801-240-8267 Email: petterbor...@ldschurch.org NOTICE: This email message is for the sole use of the intended recipient(s) and may contain confidential and privileged information. Any unauthorized review, use, disclosure or distribution is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy all copies of the original message. -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Notification period inheritance problem
On Dec 15, 2009, at 2:04 PM, Gordon Messmer wrote: > On 12/15/2009 10:49 AM, Marc Powell wrote: >> >> No, in order to do that, the notification_period needs to be unspecified. >> null != unspecified. Because you're explicitly setting it to null, you break >> the implied inheritance. >> >> To take advantage of the implied inheritance, you need to leave out the >> directive entirely as indicated above. >> >> > > Quoting from the documentation: > > The following table lists the object variables that will be implicitly > inherited from related objects if you don't explicitly specify their value in > your object definition or inherit them from a template > > ...so I need to not inherit the value from my template. How do I do that? Don't specify the directive in the template. > In some cases you may not want your host, service, or contact definitions to > inherit values of string variables from the templates they reference. If this > is the case, you can specify "null" (without quotes) as the value of the > variable that you do not want to inherit. You're mixing sections of the documentation to try to make a connection that doesn't exist. Again, specified with a value of 'null' does not equate to unspecified. Looking at the code, if you specify the value in any way, including a value of 'null', it breaks the implied inheritance. xodtemplate.h -- #define XODTEMPLATE_NULL "null" xdata/xodtemplate.c -- else if(!strcmp(variable,"notification_period")){ if(strcmp(value,XODTEMPLATE_NULL)){ if((temp_service->notification_period=(char *)strdup(value))==NULL) result=ERROR; } temp_service->have_notification_period=TRUE; } 'If the variable is "notification_period" and the value is not the word "null", save the value unless the value is blank (error otherwise). If the value is the word "null" or something else, set the service have_notification_period flag to TRUE." Further down -- /* services inherit notification period from host if not already specified */ if(temp_service->have_notification_period==FALSE && temp_host->have_notification_period==TRUE && temp_host->notification_period!=NULL){ temp_service->notification_period=(char *)strdup(temp_host->notification_period); temp_service->have_notification_period=TRUE; } "If the service 'have_notification_period' flag is not TRUE and the host and the host has a notification period, use that for the service." As you can see, you fail that first condition. > If I don't specify a value, it'll be inherited. Right? How will that help > me not specify a value for the service? If you don't specify the value in the template, there's nothing to inherit from the template. If the directive is also not specified in the service{} definition, the template engine will look to the value in the host{} definition. -- Marc -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Notification period inheritance problem
On 12/15/2009 10:49 AM, Marc Powell wrote: No, in order to do that, the notification_period needs to be unspecified. null != unspecified. Because you're explicitly setting it to null, you break the implied inheritance. To take advantage of the implied inheritance, you need to leave out the directive entirely as indicated above. Quoting from the documentation: The following table lists the object variables that will be implicitly inherited from related objects if you don't explicitly specify their value in your object definition or inherit them from a template ...so I need to not inherit the value from my template. How do I do that? In some cases you may not want your host, service, or contact definitions to inherit values of string variables from the templates they reference. If this is the case, you can specify "*null*" (without quotes) as the value of the variable that you do not want to inherit. If I don't specify a value, it'll be inherited. Right? How will that help me not specify a value for the service? -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Notification period inheritance problem
On Dec 15, 2009, at 11:33 AM, Gordon Messmer wrote: > On 12/15/2009 12:52 AM, Martin Melin wrote: >> Have you tried leaving out the notification_period line from the >> service definition? I think, but don't have time to verify, that by >> saying "notification_period null" you are actually overriding all >> inherited values for notification_period and setting it to null >> instead. >> > > That's kind of the point. I want the service to inherit its > notification period from the hosts it checks. In order to do that, its > own notification period has to be null. No, in order to do that, the notification_period needs to be unspecified. null != unspecified. Because you're explicitly setting it to null, you break the implied inheritance. To take advantage of the implied inheritance, you need to leave out the directive entirely as indicated above. -- Marc -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Notification period inheritance problem
On 12/15/2009 12:52 AM, Martin Melin wrote: > Have you tried leaving out the notification_period line from the > service definition? I think, but don't have time to verify, that by > saying "notification_period null" you are actually overriding all > inherited values for notification_period and setting it to null > instead. > That's kind of the point. I want the service to inherit its notification period from the hosts it checks. In order to do that, its own notification period has to be null. -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Interface for control room
On Tue, Dec 15, 2009 at 4:34 PM, shacky wrote: > The complication I forgot to tell you in my first post is that I need > to monitor at least three different Nagios servers. > How can I solve this requirement? > You can administrate your different Nagios poller through Centreon frontend. Communication between them is based on SSH. Each host definition is link to a poller, so the configuration dispatch is automatic. -- Romain LE MERLUS -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Interface for control room
On 12/15/2009 04:34 PM, shacky wrote: >> I do this with NagVis. Quite simple and powerfull. You can create new maps >> and add your own iconset (big blinking animated gif!) >> http://www.nagvis.org/ > > Thank you very much. > I will try Centreon and Nagvis. > > The complication I forgot to tell you in my first post is that I need > to monitor at least three different Nagios servers. > How can I solve this requirement? > Get three different monitors and hook up with virtual desktops to one computer. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Interface for control room
> I do this with NagVis. Quite simple and powerfull. You can create new maps > and add your own iconset (big blinking animated gif!) > http://www.nagvis.org/ Thank you very much. I will try Centreon and Nagvis. The complication I forgot to tell you in my first post is that I need to monitor at least three different Nagios servers. How can I solve this requirement? -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] benchmark question
- "David Dumortier" escreveu: > Hi Patrick, hi all, > > Dario B. Bestetti (OpServices) a écrit : > > > > > > - "Patrick Morris" escreveu: > > > >> shadih rahman wrote: > >> > List, > >> >Is there any information as to what is the maximum number of > >> > services one can run on a single box with single instance of > Nagios? > >> > >> > I did not find any concrete data on this. > >> > > >> > I am running a single instance of nagios on a quad core 2.5 GHZ > >> > machine with with 4 Gigs of RAM. > >> > > >> > I have total of 7359 service check running on this box. I have > also > >> > >> > ndoutils running on the same box as backend. > >> > > >> > My total service checks is going to five fold very soon with a > lot > >> of > >> > nrpe checks. > >> > > >> > Now, my question is should I run multiple instance of nagios on > the > >> > >> > same box or a single instance will be able to handle about 3 > >> > service checks? > >> > >> In my experience, you'll run into issues right around 8,000 checks > or > >> > >> so, depending no output verbosity, on a fairly stock setup. This > is > >> due > >> to the size of the pipe used to temporarily store check results, > at > >> least on Linux (and you may be seeing it already with your number > of > >> checks). Around that point, even with quite a bit of tuning, the > check > >> > >> results will fill the pipe in less than a second, which is the > minimum > >> > >> amount of time I've been able to configure Nagios to flush it. > When > >> that > >> happens, latencies go through the roof. > >> > >> Distributing the checks doesn't solve the problem if you're still > >> sending the results to a centralized Nagios instance, since one > >> machine > >> still needs to process all of them. > >> > >> I'm working through this situation now, and it's looking like it > may > >> take a custom kernel with a larger pipe size to handle > >> it. > >> > >> > > Shadih, we have a customer running over 20.000 service checks in a > single > > box. > > > > The box is 2x Intel Dual-core Xeon 3Ghz with 8Gb RAM. The latency > is > > around 0.7s. The checks are performed between 3min and 5min. They > have > > around 40 simultaneous users. > > > > _ > > Dario B. BestettiOpServices > > I had to install a bunch of nagios for 40 polls. We ran some tests > on > bi-Xeon Dual-Core with 4Go. It seems the limit of Nagios 3 is around > 25000 > services by server, but we didn't use the interface and the recording > to > database is with Perl scripts of our own. Taking 2 services as a > limit > seem to be the right thing to do, perhaps less as 15000 with NDO. > > Regards, > -- > David Dumortier > LINAGORA > Service Management Monitoring > > > > -- > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > ___ > Nagios-users mailing list > Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null I forgot to tell that we did run some stress tests in this customer with up to 30.000 services and Nagios supported it very well. We also use our own broker to store data in a MySQL database. Regards, Dario -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] benchmark question
Hi Patrick, hi all, Dario B. Bestetti (OpServices) a écrit : > > > - "Patrick Morris" escreveu: > >> shadih rahman wrote: >> > List, >> >Is there any information as to what is the maximum number of >> > services one can run on a single box with single instance of Nagios? >> >> > I did not find any concrete data on this. >> > >> > I am running a single instance of nagios on a quad core 2.5 GHZ >> > machine with with 4 Gigs of RAM. >> > >> > I have total of 7359 service check running on this box. I have also >> >> > ndoutils running on the same box as backend. >> > >> > My total service checks is going to five fold very soon with a lot >> of >> > nrpe checks. >> > >> > Now, my question is should I run multiple instance of nagios on the >> >> > same box or a single instance will be able to handle about 3 >> > service checks? >> >> In my experience, you'll run into issues right around 8,000 checks or >> >> so, depending no output verbosity, on a fairly stock setup. This is >> due >> to the size of the pipe used to temporarily store check results, at >> least on Linux (and you may be seeing it already with your number of >> checks). Around that point, even with quite a bit of tuning, the check >> >> results will fill the pipe in less than a second, which is the minimum >> >> amount of time I've been able to configure Nagios to flush it. When >> that >> happens, latencies go through the roof. >> >> Distributing the checks doesn't solve the problem if you're still >> sending the results to a centralized Nagios instance, since one >> machine >> still needs to process all of them. >> >> I'm working through this situation now, and it's looking like it may >> take a custom kernel with a larger pipe size to handle >> it. >> >> > Shadih, we have a customer running over 20.000 service checks in a single > box. > > The box is 2x Intel Dual-core Xeon 3Ghz with 8Gb RAM. The latency is > around 0.7s. The checks are performed between 3min and 5min. They have > around 40 simultaneous users. > > _ > Dario B. BestettiOpServices I had to install a bunch of nagios for 40 polls. We ran some tests on bi-Xeon Dual-Core with 4Go. It seems the limit of Nagios 3 is around 25000 services by server, but we didn't use the interface and the recording to database is with Perl scripts of our own. Taking 2 services as a limit seem to be the right thing to do, perhaps less as 15000 with NDO. Regards, -- David Dumortier LINAGORA Service Management Monitoring -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Notification period inheritance problem
Have you tried leaving out the notification_period line from the service definition? I think, but don't have time to verify, that by saying "notification_period null" you are actually overriding all inherited values for notification_period and setting it to null instead. Best regards, Martin Melin On Tue, Dec 15, 2009 at 1:56 AM, Gordon Messmer wrote: > The documentation on "Implied Inheritance"[1] indicates that a service > with no notification period will use the notification period of the > associated host definition. This doesn't seem to work when a service is > applied to a hostgroup. > > In my configuration files, I define a generic service template of my > own. It has no notification period. I also define a host template for > each customer which defines the hostgroup and notification period which > is appropriate for each host. Finally, I define a service which will > apply to all hosts within that hostgroup. > > When I run "nagios -v", I get warnings that no notification period is given: > Warning: Service 'root partition' on host 'customer1-samba' has no > notification time period defined! > > I've received some pages from hosts when checks timed out during a > period when no notications should be sent. > > Is this a bug, or am I making some mistake? Shouldn't the service be > inheriting the value from the host? Where would I start looking if it > were a bug and I wanted to fix it? > > > > define service{ > use generic-service > name my-generic-service ; The 'name' > of this service template > notification_period null ; Use the > period from the host > register 0 > } > > define host{ > name customer1-linux-server > use my-generic-server > hostgroups my-linux-servers > notification_period my-workhours > register 0 > } > > define service{ > use my-generic-service > hostgroup_name my-linux-servers > service_description root partition > check_command ssh_disk!20%!10%!/ > } > > > 1: http://nagios.sourceforge.net/docs/3_0/objectinheritance.html > > -- > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > ___ > Nagios-users mailing list > Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null