Re: [Nagios-users] Check_load
As far as I understand : In your example, it means that during the last 15 minutes, there was, at a said time, 0.53 processes using the cpu to do something, or waiting to be allowed to use the cpu. At this time, the other processes were not doing anything, nor waiting to do something. They were probably in idle mode, having really nothing to do... It is not because a process is in the "top" list that it is either using the cpu, or waiting for it. If I am wrong, please someone correct me. But we are indeed beyond the context of Nagios, so we should maybe continue this conversation out of the list. Johan Le Mercredi 10 Mai 2006 17:26, Sandeep Narasimha Murthy a écrit : > Hi, > > So, does load average: 0.35, 0.47, 0.53, mean there are only 0.53 jobs > running and waiting over the last 15 minutes ? executing TOP in the > shell shows me atleast 15 different processes.. > > If at the same time, the CPU Usage is 100%, what does this mean ? 0.53 > jobs is consuming 100% of the CPU ? > > It appears, that I am posing questions beyond the context of Nagios so I > will stop here :) appreciate though any more clarifications.. > > Regards, > > sg > > -Original Message- > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] On Behalf Of Tiernan, > Michael C. > Sent: quarta-feira, 10 de Maio de 2006 14:43 > To: nagios-users@lists.sourceforge.net > Subject: RE: [Nagios-users] Check_load > > On Behalf Of Derek J. Balling > Sent: Wednesday, May 10, 2006 6:52 AM > To: nagios-users@lists.sourceforge.net > Subject: Re: [Nagios-users] Check_load > > > "load average" is the number of processes (average) in the > > wait state over a given period of time. > > Actually, it's the number of processes available to be run, that > includes those running. (i.e. the Run queue). > > A 20 processor machine with one job per processor and each processor > "running at 100%" is a load of 20 where the same machine, with the same > 20 jobs running *and* 10 jobs waiting to run has a load of 30. > > And as Mr Balling rightfully said, it's averaged over 1min, 5min, 15mins > by the OS itself. > > Now, this is what I've found on Solaris machines and Red Hat Linux > machines along with the SGI machines I wrangle. It *is* possible that > some other OS defines it differently. > (a.k.a. "Never say Never.") --- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid0709&bid&3057&dat1642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] latency and parallelize check
Le Jeudi 27 Avril 2006 11:29, Sandeep Narasimha Murthy a écrit : > Hi, > > > > I have a few questions regarding Latency and parallelize checks. > Appreciate some help in this regard.. > > > > 1. Latency: The average Latency value is 70.24 Seconds for all the > services but I believe is way too high. I browsed the site > (http://nagios.sourceforge.net/docs/2_0/tuning.html) but couldn't > exactly figure out how to reduce this figure. How do I go about > reducing the Latency figure ? > > > > > > 2. Regarding Parallelize Check: (This directive is used to determine > whether or not the service check can be parallelized. By default, all > service checks are parallelized. Disabling parallel checks of services > can result in serious performance problems.) > > > > From what I can understand, Parallelize check means that if I have a > Service check configured to run every 3 minutes, and Nagios determines > that even though the previous check launched 3 mins ago has still not > finished it launches a new check !? Am I correct in this assumption ? If > yes, how can Disabling this result in performance problems ? shouldn't > it be viceversa ? As far as I understand, Parallelize check means that Nagios can multiple different service checks at the same time... So, if you have let say 100 checks. Without Parallelize check, Nagios would check the first one, wait for the result, then check the second one, wait, etc... It means that if you got checks that are kinda slow, the time to check every service can be huge... And even if the checks are fast, as soon as you got a lot of services, it will take some time anyway. > > 3. Finally, In a service configuration file, I have the following: > > Maximum Check Attempts: 5 > > Normal Check Interval In Time-Units: 5 > > Retry Check Interval In Time-Units: 3 > > > > In the above, Shouldn't the Normal Check Interval be Greater than > Maximum Check Attempts * Retry Check Interval ? I do not see why it should... With this configuration, nagios would, for a said service, perform one check every 5 minutes. When it detects a problem, it would do the check once every 3 minutes. And after 5 "negative" checks, it would then consider the service in a "hard state", and issue appropriate actions (ie : sending alert mails). > TIA, > sg Johan. --- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid0709&bid&3057&dat1642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Is it possible to prevent messages about service unknown?
If you want to prevent emails about services becoming unknow for some users, no matter what the service is, then edit the concerned contact definitions and remove the "u" in the service_notification_options field. If you want to prevent emails about specific services becoming unknow, no matter what the contact is, then edit the concerned service definitions and remove the "u" in the service_notification_options field. Johan. Le Mercredi 19 Avril 2006 15:33, Scott Lundgren a écrit : > I received another round of 32 emails from nagios last night from services > that went unknown due to a socket error and then recovered. > > Is is possible to prevent emails about when a service becomes unknown > (perhaps because of a socket error) ? > > or > > What causes socket errors and how can I prevent them? > > Scott Lundgren > ITS - Web Services > UNC at Charlotte > http://www.uncc.edu > --- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid0709&bid&3057&dat1642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] per host custom macros
I don't know if you have a real use of the "host_alias", but if not, you could maybe use this field to stock your customized definitions. You should also change the notification scripts to send the $HOSTNAME$ and not the $HOSTALIAS$ value you would have something like : host_alias $CUSTVAL1;$CUSTVAL2;$CUSTVAL3;... Ok, it is a bit tricky, but well, it think it could work... Johan Le Mercredi 29 Mars 2006 21:06, David Mansfield a écrit : > jeff vier wrote: > > On Wed, 2006-03-29 at 12:55 -0500, David Mansfield wrote: > >> Something like: > >> > >> define host { > >>use generic-host > >>host_name myhost.mydomain.com > >>alias myhost > >>address 1.2.3.4 > >> > >>custom_macro FOOPROPERTY=AA:BB:CC:DD > >>custom_macro BARVALUE=BigImportantValue > >> > >>hostgroups all_foo,all_bar > >> } > >> > >> With: > >> > >> define hostgroup { > >>name all_foo > >>alias all_foo > >> } > >> > >> define service { > >>hostgroup_name all_foo > >>check_command check_foo > >> } > >> > >> And finally, the payoff: > >> > >> define command { > >>command_name check_foo > >>command_line $USER1$/check_foo -f $FOOPROPERTY$ > >> } > > > > You've pretty much exactly described $ARG1$, $ARG2$, ... $ARGn$ > > No. I'm sorry but you are missing the point. $ARG1$ blah blah are > specified in the 'service' object. > > That means you HAVE to define a service object for every host/service > combination. Yuk! Right now you can apply a service to an entire > _hostgroup_, and the assignment/memebership to the hostgroup is set up > IN the host object. This means that for those services, all the > information necessary is captured in the host definition, as long as > only the standard attributes (hostaddress etc) are referenced. > > Why should I configure the XYZ property of my host anywhere other than > the host definition? It doesnt' make one lick of sense from a > management perspective. > > Not to mention it forces twice the management burden to have to copy > service definitions all over the place. > > David > > > > > > --- > This SF.Net email is sponsored by xPML, a groundbreaking scripting language > that extends applications into web and mobile media. Attend the live > webcast and join the prime developer group breaking into this new coding > territory! > http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642 > ___ > Nagios-users mailing list > Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > ::: reporting any issue. Messages without supporting info will risk being > ::: sent to /dev/null --- This SF.Net email is sponsored by xPML, a groundbreaking scripting language that extends applications into web and mobile media. Attend the live webcast and join the prime developer group breaking into this new coding territory! http://sel.as-us.falkag.net/sel?cmd=lnk&kid0944&bid$1720&dat1642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Differnt users
Le Mardi 21 Mars 2006 19:55, Marc Powell a écrit : > > -Original Message- > > From: [EMAIL PROTECTED] [mailto:nagios-users- > > [EMAIL PROTECTED] On Behalf Of Peder Christian Bach > > Sent: Tuesday, March 21, 2006 3:59 AM > > To: Nagios-users@lists.sourceforge.net > > Subject: [Nagios-users] Differnt users > > > > Hi.. > > > > Is there any possibility to have different users on Nagios ? > > > > User x has access to see everything in web. > > > > User z has access only to do reporting in web. > > This level of granularity is not possible. If a user can see a host or > service, they can perform all administrative functions related to that > host or service. >... I am sorry, but I do not fully agree. As far as I understand, a user can perform tasks on hosts and services that he is contact for, not hosts and services he can see. So, you could, for example, create a "read only" user using the following : - user reader is not contact for anything. In the cgi.cfg : - authorized_for_all_services=reader - authorized_for_all_hosts=reader With that, you would have an user who can see everything, but can't issue any command. But I might be wrong, if it is the case, please tell me :) Johan --- This SF.Net email is sponsored by xPML, a groundbreaking scripting language that extends applications into web and mobile media. Attend the live webcast and join the prime developer group breaking into this new coding territory! http://sel.as-us.falkag.net/sel?cmd=lnk&kid0944&bid$1720&dat1642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] defunct process / more than one nagios "root" process...
Hello everyone. I am facing problems for a few days with my Nagios server (2.0b4). I often have more than one nagios "root" process (launched by the root process), and sometimes have a "defunct" nagios process. It is really anoying, as it can leads to Nagios not working any more, or, at best, only doing half of the checks it should perform,... Often, it also stops recording the results of the checks, or "forget" latest informations after a restart, rolling back to the status data of hours ago. I don't remember doing any big change lately, so I can't find why I am facing all those problems right now... Does any of you know what can cause those defunct processes, or the multiple nagios "root" processes? Or best do you know how I could avoid it :) Thanks a lot for your help, Johan. --- This SF.Net email is sponsored by xPML, a groundbreaking scripting language that extends applications into web and mobile media. Attend the live webcast and join the prime developer group breaking into this new coding territory! http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Cgi configuration : possible to set "authorized_for_..." to all but one user?
Hello everyone, Up to now, everyone who had an access to my nagios webpages could execute commands. So, the cgi.cfg file had things like : authorized_for_all_service_commands=* I would like to add a new user who can only see the informations, but could not execute any command. I tried the following : authorized_for_all_service_commands=*,!new_username But it did not work. Do any of you know if there is a way to do this? I would like to avoid replacing the "all" by a list of all my allowed users... Johan --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] multiple contact notifications for one host
If I understand well, you want your check_disk on HostA to be send to the 2 mailing lists, while all the other check_disk being only send to one of the mailing list? If I am true, then you could create a new service, "check_disk_A", doing exactly the same things than the regular check_disk, but with other notification options. Sorry if I did not understand well the question. Johan Le Mardi 14 Février 2006 17:18, [EMAIL PROTECTED] a écrit : > This is what I want to accomplish: > > > One host's disk space to be monitored, named HostA > > > > Have HostA in two different hostgroupsGroupA and GroupB > > > GroupA contact group is defined with email1 > GroupB contact group is defined with email2 > > > GroupA has a bunch of other hosts that email2 shouldnt receive so thats > why I'm trying to put it into a different group. > > > **All the hosts are on passive checking.** > So when the disk space gets to a WARNING OR CRITICAL for nagios to email > GroupA and GroupB. But it looks like in the services.cfg is where you > actually setup the notification functionality. Since checking disk is > needed on all hosts, it looks like I cant separate the notifications for 2 > separate emails that are both responsibilty for only one box and not all. > > > I would imagine there is a way to do what I'm attempting but I'm going in > the wrong direction. I need some other clear minds to point in me in the > right direction. Thanks.! ;-) > > - > > Note: This email is for the sole use of the intended > recipient(s) and may contain confidential information. Any > unauthorized review, use, disclosure or distribution is > prohibited. Contact the sender if received in error. > > > --- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642 > ___ > Nagios-users mailing list > Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > ::: reporting any issue. Messages without supporting info will risk being > ::: sent to /dev/null --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnk&kid3432&bid#0486&dat1642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Problems with NRPE
Is there any firewall between the Nagios host and the remote one? Did you try, from your Nagios host, something like "telnet remote_host 5666"? If you did, could you please tell us what was the result? Johan Le Lundi 06 Février 2006 14:13, Kannberg, Andy a écrit : > Johan, > > Yes, the port has been added to the services file, and netstat show me that > nrpe is listening on port 5666 > > > -Original Message- > > From: Johan Corsini [mailto:[EMAIL PROTECTED] > > Sent: maandag 6 februari 2006 14:07 > > To: nagios-users@lists.sourceforge.net > > Subject: Re: [Nagios-users] Problems with NRPE > > > > > > Hello, > > > > Did you add the nrpe port to the list in the /etc/services > > file? There should > > be a line with something like that : > > nrpe5666/tcp > > > > Also, did you check with a "netstat -a | grep nrpe" if the > > port is listening? > > > > > > Johan > > > > Le Lundi 06 Février 2006 13:53, Kannberg, Andy a écrit : > > > Hi, > > > > > > I've got some problems with connecting to NRPE on a remote host. > > > > > > I've got Nagios running, along with the 1.4.2 plugin set > > > > and NRPE on the > > > > > host (Slackware Linux) > > > On the remote client, a Tru64 box, the nrpe daemon runs > > > > under inet. Also, > > > > > the plugins are compiled and installef. > > > But for some reason, NRPE on the host can't reach the nrpe > > > > daemon on the > > > > > client. The message is: > > > > > > Connection refused or timed out > > > > > > I've checked and checked again, but can't find any error. > > > > Anyone a good > > > > > suggestion or hint ? > > > > > > thanks, > > > Andy > > > > --- > > This SF.net email is sponsored by: Splunk Inc. Do you grep > > through log files > > for problems? Stop! Download the new AJAX search engine that makes > > searching your log files as easy as surfing the web. > > DOWNLOAD SPLUNK! > > http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&; > > dat=121642 > ___ > Nagios-users mailing list > Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > ::: reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnk&kid3432&bid#0486&dat1642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Problems with NRPE
Hello, Did you add the nrpe port to the list in the /etc/services file? There should be a line with something like that : nrpe5666/tcp Also, did you check with a "netstat -a | grep nrpe" if the port is listening? Johan Le Lundi 06 Février 2006 13:53, Kannberg, Andy a écrit : > Hi, > > I've got some problems with connecting to NRPE on a remote host. > > I've got Nagios running, along with the 1.4.2 plugin set and NRPE on the > host (Slackware Linux) > On the remote client, a Tru64 box, the nrpe daemon runs under inet. Also, > the plugins are compiled and installef. > But for some reason, NRPE on the host can't reach the nrpe daemon on the > client. The message is: > > Connection refused or timed out > > I've checked and checked again, but can't find any error. Anyone a good > suggestion or hint ? > > thanks, > Andy --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnk&kid3432&bid#0486&dat1642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Different notification types for warnings vs. criticals
I am not sure about it, but I think that recoveries are only send to contacts who received the alert. So, if you only get the "Critical alerts" on your cell phone, you should only get the "Critical -> Ok" recoveries on it, not the "Warning -> Ok" recoveries. Johan Le Mercredi 18 Janvier 2006 23:30, Steve Shipway a écrit : > > I do see a hole in this method, however. Since I also want > > recoveries sent both for criticals and warnings, I can see > > that our cell phones will receive both types of recoveries > > since Nagios only knows that it is supposed to send c,r > > notifications. > > This is a good point. Is it really true that, wiith notifications set to > 'c,r', you will also get recovery alerts for Warn->OK and not just for > Crit->OK? If so, this will make the recovery alerts almost useless. > > Steve > > > > > --- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642 > ___ > Nagios-users mailing list > Nagios-users@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > ::: reporting any issue. Messages without supporting info will risk being > ::: sent to /dev/null --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://sel.as-us.falkag.net/sel?cmd=lnk&kid3432&bid#0486&dat1642 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Problem with nagios NRPE
Hello, I might be wrong, but I think that there is no definition of "check_fw1" in the nrpe.cfg (or in the included files) on your new server. So, NRPE doesn't know what to do when it receives the "check_fw1" instruction. There should be something like that : command[check_fw1]= Johan Corsini. Le Jeudi 12 Janvier 2006 14:14, Sameka Prather a écrit : > Hello Nagios List, > > I have added a new server to my nagios monitoring. Nagios starts fine > but I get notification messages from nagios stating the new server I > have enter is Down Info: NRPE: command check_fw1 not defined. Does > anyone have any ideas? > > The nagios log is: Service Alert: fw1; PING; CRITICAL;HARD;1;NRPE; > Command 'check_fw1' not defined. > > I have added the server to the following: command, hosts, hostgroup and > services definition files. --- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null