Re: [Nagios-users] Detecting partial outages
Just off the top of my head -- you could have each monitoring station report its status for each datacenter to the other, then use check_cluster on the monitoring stations with thresholds set so that both services have to be down in order to cause an alarm.

Of course, I'm sure someone will swoop down from above with a better answer.

On Thu, 2007-08-23 at 11:42 -0700, David Barrett wrote:
> Is there any way to configure Nagios to detect and ignore partial outages?
>
> Specifically, I have multiple datacenters for my production service, and then two separate locations from which I do monitoring. It's very rare that any of the production datacenters goes down, but it does happen on occasion that one of the datacenters becomes inaccessible from only *one* of the monitoring stations. (In other words, the datacenter is up and running fine, and appears accessible by real users, but looks down to one of my monitoring stations.)
>
> Is there any way to configure Nagios to detect this sort of partial outage condition and ignore it? I only want to be notified if it's reported down by *both* monitoring stations.
>
> Thanks!
>
> -david

-
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems? Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now http://get.splunk.com/
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
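To make the "both stations must agree" idea concrete, here is a minimal shell sketch of the thresholding logic. It is not the check_cluster plugin itself: `cluster_state` is a hypothetical helper, and it assumes each station's result has already been collected as a Nagios state code (0 = OK, non-zero = problem).

```shell
#!/bin/bash
# Sketch only: cluster_state is a hypothetical helper, not part of Nagios.
# Usage: cluster_state <critical_count> <state> [<state> ...]
# Each <state> is the Nagios state code one monitoring station reported
# for the same datacenter (0 = OK, anything else = problem).
cluster_state() {
    local crit_count=$1
    shift
    local total=$#
    local failed=0
    local state
    for state in "$@"; do
        if [ "$state" -ne 0 ]; then
            failed=$((failed + 1))
        fi
    done
    # Alarm only when at least <critical_count> stations agree on a problem.
    if [ "$failed" -ge "$crit_count" ]; then
        echo "CRITICAL: $failed of $total stations report a problem"
        return 2
    fi
    echo "OK: $failed of $total stations report a problem"
    return 0
}
```

With two stations and a critical count of 2, `cluster_state 2 2 0` stays OK (only one station sees the outage), while `cluster_state 2 2 2` returns CRITICAL.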
Re: [Nagios-users] check_snmp_storage.pl doubt
Why are you using snmp to monitor a local resource? Just use check_disk. It comes as one of the standard nagios-plugins.

On Thu, 2007-07-26 at 18:44 +0530, sujith h wrote:
> Hi,
>
> My name is Sujith. I am using Nagios for our network monitoring, on a PC running Ubuntu (Dapper). I tested check_snmp_storage.pl to monitor the disk space on that PC, and it works fine. But when I copied the exact file to another PC, which also runs Ubuntu and on which I had installed the packages libsnmp-perl, snmp and snmpd, and executed the script:
>
>     ./check_snmp_storage -H 127.0.0.1 -C public -m / -w 80 -c 90 -v
>
> I get this error:
>
>     ERROR: Description/Type table : Requested table is empty or does not exist.
>
> Can anyone help me understand why?
>
> Cheers,
> Sujith Haridaasan
> --
> സുജിത് ഹരിദാസ൯
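For reference, a local check with check_disk might be wrapped like the sketch below. The plugin directory is an assumption (it varies by distribution), and `local_disk_check` is a hypothetical helper. Note that check_disk thresholds are expressed as free space, so the "80% / 90% used" thresholds from the question correspond roughly to "20% / 10% free".

```shell
#!/bin/bash
# Sketch only: the plugin directory is an assumption; adjust it to wherever
# your distribution installs the standard nagios-plugins.
PLUGIN_DIR=${PLUGIN_DIR:-/usr/lib/nagios/plugins}

# Usage: local_disk_check <mount_point> <warn_free_pct> <crit_free_pct>
local_disk_check() {
    local plugin="$PLUGIN_DIR/check_disk"
    if [ ! -x "$plugin" ]; then
        # Report UNKNOWN (3) rather than failing silently.
        echo "UNKNOWN: $plugin not found or not executable"
        return 3
    fi
    # -w/-c are free-space thresholds, -p selects the path to check.
    "$plugin" -w "$2%" -c "$3%" -p "$1"
}
```

For example, `local_disk_check / 20 10` warns when less than 20% of the root filesystem is free and goes critical below 10%.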
Re: [Nagios-users] check process ??
There are plenty of them on nagiosexchange.org.

On Thu, 2007-07-12 at 11:34 -0300, Guille wrote:
> Hello,
>
> What plugin should I use to check whether a process is running or not? I looked for one on Google, but with no luck!
>
> Regards!
> Guille.
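The standard nagios-plugins bundle also ships check_procs, which covers this case. Purely to illustrate what such a check does, here is a minimal Linux-only sketch that looks for a process by exact command name under /proc. `process_running` is a hypothetical helper, not an existing plugin.

```shell
#!/bin/bash
# Sketch only (Linux-specific): process_running is a hypothetical helper.
# It matches the exact command name stored in /proc/<pid>/comm, which the
# kernel truncates to 15 characters.
process_running() {
    local name=$1
    local comm_file
    for comm_file in /proc/[0-9]*/comm; do
        # A process may exit between the glob and the read, so ignore errors.
        if [ "$(cat "$comm_file" 2>/dev/null)" = "$name" ]; then
            echo "PROCS OK: $name is running"
            return 0
        fi
    done
    echo "PROCS CRITICAL: $name is not running"
    return 2
}
```

The return codes follow the usual plugin convention (0 = OK, 2 = CRITICAL), so a wrapper script could simply end with `process_running sshd; exit $?`.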
Re: [Nagios-users] check process ??
...or snmp, which is also useful for all sorts of other things.

On Thu, 2007-07-12 at 15:43 +0100, Hari Sekhon wrote:
> If checking processes on remote machines you need a remote execution method like nrpe or nsca.
Re: [Nagios-users] Nagios solution
You could make the host check a 'ping' (not active), and assign a single (active) 'ping' service to each host. That way, when the service fails, the host is checked, and then the status of the host will change. Kinda redundant, but it'll work.

I'm sure someone like Hugo or Marc will come up with something that'll blow our collective minds right after I click 'send'.

On Fri, 2007-07-06 at 22:26 +0300, bolovan bogdan wrote:
> Hello all,
>
> I am in a dilemma. I have to monitor a massive network of VPN hosts (most of them Cisco CPE routers), and all I really need is availability polling. The problem is that active host checking became really sluggish once I passed 500 hosts. I ran some tests with active checks disabled and a ping service enabled, but the host doesn't change state when the service is critical.
>
> Does anyone know any solution for passive checks on Cisco routers? Or how to make the ping service change the state of the host when the ping service becomes critical? Or a way to implement passive checks for these Cisco routers?
>
> If anyone can help I would be eternally thankful.
>
> My regards,
> Bogdan
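On the passive-check part of the question: one possible approach (an assumption on my part, not something proposed in the thread) is to have an event handler on the ping service submit a passive host check result through the Nagios external command file. The sketch below only formats and appends the PROCESS_HOST_CHECK_RESULT line; the command-file path and the helper names are hypothetical, and the host would need to accept passive checks.

```shell
#!/bin/bash
# Sketch only: the command-file path is an assumption; use the value of
# command_file from your nagios.cfg.
COMMAND_FILE=${COMMAND_FILE:-/usr/local/nagios/var/rw/nagios.cmd}

# Usage: format_host_result <timestamp> <host_name> <status_code> <output>
# Host status codes: 0 = UP, 1 = DOWN, 2 = UNREACHABLE.
format_host_result() {
    printf '[%s] PROCESS_HOST_CHECK_RESULT;%s;%s;%s\n' "$1" "$2" "$3" "$4"
}

# Usage: submit_host_result <host_name> <status_code> <output>
# Appends one external command line, stamped with the current epoch time.
submit_host_result() {
    format_host_result "$(date +%s)" "$1" "$2" "$3" >> "$COMMAND_FILE"
}
```

An event handler for the ping service could then call something like `submit_host_result "$HOSTNAME" 1 "PING CRITICAL"` when the service goes hard CRITICAL, with the host name passed in from the handler's arguments.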
Re: [Nagios-users] Nagios Message Acknowledgment
Please always reply on-list.

I would assume, then, that your contact isn't set up to use the 'host-notify-by-email' notification method...

On Thu, 2007-06-21 at 17:04 -0400, Nedim Bicic wrote:
> Yeah, that was it. But for some reason, when I acknowledge it, I don't see any comment or author's name. I have restarted Nagios.
>
> This is what I get when a host goes down:
>
>     * Nagios *
>     Notification Type: PROBLEM
>     Host: FCD155E
>     State: DOWN
>     Address: 10.31.6.111
>     Info: CRITICAL - Plugin timed out after 10 seconds
>     Date/Time: Thu Jun 21 16:45:49 EDT 2007
>
> And this is what I get when I acknowledge the problem. Does anyone know why it doesn't show up in here?
>
>     * Nagios *
>     Notification Type: ACKNOWLEDGEMENT
>     Host: FCD155E
>     State: DOWN
>     Address: 10.31.6.111
>     Info: CRITICAL - Plugin timed out after 10 seconds
>     Date/Time: Thu Jun 21 16:47:48 EDT 2007
>
> any help would be
>
> On 6/21/07, Rev. Dr. Aaron M. Segura wrote:
> > Looks like you have a carriage return on line 53 when you shouldn't. Just a guess though. It helps if you post the config around the line that's being complained about.
> >
> > -- Marc
> >
> > On Thu, 2007-06-21 at 15:58 -0400, Nedim Bicic wrote:
> > > This is the command I have in, and I keep getting the same errors. But printf shouldn't matter, because why would it work in general for host down notifications? Here is the command I have in now:
> > >
> > >     define command {
> > >         command_name host-notify-by-email
> > >         command_line /bin/echo -e Subject: $NOTIFICATIONTYPE$ alert - Host $HOSTNAME$ is $HOSTSTATE$\n \n* Nagios *\n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS $\nInfo: $HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n\nAcknowledged By: $HOSTACKAUTHOR$\nAcknowledgement: $HOSTACKCOMMENT$ | /bin/mail $CONTACTEMAIL$
> > >     }
> > >
> > > On 6/21/07, Adam Kennedy wrote:
> > > > What is the full config entry for that command? I fear you are not using printf when you should be.
> > > >
> > > > Nedim Bicic wrote:
> > > > > I wish it was that easy, but it's still not working regardless of spelling. I changed it, of course, but I get the same error:
> > > > >
> > > > >     nagios -v /etc/nagios/nagios.cfg
> > > > >
> > > > >     Nagios 2.4
> > > > >     Copyright (c) 1999-2006 Ethan Galstad (http://www.nagios.org)
> > > > >     Last Modified: 05-31-2006
> > > > >     License: GPL
> > > > >
> > > > >     Reading configuration data...
> > > > >
> > > > >     Error: Invalid command object directive '$HOSTACKAUTHOR $\nAcknowledged:'.
> > > > >     Error: Could not add object property in file '/etc/nagios/misccommands.cfg' on line 54.
> > > > >
> > > > > On 6/21/07, Palle Jensen wrote:
> > > > > > Could it be a spelling problem? Acnkowledgement: - Acknowledgement?
> > > > > >
> > > > > >     Error: Invalid command object directive '$HOSTACKAUTHOR$\nAcnkowledgement:'.
> > > > > >
> > > > > > - Palle
> > > > > >
> > > > > > From: [EMAIL PROTECTED] On Behalf Of Nedim Bicic
> > > > > > Sent: Thursday, June 21, 2007 2:11 PM
> > > > > > To: Jim Perrin
> > > > > > Cc: nagios-users@lists.sourceforge.net
> > > > > > Subject: Re: [Nagios-users] Nagios Message Acknowledgment
> > > > > >
> > > > > > Hey, this is the error I get when I try to debug the above command for the acknowledgement. Does anyone know why or how to fix
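The printf suggestion in the thread is worth spelling out: with the "%b" format, printf expands the \n escapes in its argument, so the whole message body can live on one physical line, which is what a command_line needs. The sketch below is a plain shell illustration; the positional arguments merely stand in for the Nagios macros, and `format_host_notification` is a hypothetical helper.

```shell
#!/bin/bash
# Sketch only: the arguments stand in for the Nagios macros used in the thread
# ($NOTIFICATIONTYPE$, $HOSTNAME$, $HOSTSTATE$, $HOSTACKAUTHOR$, $HOSTACKCOMMENT$).
# format_host_notification is a hypothetical helper.
format_host_notification() {
    # "%b" makes printf expand the \n escapes, so the template stays on
    # a single physical line with no embedded line breaks.
    printf '%b' "***** Nagios *****\n\nNotification Type: $1\nHost: $2\nState: $3\n\nAcknowledged By: $4\nAcknowledgement: $5\n"
}
```

In an actual command definition the output would be piped to the mailer, and the key point is the same: everything after `command_line` stays on one line, so a stray line break cannot be parsed as a new directive.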
Re: [Nagios-users] Host/Ping check and Nagios performance.
That's just how Nagios 2.x works. I hear-tell that 3.x doesn't have this problem.

On Thu, 2007-06-07 at 11:07 -0400, Jonathan Call wrote:
> Is there any reason why Nagios stops running all service checks while it executes check-host-alive/ping on hosts? Can I change that? I cannot find a setting to do it.
>
> With the large number of service checks I'm running (1300+), whenever a host goes down (or in some cases just stops answering ICMP) it kills performance on the Nagios server.
>
> Jonathan Call
Re: [Nagios-users] problems with a Script on nagios
Every user should have rights to read '/etc/passwd'. More likely you don't have rights to read /var/log/messages as nagios.

On Thu, 2007-05-31 at 10:14 +0200, Jose luis Fernandez wrote:
> Because the user on the shell is root and has rights to process /etc/passwd, but the Nagios user doesn't have rights to read /etc/passwd.
>
> [EMAIL PROTECTED] wrote:
> > Hi Nagiosexchange users!!!
> >
> > I have some problems with this bash script. The objective is to monitor all the incorrect logins to the system, so I created this script.
> >
> > check_acces
> > ---
> >     #!/bin/bash
> >     # syslog timestamps use the abbreviated month and a space-padded day
> >     di=$(date '+%b %e')
> >     # failed logins for each existing account
> >     for i in $(cut -f1 -d: /etc/passwd); do
> >         kkY=$(grep "Failed keyboard-interactive/pam for $i" /var/log/messages | cut -c1-15 | awk "/$di/")
> >         if [ -n "$kkY" ]; then
> >             sa1="$sa1 User: $i Date of Attempts:$kkY"
> >         fi
> >     done
> >     # failed logins for accounts that do not exist
> >     kk6=$(grep "Failed keyboard-interactive/pam for illegal user" /var/log/messages | cut -c1-15 | awk "/$di/")
> >     sal2="EXISTING USERS:$sa1 NO EXISTING USERS: Date of Attempts:$kk6"
> >     # unquoted on purpose: collapses the newlines so the output is a single line
> >     echo $sal2
> >
> > The script works fine in the shell. I execute it like this:
> >
> >     ./check_acces
> >
> > and it returns this:
> >
> >     EXISTING USERS: User: root Date of Attempts:May 31 08:40:03 May 31 08:40:04 May 31 08:40:15 May 31 08:40:17 May 31 08:40:18 User: jon Date of Attempts:May 31 08:40:33 May 31 08:40:34 May 31 08:40:35 May 31 08:40:36 May 31 08:40:38 NO EXISTING USERS: Date of Attempts:May 31 08:40:54 May 31 08:40:56 May 31 08:40:57 May 31 08:40:59 May 31 08:41:01 May 31 08:41:02 May 31 08:41:04 May 31 08:41:14 May 31 08:41:15 May 31 08:41:16 May 31 08:41:17 May 31 08:41:19 May 31 08:41:20 May 31 08:41:23
> >
> > This is OK, it works fine. But when I declare a service and a command in Nagios with this script, something strange happens. Only this appears in Nagios. Where are the incorrect logins? They don't appear.
> >
> >     Service:            Incorrect Access
> >     Status:             OK
> >     Last Check:         05-31-2007 09:30:46
> >     Duration:           2d 23h 30m 55s
> >     Attempt:            1/4
> >     Status information: EXISTING USERS: NO EXISTING USERS: Date of Attempts:
> >
> > I don't know why the script returns nothing when it is executed by Nagios, yet works fine in the shell. Does anyone know something about this?
> >
> > Thank you for your help!!!
> >
> > Aritz.
>
> --
> José Luís Fernández Sánchez
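A quick way to confirm the permissions theory is to have the plugin complain loudly when it cannot read its input, instead of printing an empty result with an OK status. The sketch below assumes nothing about the original script beyond the log path; `require_readable` is a hypothetical helper.

```shell
#!/bin/bash
# Sketch only: require_readable is a hypothetical helper meant for the top
# of a plugin. It reports UNKNOWN (exit code 3) when the user running the
# check cannot read the given file.
require_readable() {
    if [ ! -r "$1" ]; then
        echo "UNKNOWN: $1 is not readable by $(id -un)"
        return 3
    fi
    return 0
}
```

In a plugin this would be used as `require_readable /var/log/messages || exit 3` before any grep. Running the script by hand as the monitoring user (for example with `sudo -u nagios ./check_acces`, assuming that user is named nagios) should then reproduce what Nagios sees.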
[Nagios-users] Nagios SNMP MIB / .1 (Unique ID)
In the standard Nagios SNMP MIB there is an OID for each of the Host and Service notifications which is basically a unique identifier for the event. However, the MIB also specifically states that the ID is generated not by Nagios but by the SNMP application.

Does anyone use this field? How do you populate it? I assume it's meant to be used to tie together subsequent notifications, so whoever is receiving them knows that they are related to a single event. Is my thinking incorrect?

    nHostEventIndex OBJECT-TYPE
        SYNTAX      Integer32 (1..65535)
        MAX-ACCESS  not-accessible
        STATUS      current
        DESCRIPTION
            "This object uniquely identifies this host event entry. It is
             generated by the SNMP application and is not related to any
             Nagios data."
        ::= { nagiosHostEventEntry 1 }

thanks