Re: [Nagios-users] Bonding/Teaming Monitoring
On the server part, you should go with: - monitor physical interfaces (snmp) - monitor bonding sytatus (in /proc/net/bonding/bond0, or the sysfs equivalent) Moreover, you could monitor network interfaces on the switch side. You can for example, use custom variables to store switch ports on server side. Sent from my BlackBerry® wireless device -Original Message- From: Jones, Stuart stuart.jo...@health.wa.gov.au Date: Mon, 30 Aug 2010 09:29:00 To: Nagios Users Listnagios-users@lists.sourceforge.net Reply-To: Nagios Users List nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] Bonding/Teaming Monitoring -- Sell apps to millions through the Intel(R) Atom(Tm) Developer Program Be part of this innovative community and reach millions of netbook users worldwide. Take advantage of special opportunities to increase revenue and speed time-to-market. Join now, and jumpstart your future. http://p.sf.net/sfu/intel-atom-d2d -- Sell apps to millions through the Intel(R) Atom(Tm) Developer Program Be part of this innovative community and reach millions of netbook users worldwide. Take advantage of special opportunities to increase revenue and speed time-to-market. Join now, and jumpstart your future. http://p.sf.net/sfu/intel-atom-d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios script output issue
You aren't passing that http://xyz.com thing to the server, are you? Sent from my BlackBerry® wireless device -Original Message- From: newme me allan...@gmail.com Date: Sun, 29 Aug 2010 18:12:10 To: nagios-users@lists.sourceforge.net Reply-To: Nagios Users List nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] Nagios script output issue -- Sell apps to millions through the Intel(R) Atom(Tm) Developer Program Be part of this innovative community and reach millions of netbook users worldwide. Take advantage of special opportunities to increase revenue and speed time-to-market. Join now, and jumpstart your future. http://p.sf.net/sfu/intel-atom-d2d -- Sell apps to millions through the Intel(R) Atom(Tm) Developer Program Be part of this innovative community and reach millions of netbook users worldwide. Take advantage of special opportunities to increase revenue and speed time-to-market. Join now, and jumpstart your future. http://p.sf.net/sfu/intel-atom-d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Are service contacts 'inherited' from respective host?
Hi, I'm monitoring a bunch of hosts and each of these hosts has at least two services used to monitor and graph ping RTA and packet-loss. Hosts configurations are organized in several .cfg files (by building), there is also a hostgroup definition grouping all hosts in a building. However, hosts in a building can have different contacts... I defined services using hostgroups - like this: define service { use template hostgroup_name MYHOSTGROUP } I was hoping a service that doesn't have a contact definition would 'inherit' the contact from the host - but it seems that's not the case. Is there a way to make that happen? Or will I need to define services one-by-one while and add contact definition to each? Danilo -- Danilo Godec, sistemska podpora / system administration Predlog! Obiscite prenovljeno spletno stran www.agenda.si ODPRTA KODA IN LINUX STORITVE : POSLOVNE RESITVE : UPRAVLJANJE IT : INFRASTRUKTURA IT : IZOBRAZEVANJE : PROGRAMSKA OPREMA Visit our updated web page at www.agenda.si OPEN SOURCE AND LINUX SERVICES : BUSINESS SOLUTIONS : IT MANAGEMENT : IT INFRASTRUCTURE : TRAINING : SOFTWARE -- Sell apps to millions through the Intel(R) Atom(Tm) Developer Program Be part of this innovative community and reach millions of netbook users worldwide. Take advantage of special opportunities to increase revenue and speed time-to-market. Join now, and jumpstart your future. http://p.sf.net/sfu/intel-atom-d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Suggestion: Macro cleansing/clean_macro_chars() in macros.c should not allow trailing \
Hi, clean_macro_chars() basically removes illegal_output_chars from char *macro if(options STRIP_ILLEGAL_MACRO_CHARS). Now, by default '\' is not an illegal output char, so it is not stripped. However that can cause problems if e.g. we have a command_line plugin arg0 $STRIPPED_MACRO$ arg2 Consider: $STRIPPED_MACRO$ = normal plugin sees: arg0: arg0 arg1: normal arg2: arg2 All is well. However, if: $STRIPPED MACRO$ = foo\ arg0: arg0 arg1: foo arg2 arg2: NULL The trailing backslash from foo cases the space to be backslashed. It can be avoided by putting in two spaces after $STRIPPED_MACRO$ in the command_line but that isn't quite intuitive, is it? This value could arise from the before-stripping values of $STRIPPED_MACRO$: foo\, foo\$, foo\ etc. I suggest that if stripping is to be truly safe, somehow it should be possible to avoid trailing backslashes as that messes up the args given to plugins. Peter (Yes, in another thread in nagios-devel I'm talking about Custom Object Variables and cleansing, but I think this applies to cleansing in general so I post it as a separate subject. Hope that is ok.) -- Peter Valdemar Mørch http://www.morch.com -- Sell apps to millions through the Intel(R) Atom(Tm) Developer Program Be part of this innovative community and reach millions of netbook users worldwide. Take advantage of special opportunities to increase revenue and speed time-to-market. Join now, and jumpstart your future. http://p.sf.net/sfu/intel-atom-d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Are service contacts 'inherited' from respectivehost?
-Ursprüngliche Nachricht- Von: Danilo Godec [mailto:danilo.go...@agenda.si] Gesendet: Montag, 30. August 2010 11:03 An: Nagios Users List Betreff: [Nagios-users] Are service contacts 'inherited' from respectivehost? Hi, I'm monitoring a bunch of hosts and each of these hosts has at least two services used to monitor and graph ping RTA and packet-loss. Hosts configurations are organized in several .cfg files (by building), there is also a hostgroup definition grouping all hosts in a building. However, hosts in a building can have different contacts... I defined services using hostgroups - like this: define service { use template hostgroup_name MYHOSTGROUP } I was hoping a service that doesn't have a contact definition would 'inherit' the contact from the host - but it seems that's not the case. Is there a way to make that happen? Or will I need to define services one-by-one while and add contact definition to each? Danilo Hi, What Nagios version do you use? Since Nagios 3.0 contact_groups, notification_interval, notification_period are inherited from the related host definition if you don't explicitly specify them in the service object or an inherited template. http://nagios.sourceforge.net/docs/3_0/objectinheritance.html Gerald OOe. Gesundheits- und Spitals-AG Techcenter, Hafenstrasse 47-51, 4020 Linz DVR 2107870, ATU 51928204, Firmenbuchgericht: Landesgericht Linz, FN 210146 p -- Sell apps to millions through the Intel(R) Atom(Tm) Developer Program Be part of this innovative community and reach millions of netbook users worldwide. Take advantage of special opportunities to increase revenue and speed time-to-market. Join now, and jumpstart your future. http://p.sf.net/sfu/intel-atom-d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Help needed: How do you configure dynamic thresholds depending on timeframes.
Help needed: How do you configure dynamic thresholds depending on timeframes. Example. -w = 80 -c = 90 during business hours but -w = 90 -c = 95 outsite business hours. How can we configure into nagios dynamic thresholds depending on timeframes: Part II define service{ use local-service ; Name of service template to use host_name localhost service_description Current Users check_command check_local_users!20!50 check_period nonworkhours notification_period nonworkhours } define service{ use local-service ; Name of service template to use host_name localhost service_description Current Users check_command check_local_users!40!60 check_period workhours notification_period workhours } In my above example i dit configure the same test twice. The two timeframes 'nonworkhours' en 'workhours' together equals 24x7 Is this way of configuring allowed. Because this solves my problem. 1) how will the nagios scheduling react on this configuration? 2) how will the display react on this configuration? # 'workhours' timeperiod definition define timeperiod{ timeperiod_name workhours alias Normal Working Hours monday 09:00-17:00 tuesday 09:00-17:00 wednesday 09:00-17:00 thursday09:00-17:00 friday 09:00-17:00 } # 'nonworkhours' timeperiod definition define timeperiod{ timeperiod_name nonworkhours alias Non-Work Hours sunday 00:00-24:00 monday 00:00-09:00,17:00-24:00 tuesday 00:00-09:00,17:00-24:00 wednesday 00:00-09:00,17:00-24:00 thursday00:00-09:00,17:00-24:00 friday 00:00-09:00,17:00-24:00 saturday00:00-24:00 } check_period: This directive is used to specify the short name of the time period during which active checks of this host can be made. check_period: This directive is used to specify the short name of the time period during which active checks of this service can be made. If you do not use the check_period directive to specify a timeperiod, Nagios will be able to schedule active checks of the host or service anytime it needs to. This is essentially a 24x7 monitoring scenario. Specifying a timeperiod in the check_period directive allows you to restrict the time that Nagios perform regularly scheduled, active checks of the host or service. When Nagios attempts to reschedule a host or service check, it will make sure that the next check falls within a valid time range within the defined timeperiod. If it doesnt, Nagios will adjust the next check time to coincide with the next valid time in the specified timeperiod. This means that the host or service may not get checked again for another hour, day, or week, etc. Timeperiods: Exclusions and Host/Service Checks - There is a bug in the service/host check scheduling logic that rears its head when you use timeperiod definitions that use the excludedirective. The problem occurs when Nagios Core tries to re-schedule the next check. In this case, the scheduling logic may incorrectly schedule the next check further out in the future than it should. In essence, it skips over the (missing) logic where it could determine an earlier possible time using the exception times. Imperfect Solution: Dont use timeperiod definitions that exclude other timeperods for your host/service check periods. A fix is being worked on, and will hopefully make it into a 3.4.x release. -- Sell apps to millions through the Intel(R) Atom(Tm) Developer Program Be part of this innovative community and reach millions of netbook users worldwide. Take advantage of special opportunities to increase revenue and speed time-to-market. Join now, and jumpstart your future. http://p.sf.net/sfu/intel-atom-d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Internal Server Error Occuring when trying to access Nagios on the browser
what ports do tomcat and postgres listen on? On Fri, Aug 27, 2010 at 3:40 AM, Ifeanyi Agu ifepet2...@yahoo.com wrote: Hello Alex, I want to monitor Apache Tomcat (i.e. to know when it is running or not). I also want to monitor Postgres service. My Nagios is working well now but I noticed that it does not send me mails. Am using Cent OS. Is there any other configuration I need to do outside or within Nagios that would enable mails to be sent? Ifeanyi Agu 08063728802 --- On *Thu, 26/8/10, Alex Dehaini alexdeha...@gmail.com* wrote: From: Alex Dehaini alexdeha...@gmail.com Subject: Re: [Nagios-users] Internal Server Error Occuring when trying to access Nagios on the browser To: Nagios Users List nagios-users@lists.sourceforge.net Date: Thursday, 26 August, 2010, 7:24 PM Yep, Nagios has tons and tons of plugins that can monitor virtually any service - what exactly do you want to monitor, a service, connection to a port, what? Let us know and we can help Regards, Alex PS: Review the docx, common questions are usually answered in the docx. :) -Inline Attachment Follows- -- Sell apps to millions through the Intel(R) Atom(Tm) Developer Program Be part of this innovative community and reach millions of netbook users worldwide. Take advantage of special opportunities to increase revenue and speed time-to-market. Join now, and jumpstart your future. http://p.sf.net/sfu/intel-atom-d2d -Inline Attachment Follows- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.nethttp://mc/compose?to=nagios-us...@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Sell apps to millions through the Intel(R) Atom(Tm) Developer Program Be part of this innovative community and reach millions of netbook users worldwide. Take advantage of special opportunities to increase revenue and speed time-to-market. Join now, and jumpstart your future. http://p.sf.net/sfu/intel-atom-d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Sell apps to millions through the Intel(R) Atom(Tm) Developer Program Be part of this innovative community and reach millions of netbook users worldwide. Take advantage of special opportunities to increase revenue and speed time-to-market. Join now, and jumpstart your future. http://p.sf.net/sfu/intel-atom-d2d___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios script output issue
On Aug 29, 2010, at 8:12 PM, newme me wrote: ./check_http -H xyz.com -p -u /abc -t 3 OK Result from Nagios Interface - nodename nor servname provided, or not known HTTP CRITICAL - Unable to open TCP socket I am thinking that this has something to do with Nagios as the box is behaving just fine based on the commandline result(and verified on the URL through the browser). It seems like you may have multiple nagios daemons running at the same time; one that's using a previously bad config, and one with a good new config. Stop nagios, verify that all nagios processes are dead and restart. -- Marc -- Sell apps to millions through the Intel(R) Atom(Tm) Developer Program Be part of this innovative community and reach millions of netbook users worldwide. Take advantage of special opportunities to increase revenue and speed time-to-market. Join now, and jumpstart your future. http://p.sf.net/sfu/intel-atom-d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios script output issue
Hi Marc, Thanks for your reply, I checked on the multiple process of Nagios but there is only one running. Any other thing you can think of which might be causing this odd behaviour? Thanks, Allan. On Sun, Aug 29, 2010 at 6:12 PM, newme me allan...@gmail.com wrote: Folks, I switched to check_http plugin (official nagios plugin) but that also has same problem. Initially when I start nagios then the status on the Nagios Web interface is same as the one returned from commandline. After some time the status becomes critical but is not as same as the commandline. Command line returns the correct status of OK instead of what the Nagios web interface shows as CRITICAL Result from commandline( browser) - ./check_http -H xyz.com -p -u /abc -t 3 OK Result from Nagios Interface - nodename nor servname provided, or not known HTTP CRITICAL - Unable to open TCP socket I am thinking that this has something to do with Nagios as the box is behaving just fine based on the commandline result(and verified on the URL through the browser). Please help! Thanks, Allan. On Thu, Aug 26, 2010 at 10:26 PM, newme me allan...@gmail.com wrote: Hi Folks, Nagios is acting a little weird for me, I have this external script which I hooked into Nagios, it merely does a curl/wget on a URL and returns the status based on string in the content/output. Initially for 2-3 hrs the script returns the right status and Nagios reports correctly i.e. OK, WARN, ERROR based on the exit from the script. After 2-3 hrs output which was (and should be) OK or WARN starts returning CRITICAL and the output line says Application is and not even Application is ERROR or Application is FATAL. There is nothing in the logs to suggest what could be the problem. Have you experienced this before and let me know the corrective action. I am running Nagios on Mac OSX. Here is the script for the curious - #!/bin/bash read URL $1 STATUS=`curl -s $URL |grep summary|awk -F\ '{print $2}'` echo Application is $STATUS echo curl $URL case $STATUS in OK) exit 0 ;; WARN) exit 1 ;; ERROR) exit 2 ;; FATAL) exit 2 ;; *) exit 2 ;; esac Thanks, Allan. -- This SF.net Dev2Dev email is sponsored by: Show off your parallel programming skills. Enter the Intel(R) Threading Challenge 2010. http://p.sf.net/sfu/intel-thread-sfd___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] Help hacking a script to montior my AppServers?
I am trying to monitor our OpenEdge Database Application Servers on our AIX box with Nagios, and to send an alert based on a tricky situation, described below: The tool that I run locally on the AIX host to check our servers outputs the following: #asbman -name production -query PID State Port nRqnRcvd nSent Started Last Change 2175264 AVAILABLE 15614 006968 007643 008661 Aug 29, 2010 19:03 Aug 30, 2010 11:16 922080 AVAILABLE 15616 002675 002970 003465 Aug 30, 2010 09:22 Aug 30, 2010 11:16 483832 AVAILABLE 15607 001740 001939 002204 Aug 30, 2010 10:02 Aug 30, 2010 11:16 2936902 AVAILABLE 15620 001753 001955 002236 Aug 30, 2010 10:02 Aug 30, 2010 11:16 3658058 AVAILABLE 15623 001742 001921 002190 Aug 30, 2010 10:02 Aug 30, 2010 11:16 3027394 AVAILABLE 15608 001297 001377 001493 Aug 30, 2010 10:21 Aug 30, 2010 11:16 3965150 SENDING 15611 001290 001352 001465 Aug 30, 2010 10:21 Aug 30, 2010 06:12 4186168 AVAILABLE 15615 001300 001379 001549 Aug 30, 2010 10:21 Aug 30, 2010 11:16 2044322 AVAILABLE 15617 001298 001369 001558 Aug 30, 2010 10:21 Aug 30, 2010 11:16 I would want Nagios to raise an alert if State = SENDING for more than 6 hours (Last Change)? Would this be possible? Anyone have an idea of how I would do this? I guess one way would be to make a shell script to output asbman -name production -query | grep SENDING status.txt and have a check log type check read it and raise an error if stuff is found? But it does not exactly implement the Last Change date time idea, we will always have PIDs going from AVAILABLE to SENDING and back, but those that are stuck in SENDING for 6+ hours is of concern. Any help on this is appreciated. The Oppenheimer Group CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. -- This SF.net Dev2Dev email is sponsored by: Show off your parallel programming skills. Enter the Intel(R) Threading Challenge 2010. http://p.sf.net/sfu/intel-thread-sfd___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Help hacking a script to montior my AppServers?
You will have to write your script in a way that it is able to parse the date fields in that output and calculate the difference between that time and the current time.. Is it possible to have that program output UNIX timestamps instead? If not, it will require you to get creative with awk, Perl, shell, etc. scripting to parse the fields properly in order to do the required math to determine how long it has been in that state. Depending on how long that binary takes to run (e.g., if it takes less than a few seconds), you may not need to have it run separately and output data to a text file. You may be able to just have your custom plugin script run it and parse the output directly. On 08/30/2010 02:20 PM, Mirza Dedic wrote: I am trying to monitor our OpenEdge Database Application Servers on our AIX box with Nagios, and to send an alert based on a tricky situation, described below: The tool that I run locally on the AIX host to check our servers outputs the following: #asbman --name production --query PID State Port nRqnRcvd nSent Started Last Change 2175264 AVAILABLE 15614 006968 007643 008661 Aug 29, 2010 19:03 Aug 30, 2010 11:16 922080 AVAILABLE 15616 002675 002970 003465 Aug 30, 2010 09:22 Aug 30, 2010 11:16 483832 AVAILABLE 15607 001740 001939 002204 Aug 30, 2010 10:02 Aug 30, 2010 11:16 2936902 AVAILABLE 15620 001753 001955 002236 Aug 30, 2010 10:02 Aug 30, 2010 11:16 3658058 AVAILABLE 15623 001742 001921 002190 Aug 30, 2010 10:02 Aug 30, 2010 11:16 3027394 AVAILABLE 15608 001297 001377 001493 Aug 30, 2010 10:21 Aug 30, 2010 11:16 *3965150 *_*SENDING*_* ** 15611 001290 001352 001465 Aug 30, *_*2010 10:21 Aug 30, 2010 *__*06*__*:*__*12*_* * 4186168 AVAILABLE 15615 001300 001379 001549 Aug 30, 2010 10:21 Aug 30, 2010 11:16 2044322 AVAILABLE 15617 001298 001369 001558 Aug 30, 2010 10:21 Aug 30, 2010 11:16 I would want Nagios to raise an alert if State = SENDING for more than 6 hours (Last Change)? Would this be possible? Anyone have an idea of how I would do this? I guess one way would be to make a shell script to output asbman --name production --query | grep SENDING status.txt and have a check log type check read it and raise an error if stuff is found? But it does not exactly implement the Last Change date time idea, we will always have PIDs going from AVAILABLE to SENDING and back, but those that are stuck in SENDING for 6+ hours is of concern. Any help on this is appreciated. The Oppenheimer Group CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. -- This SF.net Dev2Dev email is sponsored by: Show off your parallel programming skills. Enter the Intel(R) Threading Challenge 2010. http://p.sf.net/sfu/intel-thread-sfd ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- This SF.net Dev2Dev email is sponsored by: Show off your parallel programming skills. Enter the Intel(R) Threading Challenge 2010. http://p.sf.net/sfu/intel-thread-sfd___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] Nagios script output issue
One more thing I would like to add here , I have 5 URLs check , 2 thru check_http and 3 thru my own curl script and all of them fail at the same time, initially for 2-3 hrs all are returning the right status. The error that I am getting from Nagios(check_http) - nodename nor servname provided, or not known HTTP CRITICAL - Unable to open TCP socket is somewhat as if the dns server becomes inaccessible but when I run the commands from the command line, the status is fine. Is there something within Nagios that I need to set/configure? Please help! Thanks, Allan. On Mon, Aug 30, 2010 at 10:47 AM, newme me allan...@gmail.com wrote: Hi Marc, Thanks for your reply, I checked on the multiple process of Nagios but there is only one running. Any other thing you can think of which might be causing this odd behaviour? Thanks, Allan. On Sun, Aug 29, 2010 at 6:12 PM, newme me allan...@gmail.com wrote: Folks, I switched to check_http plugin (official nagios plugin) but that also has same problem. Initially when I start nagios then the status on the Nagios Web interface is same as the one returned from commandline. After some time the status becomes critical but is not as same as the commandline. Command line returns the correct status of OK instead of what the Nagios web interface shows as CRITICAL Result from commandline( browser) - ./check_http -H xyz.com -p -u /abc -t 3 OK Result from Nagios Interface - nodename nor servname provided, or not known HTTP CRITICAL - Unable to open TCP socket I am thinking that this has something to do with Nagios as the box is behaving just fine based on the commandline result(and verified on the URL through the browser). Please help! Thanks, Allan. On Thu, Aug 26, 2010 at 10:26 PM, newme me allan...@gmail.com wrote: Hi Folks, Nagios is acting a little weird for me, I have this external script which I hooked into Nagios, it merely does a curl/wget on a URL and returns the status based on string in the content/output. Initially for 2-3 hrs the script returns the right status and Nagios reports correctly i.e. OK, WARN, ERROR based on the exit from the script. After 2-3 hrs output which was (and should be) OK or WARN starts returning CRITICAL and the output line says Application is and not even Application is ERROR or Application is FATAL. There is nothing in the logs to suggest what could be the problem. Have you experienced this before and let me know the corrective action. I am running Nagios on Mac OSX. Here is the script for the curious - #!/bin/bash read URL $1 STATUS=`curl -s $URL |grep summary|awk -F\ '{print $2}'` echo Application is $STATUS echo curl $URL case $STATUS in OK) exit 0 ;; WARN) exit 1 ;; ERROR) exit 2 ;; FATAL) exit 2 ;; *) exit 2 ;; esac Thanks, Allan. -- This SF.net Dev2Dev email is sponsored by: Show off your parallel programming skills. Enter the Intel(R) Threading Challenge 2010. http://p.sf.net/sfu/intel-thread-sfd___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null