Re: [Nagios-users] NSCA : Time out
On 16/07/12 20:44, trm asn wrote: On Mon, Jul 16, 2012 at 2:08 AM, trm asn trm.nag...@gmail.com mailto:trm.nag...@gmail.com wrote: Dear Team, I am getting the below error message while pushing the results to central nagios. I am able to telnet 5667 from Monitor node to central server. in Tcp-Wrapper this ip is allowed . The results are not getting updated due to this. Please help me out to sort this .. Warning: OCSP command '/mnt/vol1/nagios/libexec/eventhandlers/submit_check_result Lnx-ProdDatabase1 'Checks if the SQL Slave thread is running' OK 'OK - Slave io is running'' for service 'Checks if the SQL Slave thread is running' on host ' Lnx-ProdDatabase1' timed out after 10 seconds . Still not able to figure out , where is the actual problem is. i) I am able to telnet to central server's 5667 port . ii) I am able to test it from commandline .. echo testhostTestService 0OK | /mnt/vol1/nagios/libexec/send_nsca -H x.x.x.x -p 5667 -c /mnt/vol1/nagios/etc/send_nsca.cfg 0 data packet(s) sent to host successfully. Is there anywhere else , I need to lookout to get the result submitted to the central server. /\ TRM i have found that some time NSCA becomes stale on the server side and a restart of the service helps to restore it to action. Also , are you writing to the syslog ? if so check the log size and if the file system has not filled up ( on both sides of the interaction) . did you check the nsca log file ( usually syslog ) ? -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA : Time out
On Mon, Jul 16, 2012 at 2:08 AM, trm asn trm.nag...@gmail.com wrote: Dear Team, I am getting the below error message while pushing the results to central nagios. I am able to telnet 5667 from Monitor node to central server. in Tcp-Wrapper this ip is allowed . The results are not getting updated due to this. Please help me out to sort this .. Warning: OCSP command '/mnt/vol1/nagios/libexec/eventhandlers/submit_check_result Lnx-ProdDatabase1 'Checks if the SQL Slave thread is running' OK 'OK - Slave io is running'' for service 'Checks if the SQL Slave thread is running' on host ' Lnx-ProdDatabase1' timed out after 10 seconds . Still not able to figure out , where is the actual problem is. i) I am able to telnet to central server's 5667 port . ii) I am able to test it from commandline .. echo testhostTestService 0OK | /mnt/vol1/nagios/libexec/send_nsca -H x.x.x.x -p 5667 -c /mnt/vol1/nagios/etc/send_nsca.cfg 0 data packet(s) sent to host successfully. Is there anywhere else , I need to lookout to get the result submitted to the central server. /\ TRM -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA : Time out
Dear Team, I am getting the below error message while pushing the results to central nagios. I am able to telnet 5667 from Monitor node to central server. in Tcp-Wrapper this ip is allowed . The results are not getting updated due to this. Please help me out to sort this .. Warning: OCSP command '/mnt/vol1/nagios/libexec/eventhandlers/submit_check_result Lnx-ProdDatabase1 'Checks if the SQL Slave thread is running' OK 'OK - Slave io is running'' for service 'Checks if the SQL Slave thread is running' on host ' Lnx-ProdDatabase1' timed out after 10 seconds -- TRM -- Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] nsca old server with new nsca client
Albert, The packet size was increased in 2.9 that caused 2.9 clients to fail with older servers and 2.9 servers to fail with older clients. A fix was implemented in the server in 2.9.1 that would allow older clients to connect to it, but 2.9 and newer clients will not be able to talk to older servers because they still use the larger packet size. The reason the fix is in the server is because there are far fewer servers than clients, meaning less things to replace in order to get things working. If you update your server, your older clients should be able to connect. Hope that helps. Eric On 2/7/2012 3:48 PM, Albert Shih wrote: Le 07/02/2012 à 13:34:21-0800, Mike Lindsey a écrit On 2/7/12 12:10 PM, Albert Shih wrote: Hi all, Is there any way I can use a new (2.9.1) client nsca (send_nsca) with old server (2.7.x) ? 2.9.1 shouldn't include any backwards incompatible code. That said, the I don't knownAll I known is the new version of client don't work (for me) when the server is the old one. I've two nagios server, on one I run the new version of nsca-server and everything working. On the second from same client with old sever I was unable to make passive service working. normal cross-version issues have been with newer server, and older client so I'm not sure old server has been sufficiently tested with new client... Is there a particular reason why you can't upgrade your server side? Well...let's «it's complicated» ;-) In fact I solve my problem by recompiling on the client the 2.7.2 version of nsca client. But well...the problem still here... I can run some test if you like. The client running FreeBSD 8.2 and the server running Linux Debian. Regards. JAS -- Eric Stanley ___ Developer Nagios Enterprises, LLC Email: estan...@nagios.com Web:www.nagios.com -- Virtualization Cloud Management Using Capacity Planning Cloud computing makes use of virtualization - but cloud computing also focuses on allowing computing to be delivered as a service. http://www.accelacomm.com/jaw/sfnl/114/51521223/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] nsca old server with new nsca client
Hi all, Is there any way I can use a new (2.9.1) client nsca (send_nsca) with old server (2.7.x) ? regards. JAS -- Albert SHIH DIO batiment 15 Observatoire de Paris 5 Place Jules Janssen 92195 Meudon Cedex Téléphone : 01 45 07 76 26/06 86 69 95 71 xmpp: j...@obspm.fr Heure local/Local time: mar 7 fév 2012 21:10:06 CET -- Keep Your Developer Skills Current with LearnDevNow! The most comprehensive online learning library for Microsoft developers is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3, Metro Style Apps, more. Free future releases when you subscribe now! http://p.sf.net/sfu/learndevnow-d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] nsca old server with new nsca client
On 2/7/12 12:10 PM, Albert Shih wrote: Hi all, Is there any way I can use a new (2.9.1) client nsca (send_nsca) with old server (2.7.x) ? 2.9.1 shouldn't include any backwards incompatible code. That said, the normal cross-version issues have been with newer server, and older client so I'm not sure old server has been sufficiently tested with new client... Is there a particular reason why you can't upgrade your server side? -- Mike Lindsey -- Keep Your Developer Skills Current with LearnDevNow! The most comprehensive online learning library for Microsoft developers is just $99.99! Visual Studio, SharePoint, SQL - plus HTML5, CSS3, MVC3, Metro Style Apps, more. Free future releases when you subscribe now! http://p.sf.net/sfu/learndevnow-d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA and host checks
On 12/20/2010 03:45 PM, stan wrote: I am working on geting a small distributed system up. I have the serviec checks going back to the master, but have not managed to get host checks passed back to the master yet. With help from the list I found: obsess_over_hosts and set it to 1. It appears that I also need an ochp command. I tried using the script I had set p for service checks, but it appears that the arguments passed to this are different. Here are the arguments that are being passed the the service check comand: Arguments: # $1 = host_name (Short name of host that the service is # associated with) # $2 = svc_description (Description of the service) # $3 = state_string (A string representing the status of # the given service - OK, WARNING, CRITICAL # or UNKNOWN) # $4 = plugin_output (A text string that # What do the one passed to the host check look like? the same, but without the service description. -- Andreas Ericsson andreas.erics...@op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. -- Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA and host checks
On Mon, Dec 20, 2010 at 03:54:45PM +0100, Andreas Ericsson wrote: On 12/20/2010 03:45 PM, stan wrote: I am working on geting a small distributed system up. I have the serviec checks going back to the master, but have not managed to get host checks passed back to the master yet. With help from the list I found: obsess_over_hosts and set it to 1. It appears that I also need an ochp command. I tried using the script I had set p for service checks, but it appears that the arguments passed to this are different. Here are the arguments that are being passed the the service check comand: Arguments: # $1 = host_name (Short name of host that the service is # associated with) # $2 = svc_description (Description of the service) # $3 = state_string (A string representing the status of # the given service - OK, WARNING, CRITICAL # or UNKNOWN) # $4 = plugin_output (A text string that # What do the one passed to the host check look like? the same, but without the service description. OK, here are my 2 command defs: command_line/usr/share/nagios/sbin/submit_service_check_result $HOSTNAME$ '$SERVICEDESC$' $SERVICESTATEID$ '$SERVICEOUTPUT$' and /usr/share/nagios/sbin/submit_host_check_result $HOSTNAME$ $HOSTSTATEID$ '$HOSTOUTPUT$' Do these look correct? And do the *STATEID's need translating to an integer before I pass them to send_nsca? I found an example in the web, that used a case statement to do thism but I do not know if it is needed or not. Thanks for the help/ -- A: Because it messes up the order in which people normally read text. Q: Why is top-posting such a bad thing? A: Top-posting. Q: What is the most annoying thing in e-mail? -- Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA hangs?
Heya guys, i once more need your help :) It seems that my NSCA works absolutely perfect for 2-3 days, with 17 hosts monitored, all sending Passive checks on a 10 second intervall. (The problem may be here, I just want a second opinion) Anyway, it works really good for about 2-3 days but then it appears to shut down. This is the errormsg i get at the NSclient++: 2010-12-12 13:54:44: error:modules\NSCAAgent\NSCAThread.cpp:275: Could not connect to: (keeping IP a secret) 10060: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond. And from the syslogfile i can only see that packet was dropped, 32 seconds old. I cant get real print out because uhm the syslogfile are to heavy to use so it hangs my putty..:) Any ideas? -- Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA hangs?
On Thu, 16 Dec 2010 13:57:38 +, Rikard Dahlberg wrote: It seems that my NSCA works absolutely perfect for 2-3 days, with 17 hosts monitored, all sending Passive checks on a 10 second intervall. (The problem may be here, I just want a second opinion) Anyway, it works really good for about 2-3 days but then it appears to shut down. This is the errormsg i get at the NSclient++: 2010-12-12 13:54:44: error:modulesNSCAAgentNSCAThread.cpp:275: -- Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA - Status is changing after 10 seconds to OK
Friday, November 5, 2010, 4:59:01 AM, you wrote: On Nov 4, 2010, at 4:38 PM, Brandino Andreas wrote: The message appearing in the Status Information when service is again green and OK is a simple OK !!! Were you previously testing something that gave that message? Freshness checks or active checks perhaps? Do you have multiple nagios daemons running at the same time, one running with the old config and one with the new? They'd both write to the same status file so you'd see results like this. -- Marc I have definedbymistakefreshness checks to final (TestMessage10) service definition. The full definition of the service that finally seems to work is: define service{ namegeneric-service active_checks_enabled 1 passive_checks_enabled 1 parallelize_check 1 obsess_over_service 1 check_freshness 0 notifications_enabled 1 event_handler_enabled 1 flap_detection_enabled 1 failure_prediction_enabled 1 process_perf_data 1 retain_status_information 1 retain_nonstatus_information1 is_volatile 0 check_period24x7 max_check_attempts 3 normal_check_interval 10 retry_check_interval2 contact_groups admins notification_optionsw,u,c,r notification_interval 60 notification_period 24x7 register0 } define service{ use generic-service namepassive-service active_checks_enabled 0 passive_checks_enabled 1 register0 is_volatile 0 max_check_attempts 1 check_freshness 0 check_command check_dummy!0 } define service{ use passive-service service_description TestMessage10 host_name NMS register1 flap_detection_enabled 0 contact_groups admins check_period24x7 notification_interval 240 notification_period 24x7 notification_optionsw,u,c,r } Thank you for the help -- The Next 800 Companies to Lead America's Growth: New Video Whitepaper David G. Thomson, author of the best-selling book Blueprint to a Billion shares his insights and actions to help propel your business during the next growth cycle. Listen Now! http://p.sf.net/sfu/SAP-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null --- --- --- Brandino Andreas ampra...@gmail.com --- --- --- -- The Next 800 Companies to Lead America's Growth: New Video Whitepaper David G. Thomson, author of the best-selling book Blueprint to a Billion shares his insights and actions to help propel your business during the next growth cycle. Listen Now! http://p.sf.net/sfu/SAP-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA - Status is changing after 10 seconds to OK
Hi all, I am trying to configure nagios with NSCA. NSCA is successfully receiving the messages (sent with send_nsca), updates the services state and the message info. The problem is that after 10 seconds, the service is returning to OK state; in the meantime I haven't send any other messages. How can I make the service keep that last state until it receives a new message? The definition of the services I am using follows: define service{ use generic-service namepassive-service active_checks_enabled 0 passive_checks_enabled 1 register0 is_volatile 0 check_period24x7 max_check_attempts 1 normal_check_interval 5 retry_check_interval1 check_freshness 0 freshness_threshold 5 contact_groups admins check_command check_dummy!0 notification_interval 120 notification_period 24x7 notification_optionsw,u,c,r } define service{ use passive-service service_description TestMessage10 host_name NMS register1 contact_groups admins check_freshness 1 freshness_threshold 60 check_period24x7 } Thank you -- The Next 800 Companies to Lead America's Growth: New Video Whitepaper David G. Thomson, author of the best-selling book Blueprint to a Billion shares his insights and actions to help propel your business during the next growth cycle. Listen Now! http://p.sf.net/sfu/SAP-dev2dev___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA - Status is changing after 10 seconds to OK
Hi all, I am trying to configure nagios with NSCA. NSCA is successfully receiving the messages (sent with send_nsca), updates the services state and the message info. The problem is that after 10 seconds, the service is returning to OK state; in the meantime I haven't send any other messages. Make sure that no new state info arrived from the remote nsca sender, check the NAGIOS event log,for details M. -- The Next 800 Companies to Lead America's Growth: New Video Whitepaper David G. Thomson, author of the best-selling book Blueprint to a Billion shares his insights and actions to help propel your business during the next growth cycle. Listen Now! http://p.sf.net/sfu/SAP-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA - Status is changing after 10 seconds to OK
Thursday, November 4, 2010, 10:18:33 AM, you wrote: Hi all, I am trying to configure nagios with NSCA. NSCA is successfully receiving the messages (sent with send_nsca), updates the services state and the message info. The problem is that after 10 seconds, the service is returning to OK state; in the meantime I haven't send any other messages. Make sure that no new state info arrived from the remote nsca sender, check the NAGIOS event log,for details M. Its a test environment, only I am sending test messages to NSCA service. Event log is not showing a new received message... -- The Next 800 Companies to Lead America's Growth: New Video Whitepaper David G. Thomson, author of the best-selling book Blueprint to a Billion shares his insights and actions to help propel your business during the next growth cycle. Listen Now! http://p.sf.net/sfu/SAP-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null --- --- --- Brandino Andreas ampra...@gmail.com --- --- --- -- The Next 800 Companies to Lead America's Growth: New Video Whitepaper David G. Thomson, author of the best-selling book Blueprint to a Billion shares his insights and actions to help propel your business during the next growth cycle. Listen Now! http://p.sf.net/sfu/SAP-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA - Status is changing after 10 seconds to OK
On 04/11/10 08:30, Brandino Andreas wrote: Thursday, November 4, 2010, 10:18:33 AM, you wrote: Hi all, I am trying to configure nagios with NSCA. NSCA is successfully receiving the messages (sent with send_nsca), updates the services state and the message info. The problem is that after 10 seconds, the service is returning to OK state; in the meantime I haven't send any other messages. Its a test environment, only I am sending test messages to NSCA service. Event log is not showing a new received message... Make sure that the nagios server is considering the check as passive and not trying to actively check it also . I've had this when both the nagios master and the remote server were checking a service and thus providing conflicting data. Assaf -- Never,Ever Cut A Deal With a Dragon Next year I will be doing the London to Paris bike ride to raise money for the DogTrust (www.dogstrust.co.uk) . Please Sponsor me at http://www.justgiving.com/Assaf-Flatto -- The Next 800 Companies to Lead America's Growth: New Video Whitepaper David G. Thomson, author of the best-selling book Blueprint to a Billion shares his insights and actions to help propel your business during the next growth cycle. Listen Now! http://p.sf.net/sfu/SAP-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA - Status is changing after 10 seconds to OK
The message appearing in the Status Information when service is again green and OK is a simple OK !!! Thursday, November 4, 2010, 11:34:05 AM, you wrote: ... ... Its a test environment, only I am sending test messages to NSCA service. Event log is not showing a new received message... Which message string is displayed in the 'Status information' , row, for the particular service when it is being set to OK again ? M. --- --- --- Brandino Andreas ampra...@gmail.com --- --- --- -- The Next 800 Companies to Lead America's Growth: New Video Whitepaper David G. Thomson, author of the best-selling book Blueprint to a Billion shares his insights and actions to help propel your business during the next growth cycle. Listen Now! http://p.sf.net/sfu/SAP-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA - Status is changing after 10 seconds to OK
On Nov 4, 2010, at 4:38 PM, Brandino Andreas wrote: The message appearing in the Status Information when service is again green and OK is a simple OK !!! Were you previously testing something that gave that message? Freshness checks or active checks perhaps? Do you have multiple nagios daemons running at the same time, one running with the old config and one with the new? They'd both write to the same status file so you'd see results like this. -- Marc -- The Next 800 Companies to Lead America's Growth: New Video Whitepaper David G. Thomson, author of the best-selling book Blueprint to a Billion shares his insights and actions to help propel your business during the next growth cycle. Listen Now! http://p.sf.net/sfu/SAP-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA w/ secondary nagios machine monitoring separate network segment
Have you checked the nagios.cfg file and ensured the path for command_file is the same in both the nagios.cfg and nsca.cfg? Nagios should be creating the command file on start up. If they are the same, check that nagios has write permissions to /var/lib/nagios3/rw. Also, the command file is a pipe, not a simple file, so touch isn't good enough, you'll need to use mkfifo or mknod. Jonathon, it was simple newb issue. nagios.cfg needed 'check_external_commands' enabled. I got passive service checks working last night. Many thanks to bashk on irc. Now to get passive host checks working. -Andy -- Nokia and ATT present the 2010 Calling All Innovators-North America contest Create new apps games for the Nokia N8 for consumers in U.S. and Canada $10 million total in prizes - $4M cash, 500 devices, nearly $6M in marketing Develop with Nokia Qt SDK, Web Runtime, or Java and Publish to Ovi Store http://p.sf.net/sfu/nokia-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA w/ secondary nagios machine monitoring separate network segment
I've read this http://nagios.sourceforge.net/docs/3_0/distributed.html a few times, but I'm having a hard time understanding it. I will keep reading it until it makes sense. I'm running Ubuntu 10.04. I've installed nagios and nsca with the package manager. I have most hosts/services on 192.168.2.x network, but there are a few on a 10.0.0.x network behind a gateway machine 192.168.2.200-10.0.0.1. My nagios box is 192.168.2.201 The nsca daemon is running fine. I can do a send_nsca from 192.168.2.200 and get an Okay Packet sent. In my syslog it says: Oct 30 14:04:36 nagios nsca[713]: Connection from 192.168.2.200 port 8853 Oct 30 14:04:36 nagios nsca[713]: Handling the connection... Oct 30 14:04:37 nagios nsca[713]: SERVICE CHECK - Host Name: 'monarch', Service Description: 'TestMessage', Return Code: '0', Output: 'This is a test message' Oct 30 14:04:37 nagios nsca[713]: Command file '/var/lib/nagios3/rw/nagios.cmd' does not exist, attempting to use alternate dump file '/var/run/nagios/nsca.dump' for output Oct 30 14:04:37 nagios nsca[713]: Could not open alternate dump file '/var/run/nagios/nsca.dump' for appending Oct 30 14:04:37 nagios nsca[713]: End of connection... I wonder why nagios.cmd does not exist, if I touch this manually, the error goes away, but it happens again if I restart the nagios service. Does anyone have example config files of a distributed nagios where the only reason that it's distributed is because of network topology, not for redundancy. I'm sorry I don't understand the documentation and need a little more hand holding. -Andy -- Nokia and ATT present the 2010 Calling All Innovators-North America contest Create new apps games for the Nokia N8 for consumers in U.S. and Canada $10 million total in prizes - $4M cash, 500 devices, nearly $6M in marketing Develop with Nokia Qt SDK, Web Runtime, or Java and Publish to Ovi Store http://p.sf.net/sfu/nokia-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA w/ secondary nagios machine monitoring separate network segment
On 10/30/10 13:12, Andy Graybeal wrote: I've read this http://nagios.sourceforge.net/docs/3_0/distributed.html a few times, but I'm having a hard time understanding it. I will keep reading it until it makes sense. I'm running Ubuntu 10.04. I've installed nagios and nsca with the package manager. I have most hosts/services on 192.168.2.x network, but there are a few on a 10.0.0.x network behind a gateway machine 192.168.2.200-10.0.0.1. My nagios box is 192.168.2.201 The nsca daemon is running fine. I can do a send_nsca from 192.168.2.200 and get an Okay Packet sent. In my syslog it says: Oct 30 14:04:36 nagios nsca[713]: Connection from 192.168.2.200 port 8853 Oct 30 14:04:36 nagios nsca[713]: Handling the connection... Oct 30 14:04:37 nagios nsca[713]: SERVICE CHECK - Host Name: 'monarch', Service Description: 'TestMessage', Return Code: '0', Output: 'This is a test message' Oct 30 14:04:37 nagios nsca[713]: Command file '/var/lib/nagios3/rw/nagios.cmd' does not exist, attempting to use alternate dump file '/var/run/nagios/nsca.dump' for output Oct 30 14:04:37 nagios nsca[713]: Could not open alternate dump file '/var/run/nagios/nsca.dump' for appending Oct 30 14:04:37 nagios nsca[713]: End of connection... I wonder why nagios.cmd does not exist, if I touch this manually, the error goes away, but it happens again if I restart the nagios service. Have you checked the nagios.cfg file and ensured the path for command_file is the same in both the nagios.cfg and nsca.cfg? Nagios should be creating the command file on start up. If they are the same, check that nagios has write permissions to /var/lib/nagios3/rw. Also, the command file is a pipe, not a simple file, so touch isn't good enough, you'll need to use mkfifo or mknod. -- Jonathan Angliss -- Nokia and ATT present the 2010 Calling All Innovators-North America contest Create new apps games for the Nokia N8 for consumers in U.S. and Canada $10 million total in prizes - $4M cash, 500 devices, nearly $6M in marketing Develop with Nokia Qt SDK, Web Runtime, or Java and Publish to Ovi Store http://p.sf.net/sfu/nokia-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] nsca packets disappear between xinetd and the daemon
I've got a system that accepts and processes nsca packets from a number of collectors successfully. Everything is using nsca-2.7.2. I added a new collector recently and it's not able to get data into the central nagios system via nsca. I recompiled the send_nsca client with the DEBUG flag enabled and get this output, which looks good: Connected okay... Got init packet from server Initialized encryption routines Done sending data 1 data packet(s) sent to host successfully. Cleaned up encryption routines On the central server, the xinetd logs look like the packets are being accepted: 10/10/1...@11:01:29: START: nsca from=not working collector in question IP 10/10/1...@11:01:29: EXIT: nsca status=0 But then nothing seems to show up the nsca log. All the other collector's data logs in the nsca log very nicely. Where should I look to debug my problem? Andy Ford Network Security Compliance Automation, Wells Fargo Co. 314-600-7025 andrew.f...@wachovia.com -- Beautiful is writing same markup. Internet Explorer 9 supports standards for HTML5, CSS3, SVG 1.1, ECMAScript5, and DOM L2 L3. Spend less time writing and rewriting code and more time creating great experiences on the web. Be a part of the beta today. http://p.sf.net/sfu/beautyoftheweb___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA error: log do not record any data.
Good day, I just installed NSCA. I tried to test it with commands below and seems like it cant carry the actual data/mesg. out puts:: test file: [code] cat test localhosttest abc [/code] File sent: [code] ./src/send_nsca localhost -c sample-config/send_nsca.cfg test 0 data packet(s) sent to host successfully. [/code] Results from /var/log/message:: [code] Jul 16 14:20:09 XXX nsca[20897]: Handling the connection... Jul 16 14:20:09 XXX nsca[20897]: End of connection... [/code] Configurations: [code] grep -v '#' /usr/local/nagios/etc/nsca.cfg | grep ^\w pid_file=/var/run/nsca.pid server_port=5667 nsca_user=nagios nsca_group=nagios debug=1 command_file=/usr/local/nagios/var/rw/nagios.cmd alternate_dump_file=/usr/local/nagios/var/rw/nsca.dump aggregate_writes=0 append_to_file=0 max_packet_age=30 decryption_method=1 [/code] [code] grep -v '#' sample-config/send_nsca.cfg | grep ^\w encryption_method=1 [/code] Please help. _ Game on: Challenge friends to great games on Messenger http://go.microsoft.com/?linkid=9734387-- This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA + NSClient
Hi, I have NSCA configured on my Nagios host, and enabled the necessary plugins on NSClient++ to support NSCA, configure XINETD appropriately inside my NSClient config I have: [NSCA Commands] my_cpu_check=checkCPU warn=80 crit=90 time=20m time=10s time=4 my_mem_check=checkMem MaxWarn=80% MaxCrit=90% ShowAll type=page This is just for testing, I also have in my nagios.cfg: accept_passive_service_checks=1 accept_passive_host_checks=1 In my NSClient Log I can see: 2010-06-24 12:48:44: debug:modules\NSCAAgent\NSCAThread.cpp:205: Executing (from NSCA): my_cpu_check 2010-06-24 12:48:44: debug:NSClient++.cpp:1106: Injecting: checkCPU: warn=80, crit=90, time=20m, time=10s, time=4 2010-06-24 12:48:44: debug:NSClient++.cpp:1142: Injected Result: OK 'OK CPU Load ok.' 2010-06-24 12:48:44: debug:NSClient++.cpp:1143: Injected Performance Result: ''20m'=0%;80;90; '10s'=6%;80;90; '4'=0%;80;90; ' 2010-06-24 12:48:44: debug:modules\NSCAAgent\NSCAThread.cpp:205: Executing (from NSCA): my_mem_check 2010-06-24 12:48:44: debug:NSClient++.cpp:1106: Injecting: checkMem: MaxWarn=80%, MaxCrit=90%, ShowAll, type=page 2010-06-24 12:48:44: debug:NSClient++.cpp:1142: Injected Result: OK 'OK: page file: 8.82G' 2010-06-24 12:48:44: debug:NSClient++.cpp:1143: Injected Performance Result: ''page file %'=45%;80;90; 'page file'=8.81G;15.6;17.59;0;19.5; ' What I want to know is, until now I have been using active checks, and for some servers I want to use passive_checks as well, so that the server updates Nagios. If I have active checks defined within my Nagios installation such as: define service{ use generic-service host_name van-mail01 service_description D - Disk Space check_command check_nt_disk!D!98!99 } Can I modify this to also read from the passive_check, and what would my [NSCA Commands] definition look like? Does the first part have to reflect the service description? How does the information coming from the NSCA Client get mapped to my configured checks? Thank you. The Oppenheimer Group CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA + NSClient
The short answer is yes, the service description you configure on the client nsca message needs to match that of the service description on your nagios server. If you want to migrate to a passive check you need to disable active checks and enable passive ones. You can leave the check command in place and incorporate freshness checks to force an active check if the passive fail. For me I don't want to do active if passive fail. I would rather cut a ticket service stale.The nagios doc clearly shows how to configure passive service checks so give it a read. So you need my_cpu_check to be a service description so nagios knows what to match that incoming nsca message to. Ash From: Mirza Dedic [mailto:mi...@oppy.com] Sent: Thursday, June 24, 2010 2:56 PM To: 'Nagios-Users' Subject: [Nagios-users] NSCA + NSClient Hi, I have NSCA configured on my Nagios host, and enabled the necessary plugins on NSClient++ to support NSCA, configure XINETD appropriately inside my NSClient config I have: [NSCA Commands] my_cpu_check=checkCPU warn=80 crit=90 time=20m time=10s time=4 my_mem_check=checkMem MaxWarn=80% MaxCrit=90% ShowAll type=page This is just for testing, I also have in my nagios.cfg: accept_passive_service_checks=1 accept_passive_host_checks=1 In my NSClient Log I can see: 2010-06-24 12:48:44: debug:modules\NSCAAgent\NSCAThread.cpp:205: Executing (from NSCA): my_cpu_check 2010-06-24 12:48:44: debug:NSClient++.cpp:1106: Injecting: checkCPU: warn=80, crit=90, time=20m, time=10s, time=4 2010-06-24 12:48:44: debug:NSClient++.cpp:1142: Injected Result: OK 'OK CPU Load ok.' 2010-06-24 12:48:44: debug:NSClient++.cpp:1143: Injected Performance Result: ''20m'=0%;80;90; '10s'=6%;80;90; '4'=0%;80;90; ' 2010-06-24 12:48:44: debug:modules\NSCAAgent\NSCAThread.cpp:205: Executing (from NSCA): my_mem_check 2010-06-24 12:48:44: debug:NSClient++.cpp:1106: Injecting: checkMem: MaxWarn=80%, MaxCrit=90%, ShowAll, type=page 2010-06-24 12:48:44: debug:NSClient++.cpp:1142: Injected Result: OK 'OK: page file: 8.82G' 2010-06-24 12:48:44: debug:NSClient++.cpp:1143: Injected Performance Result: ''page file %'=45%;80;90; 'page file'=8.81G;15.6;17.59;0;19.5; ' What I want to know is, until now I have been using active checks, and for some servers I want to use passive_checks as well, so that the server updates Nagios. If I have active checks defined within my Nagios installation such as: define service{ use generic-service host_name van-mail01 service_description D - Disk Space check_command check_nt_disk!D!98!99 } Can I modify this to also read from the passive_check, and what would my [NSCA Commands] definition look like? Does the first part have to reflect the service description? How does the information coming from the NSCA Client get mapped to my configured checks? Thank you. The Oppenheimer Group CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. -- ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA + NSClient
Thanks, after reading the 3.x on passive_checks I get how to configure the service. Now, what would be the benefit of having active/passive checks enabled for a service? Say, it takes 5 minutes for Nagios to process my 80 hosts/600 services, if the service that I am looking to enable passive checks on as well is checked near the end of the 5 minute mark, wouldn't it get an update much sooner having passive checks enabled? That said, NSClient sending the information to Nagios, logically this sounds like it should work like that, or based off: check_result_reaper_frequency=5 max_check_result_reaper_time=30 So within a max of 30 seconds, I should be able to see if that service is UP/DOWN in the Nagios (or the op5 Ninja) interface? Are passive checks spread out like active checks on say, when Nagios starts? Basically, I want to have the alerting tight as possible, if I login to my IIS server and stop the IISADMIN service, I want to be alerted within those 0-30 seconds based on the reaper frequency. The box that I put Nagios on has enough CPU/RAM and fast enough subsystem I/O to build this type of configuration, but I want to make sure the logic above is correct. Thanks. From: Ryan C Ash [mailto:ryan.c.ash.l...@statefarm.com] Sent: June/24/2010 1:08 PM To: Nagios-Users Subject: Re: [Nagios-users] NSCA + NSClient The short answer is yes, the service description you configure on the client nsca message needs to match that of the service description on your nagios server. If you want to migrate to a passive check you need to disable active checks and enable passive ones. You can leave the check command in place and incorporate freshness checks to force an active check if the passive fail. For me I don't want to do active if passive fail. I would rather cut a ticket service stale.The nagios doc clearly shows how to configure passive service checks so give it a read. So you need my_cpu_check to be a service description so nagios knows what to match that incoming nsca message to. Ash From: Mirza Dedic [mailto:mi...@oppy.com] Sent: Thursday, June 24, 2010 2:56 PM To: 'Nagios-Users' Subject: [Nagios-users] NSCA + NSClient Hi, I have NSCA configured on my Nagios host, and enabled the necessary plugins on NSClient++ to support NSCA, configure XINETD appropriately inside my NSClient config I have: [NSCA Commands] my_cpu_check=checkCPU warn=80 crit=90 time=20m time=10s time=4 my_mem_check=checkMem MaxWarn=80% MaxCrit=90% ShowAll type=page This is just for testing, I also have in my nagios.cfg: accept_passive_service_checks=1 accept_passive_host_checks=1 In my NSClient Log I can see: 2010-06-24 12:48:44: debug:modules\NSCAAgent\NSCAThread.cpp:205: Executing (from NSCA): my_cpu_check 2010-06-24 12:48:44: debug:NSClient++.cpp:1106: Injecting: checkCPU: warn=80, crit=90, time=20m, time=10s, time=4 2010-06-24 12:48:44: debug:NSClient++.cpp:1142: Injected Result: OK 'OK CPU Load ok.' 2010-06-24 12:48:44: debug:NSClient++.cpp:1143: Injected Performance Result: ''20m'=0%;80;90; '10s'=6%;80;90; '4'=0%;80;90; ' 2010-06-24 12:48:44: debug:modules\NSCAAgent\NSCAThread.cpp:205: Executing (from NSCA): my_mem_check 2010-06-24 12:48:44: debug:NSClient++.cpp:1106: Injecting: checkMem: MaxWarn=80%, MaxCrit=90%, ShowAll, type=page 2010-06-24 12:48:44: debug:NSClient++.cpp:1142: Injected Result: OK 'OK: page file: 8.82G' 2010-06-24 12:48:44: debug:NSClient++.cpp:1143: Injected Performance Result: ''page file %'=45%;80;90; 'page file'=8.81G;15.6;17.59;0;19.5; ' What I want to know is, until now I have been using active checks, and for some servers I want to use passive_checks as well, so that the server updates Nagios. If I have active checks defined within my Nagios installation such as: define service{ use generic-service host_name van-mail01 service_description D - Disk Space check_command check_nt_disk!D!98!99 } Can I modify this to also read from the passive_check, and what would my [NSCA Commands] definition look like? Does the first part have to reflect the service description? How does the information coming from the NSCA Client get mapped to my configured checks? Thank you. The Oppenheimer Group CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. The Oppenheimer Group CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited
[Nagios-users] nsca daemon performance
I'm having a problem with my nsca daemon (v. 2.7.2) in a distributed monitoring setup (nagios 3.2.1 on SLES 10.2) and I was wondering if anyone else has seen something like this. nsca seems to be dropping incoming passives in an inconsistent way, sometimes keeping up, other times seeming to skip 1,2, or even sometimes 3 incoming passive checks. I've checked the collector, and it seems to be sending the nsca packets very solidly on the 5 minute normal_check_interval. These are the times that nagios logged a PASSIVE SERVICE CHECK for a typical service on a single host since last midnight: 00:03:55 00:08:56 00:13:50 00:18:51 00:28:53 00:38:55 00:43:56 00:48:50 00:53:51 00:58:52 01:08:54 01:13:55 01:18:49 01:23:50 01:38:53 01:43:54 01:48:55 01:53:49 01:58:50 02:18:54 02:23:55 02:28:49 02:33:50 02:53:47 02:58:55 03:03:56 03:08:50 03:23:46 03:28:54 03:33:55 03:38:56 03:43:50 03:48:51 03:58:46 04:03:54 04:08:55 04:13:56 04:18:50 04:38:54 04:43:55 04:48:56 04:53:50 04:58:51 05:03:52 05:13:54 05:18:55 05:23:56 05:28:50 05:48:54 05:53:55 05:58:49 06:03:50 06:23:54 06:28:48 06:33:49 06:38:50 07:03:55 07:08:49 07:13:50 07:33:54 07:38:55 07:43:56 07:48:50 08:03:46 08:08:54 08:13:55 08:18:49 08:23:50 08:33:52 08:43:54 08:48:55 08:53:56 08:58:50 09:08:45 09:18:54 09:23:55 09:28:49 09:33:50 09:38:44 09:53:54 09:58:55 10:03:49 10:08:50 10:28:54 10:33:55 10:38:49 10:43:50 11:08:55 11:13:56 11:18:50 11:33:46 11:38:54 11:43:55 11:48:56 11:53:50 12:13:54 12:18:55 Andy Ford Network Security Compliance Automation, Wells Fargo Co. 314-600-7025 andrew.f...@wachovia.com -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] nsca daemon performance
If you're running the nsca daemon in xinetd you can up the number of threads and connections. I had a similar issue as I started adding services and eventually fixed it by adding these lines to /etc/xinetd.d/nsca instances = UNLIMITED per_source = UNLIMITED cps = 5000 0 On Tue, May 4, 2010 at 2:53 PM, Ford, Andy andrew.f...@wachovia.com wrote: I’m having a problem with my nsca daemon (v. 2.7.2) in a distributed monitoring setup (nagios 3.2.1 on SLES 10.2) and I was wondering if anyone else has seen something like this. nsca seems to be dropping incoming passives in an inconsistent way, sometimes keeping up, other times seeming to skip 1,2, or even sometimes 3 incoming passive checks. I’ve checked the collector, and it seems to be sending the nsca packets very solidly on the 5 minute normal_check_interval. These are the times that nagios logged a “PASSIVE SERVICE CHECK” for a typical service on a single host since last midnight: 00:03:55 00:08:56 00:13:50 00:18:51 00:28:53 00:38:55 00:43:56 00:48:50 00:53:51 00:58:52 01:08:54 01:13:55 01:18:49 01:23:50 01:38:53 01:43:54 01:48:55 01:53:49 01:58:50 02:18:54 02:23:55 02:28:49 02:33:50 02:53:47 02:58:55 03:03:56 03:08:50 03:23:46 03:28:54 03:33:55 03:38:56 03:43:50 03:48:51 03:58:46 04:03:54 04:08:55 04:13:56 04:18:50 04:38:54 04:43:55 04:48:56 04:53:50 04:58:51 05:03:52 05:13:54 05:18:55 05:23:56 05:28:50 05:48:54 05:53:55 05:58:49 06:03:50 06:23:54 06:28:48 06:33:49 06:38:50 07:03:55 07:08:49 07:13:50 07:33:54 07:38:55 07:43:56 07:48:50 08:03:46 08:08:54 08:13:55 08:18:49 08:23:50 08:33:52 08:43:54 08:48:55 08:53:56 08:58:50 09:08:45 09:18:54 09:23:55 09:28:49 09:33:50 09:38:44 09:53:54 09:58:55 10:03:49 10:08:50 10:28:54 10:33:55 10:38:49 10:43:50 11:08:55 11:13:56 11:18:50 11:33:46 11:38:54 11:43:55 11:48:56 11:53:50 12:13:54 12:18:55 Andy Ford Network Security Compliance Automation, Wells Fargo Co. 314-600-7025 andrew.f...@wachovia.com -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] nsca daemon performance
Thanks, Noel, that's very good advice for an xinetd-propelled nsca daemon, but this setup is using a standalone nsca. I used the xinetd approach for many years, maybe I should switch back... From: Noel Platzke [mailto:neuf...@gmail.com] Sent: Tuesday, May 04, 2010 2:51 PM To: Nagios Users List Subject: Re: [Nagios-users] nsca daemon performance If you're running the nsca daemon in xinetd you can up the number of threads and connections. I had a similar issue as I started adding services and eventually fixed it by adding these lines to /etc/xinetd.d/nsca instances = UNLIMITED per_source = UNLIMITED cps = 5000 0 On Tue, May 4, 2010 at 2:53 PM, Ford, Andy andrew.f...@wachovia.commailto:andrew.f...@wachovia.com wrote: I'm having a problem with my nsca daemon (v. 2.7.2) in a distributed monitoring setup (nagios 3.2.1 on SLES 10.2) and I was wondering if anyone else has seen something like this. nsca seems to be dropping incoming passives in an inconsistent way, sometimes keeping up, other times seeming to skip 1,2, or even sometimes 3 incoming passive checks. I've checked the collector, and it seems to be sending the nsca packets very solidly on the 5 minute normal_check_interval. These are the times that nagios logged a PASSIVE SERVICE CHECK for a typical service on a single host since last midnight: 00:03:55 00:08:56 00:13:50 00:18:51 00:28:53 00:38:55 00:43:56 00:48:50 00:53:51 00:58:52 01:08:54 01:13:55 01:18:49 01:23:50 01:38:53 01:43:54 01:48:55 01:53:49 01:58:50 02:18:54 02:23:55 02:28:49 02:33:50 02:53:47 02:58:55 03:03:56 03:08:50 03:23:46 03:28:54 03:33:55 03:38:56 03:43:50 03:48:51 03:58:46 04:03:54 04:08:55 04:13:56 04:18:50 04:38:54 04:43:55 04:48:56 04:53:50 04:58:51 05:03:52 05:13:54 05:18:55 05:23:56 05:28:50 05:48:54 05:53:55 05:58:49 06:03:50 06:23:54 06:28:48 06:33:49 06:38:50 07:03:55 07:08:49 07:13:50 07:33:54 07:38:55 07:43:56 07:48:50 08:03:46 08:08:54 08:13:55 08:18:49 08:23:50 08:33:52 08:43:54 08:48:55 08:53:56 08:58:50 09:08:45 09:18:54 09:23:55 09:28:49 09:33:50 09:38:44 09:53:54 09:58:55 10:03:49 10:08:50 10:28:54 10:33:55 10:38:49 10:43:50 11:08:55 11:13:56 11:18:50 11:33:46 11:38:54 11:43:55 11:48:56 11:53:50 12:13:54 12:18:55 Andy Ford Network Security Compliance Automation, Wells Fargo Co. 314-600-7025 andrew.f...@wachovia.commailto:andrew.f...@wachovia.com -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.netmailto:Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA and Windows
Hi Gareth i remove the semi-colon on the last lines for example, i think the format is lost in the e-mail. [External Scripts] check_cpu=scripts\check_cpu.exe and [NSCA Commands] C:\NSCLIENT++\scripts\check_cpu.exe greetz Olly Von: Gareth Fletcher [gareth.fletc...@onenet.co.nz] Gesendet: Montag, 29. Maerz 2010 03:08 An: Klein, Oliver Betreff: RE: NSCA and Windows Hi Olly, Did you manage to find a solution for this? All the commands you pated below from your NSC.ini are commented out, you need to remove the semi-colon beforehand. Cheers Gareth Mit freundlichen Gruessen Oliver Klein System Engineer Tel:02506 / 93 20 214 Mobil: 0163-5932004 E-Mail: okl...@nccms.de NCC Noethen Computer Center GmbH Wolbecker Windmuehle 55 48167 Muenster Tel: 02506 93 20 0 FAX: 02506 93 20 20 E-Mail: i...@nccms.de Homepage: www.nccms.de Amtsgericht Muenster HRB 5036Geschaeftsfuehrer: Ralf Bakenecker Uwe-Christian Surma _ Dieses E-Mail ist nur fuer den Empfaenger bestimmt, an den es gerichtet ist und kann vertrauliches bzw. unter das Berufsgeheimnis fallendes Material enthalten. Jegliche darin enthaltene Ansicht oder Meinungs- aeusserung ist die des Autors und stellt nicht notwendigerweise die Ansicht oder Meinung von NCC GmbH dar. Sind Sie nicht der Empfaenger, so haben Sie diese E-Mail irrtuemlich erhalten und jegliche Verwendung, Veroeffentlichung, Weiterleitung, Abschrift oder jeglicher Druck dieser E-Mail ist strengstens untersagt. Weder NCC GmbH noch der Absender (Oliver Klein) uebernehmen die Haftung fuer Viren; es obliegt Ihrer Verantwortung, die E-Mail und deren Anhaenge auf Viren zu pruefen. Anhaenge: _ Versand am 29.03.2010 09:21 von Klein Oliver Validation-Code: 4163033227071 -Original Message- From: Klein, Oliver [mailto:okl...@nccms.de] Sent: Monday, 29 March 2010 3:40 a.m. To: Nagios Users List Subject: Re: [Nagios-users] NSCA and Windows Hi plz help, nobody use the tnt_plugins with nsclient++? regards olly Mit freundlichen Gruessen Oliver Klein System Engineer Tel:02506 / 93 20 214 Mobil: 0163-5932004 E-Mail: okl...@nccms.de NCC Noethen Computer Center GmbH Wolbecker Windmuehle 55 48167 Muenster Tel: 02506 93 20 0 FAX: 02506 93 20 20 E-Mail: i...@nccms.de Homepage: www.nccms.de Amtsgericht Muenster HRB 5036Geschaeftsfuehrer: Ralf Bakenecker Uwe-Christian Surma _ Dieses E-Mail ist nur fuer den Empfaenger bestimmt, an den es gerichtet ist und kann vertrauliches bzw. unter das Berufsgeheimnis fallendes Material enthalten. Jegliche darin enthaltene Ansicht oder Meinungs- aeusserung ist die des Autors und stellt nicht notwendigerweise die Ansicht oder Meinung von NCC GmbH dar. Sind Sie nicht der Empfaenger, so haben Sie diese E-Mail irrtuemlich erhalten und jegliche Verwendung, Veroeffentlichung, Weiterleitung, Abschrift oder jeglicher Druck dieser E-Mail ist strengstens untersagt. Weder NCC GmbH noch der Absender (Oliver Klein) uebernehmen die Haftung fuer Viren; es obliegt Ihrer Verantwortung, die E-Mail und deren Anhaenge auf Viren zu pruefen. Anhaenge: _ Versand am 28.03.2010 16:40 von Klein Oliver Validation-Code: 4050430246410 Von: Klein, Oliver [okl...@nccms.de] Gesendet: Freitag, 26. Maerz 2010 15:13 An: Nagios Users List Betreff: Re: [Nagios-users] NSCA and Windows [cid:ncc-logo.jpg] Hi Kevin, thx i think that is what we need. But I have some problems to get this plugins work with nsclient. Any idea about this? This is the error massage in syslog on the nagios server Mar 26 15:04:41 nagios nagios: EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;sonicwall2;C:\NSCLIENT++\scripts\check_cpu.exe;3;Command was not found: Mar 26 15:04:41 nagios nagios: Warning: Passive check result was received for service 'C:\NSCLIENT++\scripts\check_cpu.exe' on host 'sonicwall2', but the service could not be found! So I define the following option in the nsc.ini [External Scripts] ;check_es_long=scripts\long.bat ;check_es_ok=scripts\ok.bat ;check_es_nok=scripts\nok.bat ;check_vbs_sample=cscript.exe //T:30 //NoLogo scripts\check_vb.vbs ;check_powershell_warn=cmd /c echo scripts\powershell.ps1 | powershell.exe -command - check_cpu=scripts\check_cpu.exe [NSCA Commands] ;my_cpu_check=checkCPU warn=80 crit=90 time=20m time=10s time=4 ;my_mem_check=checkMem MaxWarn=80% MaxCrit=90% ShowAll type=page ;my_svc_check=checkServiceState CheckAll exclude=wampmysqld exclude=MpfService ;host_check=check_ok C:\NSCLIENT++\scripts\check_cpu.exe I tested
Re: [Nagios-users] NSCA and Windows
Hi plz help, nobody use the tnt_plugins with nsclient++? regards olly Mit freundlichen Gruessen Oliver Klein System Engineer Tel:02506 / 93 20 214 Mobil: 0163-5932004 E-Mail: okl...@nccms.de NCC Noethen Computer Center GmbH Wolbecker Windmuehle 55 48167 Muenster Tel: 02506 93 20 0 FAX: 02506 93 20 20 E-Mail: i...@nccms.de Homepage: www.nccms.de Amtsgericht Muenster HRB 5036Geschaeftsfuehrer: Ralf Bakenecker Uwe-Christian Surma _ Dieses E-Mail ist nur fuer den Empfaenger bestimmt, an den es gerichtet ist und kann vertrauliches bzw. unter das Berufsgeheimnis fallendes Material enthalten. Jegliche darin enthaltene Ansicht oder Meinungs- aeusserung ist die des Autors und stellt nicht notwendigerweise die Ansicht oder Meinung von NCC GmbH dar. Sind Sie nicht der Empfaenger, so haben Sie diese E-Mail irrtuemlich erhalten und jegliche Verwendung, Veroeffentlichung, Weiterleitung, Abschrift oder jeglicher Druck dieser E-Mail ist strengstens untersagt. Weder NCC GmbH noch der Absender (Oliver Klein) uebernehmen die Haftung fuer Viren; es obliegt Ihrer Verantwortung, die E-Mail und deren Anhaenge auf Viren zu pruefen. Anhaenge: _ Versand am 28.03.2010 16:40 von Klein Oliver Validation-Code: 4050430246410 Von: Klein, Oliver [okl...@nccms.de] Gesendet: Freitag, 26. Maerz 2010 15:13 An: Nagios Users List Betreff: Re: [Nagios-users] NSCA and Windows [cid:ncc-logo.jpg] Hi Kevin, thx i think that is what we need. But I have some problems to get this plugins work with nsclient. Any idea about this? This is the error massage in syslog on the nagios server Mar 26 15:04:41 nagios nagios: EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;sonicwall2;C:\NSCLIENT++\scripts\check_cpu.exe;3;Command was not found: Mar 26 15:04:41 nagios nagios: Warning: Passive check result was received for service 'C:\NSCLIENT++\scripts\check_cpu.exe' on host 'sonicwall2', but the service could not be found! So I define the following option in the nsc.ini [External Scripts] ;check_es_long=scripts\long.bat ;check_es_ok=scripts\ok.bat ;check_es_nok=scripts\nok.bat ;check_vbs_sample=cscript.exe //T:30 //NoLogo scripts\check_vb.vbs ;check_powershell_warn=cmd /c echo scripts\powershell.ps1 | powershell.exe -command - check_cpu=scripts\check_cpu.exe [NSCA Commands] ;my_cpu_check=checkCPU warn=80 crit=90 time=20m time=10s time=4 ;my_mem_check=checkMem MaxWarn=80% MaxCrit=90% ShowAll type=page ;my_svc_check=checkServiceState CheckAll exclude=wampmysqld exclude=MpfService ;host_check=check_ok C:\NSCLIENT++\scripts\check_cpu.exe I tested it without path and other variations but with no luck. I hope someone give me the right hint for the correct syntax Best regards nice weekend Olly Mit freundlichen Gruessen Oliver Klein System Engineer Tel: 02506 / 93 20 214 Mobil: 0163-5932004 E-Mail: okl...@nccms.de NCC Noethen Computer Center GmbH Wolbecker Windmuehle 55 48167 Muenster Fon: 02506 / 93 20 0 Fax: 02506 / 93 20 20 Geschaeftsfuehrer: Ralf Bakenecker Uwe-Christian Surma Amtsgericht Muenster HRB 5036 i...@nccms.demailto:i...@nccms.de http://www.nccms.de [cid:compass-logo.jpg] Gesellschafter der Computer Compass-Gruppe Dieses E-Mail ist nur fuer den Empfaenger bestimmt, an den es gerichtet ist und kann vertrauliches bzw. unter das Berufsgeheimnis fallendes Material enthalten. Jegliche darin enthaltene Ansicht oder Meinungs- aeusserung ist die des Autors und stellt nicht notwendigerweise die Ansicht oder Meinung von NCC GmbH dar. Sind Sie nicht der Empfaenger, so haben Sie diese E-Mail irrtuemlich erhalten und jegliche Verwendung, Veroeffentlichung, Weiterleitung, Abschrift oder jeglicher Druck dieser E-Mail ist strengstens untersagt. Weder NCC GmbH noch der Absender (Oliver Klein) uebernehmen die Haftung fuer Viren; es obliegt Ihrer Verantwortung, die E-Mail und deren Anhaenge auf Viren zu pruefen. Anhaenge:. Von: Kevin Keane [mailto:subscript...@kkeane.com] Gesendet: Freitag, 26. Maerz 2010 06:18 An: Nagios Users List Betreff: Re: [Nagios-users] NSCA and Windows The plugins are simply command-line programs. By convention, all plugins should provide documentation when you start them with the –h switch. You can find quite a few plugins on monitoringexchange.org . I wrote a collection of plugins that you might find useful at http://sourceforge.net/projects/tntnagiosplugin/https://sourceforge.net/projects/tntnagiosplugin/ I’m not using them with NSClient, but they should work with it just fine. From: Klein, Oliver [mailto:okl...@nccms.de] Sent: Thursday, March 25, 2010 7:41 AM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] NSCA and Windows [cid:image001.jpg@01CACCF6.29F1E420] Hi List i have a question
Re: [Nagios-users] NSCA and Windows
Hi Kevin, thx i think that is what we need. But I have some problems to get this plugins work with nsclient. Any idea about this? This is the error massage in syslog on the nagios server Mar 26 15:04:41 nagios nagios: EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;sonicwall2;C:\NSCLIENT++\scripts\check_cpu.exe;3;Command was not found: Mar 26 15:04:41 nagios nagios: Warning: Passive check result was received for service 'C:\NSCLIENT++\scripts\check_cpu.exe' on host 'sonicwall2', but the service could not be found! So I define the following option in the nsc.ini [External Scripts] ;check_es_long=scripts\long.bat ;check_es_ok=scripts\ok.bat ;check_es_nok=scripts\nok.bat ;check_vbs_sample=cscript.exe //T:30 //NoLogo scripts\check_vb.vbs ;check_powershell_warn=cmd /c echo scripts\powershell.ps1 | powershell.exe -command - check_cpu=scripts\check_cpu.exe [NSCA Commands] ;my_cpu_check=checkCPU warn=80 crit=90 time=20m time=10s time=4 ;my_mem_check=checkMem MaxWarn=80% MaxCrit=90% ShowAll type=page ;my_svc_check=checkServiceState CheckAll exclude=wampmysqld exclude=MpfService ;host_check=check_ok C:\NSCLIENT++\scripts\check_cpu.exe I tested it without path and other variations but with no luck. I hope someone give me the right hint for the correct syntax Best regards nice weekend Olly Mit freundlichen Gruessen Oliver Klein System Engineer Tel:02506 / 93 20 214 Mobil: 0163-5932004 E-Mail: okl...@nccms.de NCC Noethen Computer Center GmbH Wolbecker Windmuehle 55 48167 Muenster Tel: 02506 93 20 0 FAX: 02506 93 20 20 E-Mail: i...@nccms.de Homepage: www.nccms.de Amtsgericht Muenster HRB 5036Geschaeftsfuehrer: Ralf Bakenecker Uwe-Christian Surma _ Dieses E-Mail ist nur fuer den Empfaenger bestimmt, an den es gerichtet ist und kann vertrauliches bzw. unter das Berufsgeheimnis fallendes Material enthalten. Jegliche darin enthaltene Ansicht oder Meinungs- aeusserung ist die des Autors und stellt nicht notwendigerweise die Ansicht oder Meinung von NCC GmbH dar. Sind Sie nicht der Empfaenger, so haben Sie diese E-Mail irrtuemlich erhalten und jegliche Verwendung, Veroeffentlichung, Weiterleitung, Abschrift oder jeglicher Druck dieser E-Mail ist strengstens untersagt. Weder NCC GmbH noch der Absender (Oliver Klein) uebernehmen die Haftung fuer Viren; es obliegt Ihrer Verantwortung, die E-Mail und deren Anhaenge auf Viren zu pruefen. Anhaenge: _ Versand am 26.03.2010 15:13 von Klein Oliver Validation-Code: 3623422302211 Von: Kevin Keane [mailto:subscript...@kkeane.com] Gesendet: Freitag, 26. Maerz 2010 06:18 An: Nagios Users List Betreff: Re: [Nagios-users] NSCA and Windows The plugins are simply command-line programs. By convention, all plugins should provide documentation when you start them with the -h switch. You can find quite a few plugins on monitoringexchange.org . I wrote a collection of plugins that you might find useful at http://sourceforge.net/projects/tntnagiosplugin/https://sourceforge.net/projects/tntnagiosplugin/ I'm not using them with NSClient, but they should work with it just fine. From: Klein, Oliver [mailto:okl...@nccms.de] Sent: Thursday, March 25, 2010 7:41 AM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] NSCA and Windows [cid:image001.jpg@01CACCF6.29F1E420] Hi List i have a question about Windows and NSCA. We use the NSclient with NCSA. In the nsclient config file there´re some checks descriped and this works well. So what we want to know is, where we can find more plugins? There´s a folder in the nsclient directory modules maybe we can use these plugins. Our problem is, that we don´t know how to use these plugins, like the syntax. Any help was appreciated kind regards Olly Mit freundlichen Gruessen Oliver Klein System Engineer Tel: 02506 / 93 20 214 Mobil: 0163-5932004 E-Mail: okl...@nccms.demailto:okl...@nccms.de NCC Noethen Computer Center GmbH Wolbecker Windmuehle 55 48167 Muenster Fon: 02506 / 93 20 0 Fax: 02506 / 93 20 20 Geschaeftsfuehrer: Ralf Bakenecker Uwe-Christian Surma Amtsgericht Muenster HRB 5036 i...@nccms.demailto:i...@nccms.de http://www.nccms.de [cid:image002.jpg@01CACCF6.29F1E420] Gesellschafter der Computer Compass-Gruppe Dieses E-Mail ist nur fuer den Empfaenger bestimmt, an den es gerichtet ist und kann vertrauliches bzw. unter das Berufsgeheimnis fallendes Material enthalten. Jegliche darin enthaltene Ansicht oder Meinungs- aeusserung ist die des Autors und stellt nicht notwendigerweise die Ansicht oder Meinung von NCC GmbH dar. Sind Sie nicht der Empfaenger, so haben Sie diese E-Mail irrtuemlich erhalten und jegliche Verwendung, Veroeffentlichung, Weiterleitung, Abschrift oder jeglicher Druck dieser E-Mail ist strengstens untersagt. Weder NCC GmbH noch
Re: [Nagios-users] NSCA and Windows
On Fri, Mar 26, 2010 at 3:13 PM, Klein, Oliver okl...@nccms.de wrote: Hi Kevin, thx i think that is what we need. But I have some problems to get this plugins work with nsclient. Any idea about this? This is the error massage in syslog on the nagios server Mar 26 15:04:41 nagios nagios: EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;sonicwall2;C:\NSCLIENT++\scripts\check_cpu.exe;3;Command was not found: Mar 26 15:04:41 nagios nagios: Warning: Passive check result was received for service 'C:\NSCLIENT++\scripts\check_cpu.exe' on host 'sonicwall2', but the service could not be found! Actually, that is quite a good error message. So I define the following option in the nsc.ini [External Scripts] ;check_es_long=scripts\long.bat ;check_es_ok=scripts\ok.bat ;check_es_nok=scripts\nok.bat ;check_vbs_sample=cscript.exe //T:30 //NoLogo scripts\check_vb.vbs ;check_powershell_warn=cmd /c echo scripts\powershell.ps1 | powershell.exe -command - check_cpu=scripts\check_cpu.exe What you have defined is the handler 'check_cpu' In this section [External Scripts]. Those are for NRPE if I recall correctly although I never used that section for NRPE, I use [NRPE Handlers] instead for my custom commands. [NSCA Commands] ;my_cpu_check=checkCPU warn=80 crit=90 time=20m time=10s time=4 ;my_mem_check=checkMem MaxWarn=80% MaxCrit=90% ShowAll type=page ;my_svc_check=checkServiceState CheckAll exclude=wampmysqld exclude=MpfService ;host_check=check_ok C:\NSCLIENT++\scripts\check_cpu.exe and here you have not defined anything, there is no LHS=RHS combination. I suppose that what you meant is check_cpu=scripts\check_cpu.exe. Restart the nsclientpp service and try again. -- groeten, natxo -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA and Windows
hi, now I realize that you are using nsca. I have never used that, so my previous posting may not work at all. I misread your question, sorry. -- groeten, natxo -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA and Windows
Hi List i have a question about Windows and NSCA. We use the NSclient with NCSA. In the nsclient config file there´re some checks descriped and this works well. So what we want to know is, where we can find more plugins? There´s a folder in the nsclient directory modules maybe we can use these plugins. Our problem is, that we don´t know how to use these plugins, like the syntax. Any help was appreciated kind regards Olly Mit freundlichen Gruessen Oliver Klein System Engineer Tel:02506 / 93 20 214 Mobil: 0163-5932004 E-Mail: okl...@nccms.de NCC Noethen Computer Center GmbH Wolbecker Windmuehle 55 48167 Muenster Tel: 02506 93 20 0 FAX: 02506 93 20 20 E-Mail: i...@nccms.de Homepage: www.nccms.de Amtsgericht Muenster HRB 5036Geschaeftsfuehrer: Ralf Bakenecker Uwe-Christian Surma _ Dieses E-Mail ist nur fuer den Empfaenger bestimmt, an den es gerichtet ist und kann vertrauliches bzw. unter das Berufsgeheimnis fallendes Material enthalten. Jegliche darin enthaltene Ansicht oder Meinungs- aeusserung ist die des Autors und stellt nicht notwendigerweise die Ansicht oder Meinung von NCC GmbH dar. Sind Sie nicht der Empfaenger, so haben Sie diese E-Mail irrtuemlich erhalten und jegliche Verwendung, Veroeffentlichung, Weiterleitung, Abschrift oder jeglicher Druck dieser E-Mail ist strengstens untersagt. Weder NCC GmbH noch der Absender (Oliver Klein) uebernehmen die Haftung fuer Viren; es obliegt Ihrer Verantwortung, die E-Mail und deren Anhaenge auf Viren zu pruefen. Anhaenge: _ Versand am 25.03.2010 15:40 von Klein Oliver Validation-Code: 3511017320244 inline: compass-logo.jpginline: ncc-logo.jpg-- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA and Windows
The plugins are simply command-line programs. By convention, all plugins should provide documentation when you start them with the -h switch. You can find quite a few plugins on monitoringexchange.org . I wrote a collection of plugins that you might find useful at http://sourceforge.net/projects/tntnagiosplugin/https://sourceforge.net/projects/tntnagiosplugin/ I'm not using them with NSClient, but they should work with it just fine. From: Klein, Oliver [mailto:okl...@nccms.de] Sent: Thursday, March 25, 2010 7:41 AM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] NSCA and Windows [cid:image001.jpg@01CACC68.A7297EA0] Hi List i have a question about Windows and NSCA. We use the NSclient with NCSA. In the nsclient config file there´re some checks descriped and this works well. So what we want to know is, where we can find more plugins? There´s a folder in the nsclient directory modules maybe we can use these plugins. Our problem is, that we don´t know how to use these plugins, like the syntax. Any help was appreciated kind regards Olly Mit freundlichen Grüßen Oliver Klein System Engineer Tel: 02506 / 93 20 214 Mobil: 0163-5932004 E-Mail: okl...@nccms.de NCC Nöthen Computer Center GmbH Wolbecker Windmühle 55 48167 Münster Fon: 02506 / 93 20 0 Fax: 02506 / 93 20 20 Geschäftsführer: Ralf Bakenecker Uwe-Christian Surma Amtsgericht Münster HRB 5036 i...@nccms.demailto:i...@nccms.de http://www.nccms.de [cid:image002.jpg@01CACC68.A7297EA0] Gesellschafter der Computer Compass-Gruppe Dieses E-Mail ist nur für den Empfänger bestimmt, an den es gerichtet ist und kann vertrauliches bzw. unter das Berufsgeheimnis fallendes Material enthalten. Jegliche darin enthaltene Ansicht oder Meinungs- äußerung ist die des Autors und stellt nicht notwendigerweise die Ansicht oder Meinung von NCC GmbH dar. Sind Sie nicht der Empfänger, so haben Sie diese E-Mail irrtümlich erhalten und jegliche Verwendung, Veröffentlichung, Weiterleitung, Abschrift oder jeglicher Druck dieser E-Mail ist strengstens untersagt. Weder NCC GmbH noch der Absender (Oliver Klein) übernehmen die Haftung für Viren; es obliegt Ihrer Verantwortung, die E-Mail und deren Anhänge auf Viren zu prüfen. Anhänge:. inline: image001.jpginline: image002.jpg-- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA Client
Hi all, I'm looking for the nsca_client package for 'SunOS 5.9 Generic_112233-07 sun4u sparc SUNW,UltraAX-i2' which I can install with pkgadd. thanks in advance Richard -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA Client - SOLUTION
On 3/22/10 10:55 AM Richard Gliebe wrote: Hi all, found it on http://www.monitoringexchange.org Richard -- Download Intel#174; Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA checks not appear on nagios
Hello I'm trying to setup a distributed nagios using complied nagios 3.2.0 . I set up the NSCA and i can see the result being delivered to the nagios (example below) : remote nagios: BEGIN MACRO PROCESSING *** [1265118755.249654] [2048.1] [pid=12343] Processing: '/usr/local/nagios/libexec/eventhandlers/submit_check_result $HOSTNAME$ '$SERVICEDESC$' $SERVICESTATE$ '$SERVICEOUTPUT$'' [1265118755.249670] [2048.1] [pid=12343] Done. Final output: '/usr/local/nagios/libexec/eventhandlers/submit_check_result foo.bar 'HTTP' CRITICAL 'HTTP CRITICAL - No data received from host'' [1265118755.249691] [2048.1] [pid=12343] END MACRO PROCESSING * [1265118755.249697] [001.0] [pid=12343] my_system() [1265118755.249703] [256.1] [pid=12343] Running command '/usr/local/nagios/libexec/eventhandlers/submit_check_result foo.bar 'HTTP' CRITICAL 'HTTP CRITICAL - No data received from host''... Central Nagios : Connection from xx.xx.xx.xx port 63416 nsca[6405]: Handling the connection... nsca[6405]: SERVICE CHECK - Host Name: 'foo.bar', Service Description: 'HTTP', Return Code: '0', Output: 'HTTP OK: HTTP/1.1 301 Moved Permanently - 461 bytes in 0.081 second response time' nsca[6405]: End of connection... The problem is that the host/service on the central server stay in the pending status and the web page does not change. I have 4 hosts with only check_http defined as the check running on them (both on the remote and the central nagios ). when i force a check on a service - it comes up at ok ( as it should be ) , how can i make sure they start with out manual intervention ? Thanks Assaf -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA checks not appear on nagios
On Feb 2, 2010, at 8:05 AM, Assaf Flatto wrote: The problem is that the host/service on the central server stay in the pending status and the web page does not change. I have 4 hosts with only check_http defined as the check running on them (both on the remote and the central nagios ). when i force a check on a service - it comes up at ok ( as it should be ) , how can i make sure they start with out manual intervention ? Make sure you've enabled external commands: http://nagios.sourceforge.net/docs/3_0/extcommands.html Make sure you've enabled passive checks in nagios.cfg and in the service definitions and that the host/service your are submitting for exists in your config: http://nagios.sourceforge.net/docs/3_0/passivechecks.html Also see this recent thread - http://www.mail-archive.com/nagios-users@lists.sourceforge.net/msg30439.html If you're still having problems, post consistent service definitions and nsca debug output (your examples don't match), nagios.cfg entries related to passive checks and external commands and an 'ls -l /path/to/nagios.cmd'. -- Marc -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] nsca results don't reach nagios
Hi, I have configured a nagios and nsca server, When I run the service check I get the following message in syslog: Jan 6 09:06:58 cl-t036-222cl nsca[31340]: Connection from 10.1.233.10 port 48077 Jan 6 09:06:58 cl-t036-222cl nsca[31340]: Handling the connection... Jan 6 09:06:58 cl-t036-222cl nsca[31340]: SERVICE CHECK - Host Name: 'localhost', Service Description: 'TestMessage', Return Code: '1', Output: 'Service in warning state'command_file=/var/lib/nagios3/rw/nagios.cmd Jan 6 09:06:58 cl-t036-222cl nsca[31340]: End of connection... The results reach the command_file=/var/lib/nagios3/rw/nagios.cmd cat /var/lib/nagios3/rw/nagios.cmd [1262786818] PROCESS_SERVICE_CHECK_RESULT;localhost;TestMessage;1;Service in warning state However the results of my passive checks do not reach nagios. The nagios error log files say nothing about the check. What could I be doing wrong? Here are my conf files: nagios.cfg cat nagios.cfg | grep passive | grep -v '#' log_passive_checks=1 accept_passive_service_checks=1 accept_passive_host_checks=1 translate_passive_host_checks=0 passive_host_checks_are_soft=0 command_file=/var/lib/nagios3/rw/nagios.cmd nsca.cfg cat /etc/nsca.cfg | grep -v '#' pid_file=/var/run/nsca.pid server_port=5667 server_address=10.1.233.52 nsca_user=nagios nsca_group=nogroup debug=1 command_file=/var/lib/nagios3/rw/nagios.cmd alternate_dump_file=/var/run/nagios/nsca.dump aggregate_writes=0 append_to_file=0 max_packet_age=30 decryption_method=1 localhost_nagios2.cfg define host{ use generic-host; Name of host template to use host_name localhost alias localhost address 127.0.0.1 } define command{ command_name check_dummy command_line $USER1$/check_dummy $ARG1$ } define service{ usegeneric-service name passive_service active_checks_enabled 0 passive_checks_enabled 1 # We want only passive checking flap_detection_enabled 0 register 0 # This is a template, not a real service is_volatile0 check_period 24x7 max_check_attempts 1 normal_check_interval 5 retry_check_interval 1 check_freshness0 contact_groups admins check_command check_dummy!0 notification_interval 120 notification_period24x7 notification_options w,u,c,r stalking_options w,c,u } define service{ use passive_service service_description TestMessage host_name localhost } The service check #!/bin/sh HOST=localhost SERVICE=TestMessage NSCAHOST=10.1.233.52 echo -e $HOST\t$SERVICE\t1\tService in warning state | /usr/sbin/send_nsca -H $NSCAHOST -c /etc/send_nsca.cfg exit 0 -- ( 2b || ! 2b ) -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] nsca results don't reach nagios
On Jan 6, 2010, at 8:10 AM, Pankaj wrote: cat /var/lib/nagios3/rw/nagios.cmd [1262786818] PROCESS_SERVICE_CHECK_RESULT;localhost;TestMessage;1;Service in warning state By doing this, you've removed it from the pipe and there's no possibility for nagios to see it at all. However the results of my passive checks do not reach nagios. The nagios error log files say nothing about the check. What could I be doing wrong? Here are my conf files: nagios.cfg cat nagios.cfg | grep passive | grep -v '#' log_passive_checks=1 accept_passive_service_checks=1 accept_passive_host_checks=1 translate_passive_host_checks=0 passive_host_checks_are_soft=0 command_file=/var/lib/nagios3/rw/nagios.cmd value of check_external_commands? ls -l /var/lib/nagios3/rw/nagios.cmd? -- Marc -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] nsca results don't reach nagios
Hi Pankaj. Please always respond on-list so that others learn and benefit from your experience. More below -- On Jan 6, 2010, at 9:50 AM, Pankaj wrote: On Wed, Jan 6, 2010 at 8:31 PM, Marc Powell m...@ena.com wrote: On Jan 6, 2010, at 8:10 AM, Pankaj wrote: cat /var/lib/nagios3/rw/nagios.cmd [1262786818] PROCESS_SERVICE_CHECK_RESULT;localhost;TestMessage;1;Service in warning state By doing this, you've removed it from the pipe and there's no possibility for nagios to see it at all. This file did not exist ... so I touched it. Does it need to be a named pipe? Yes and must be created by nagios. I suspected this was your problem. Remove the file you've created. value of check_external_commands? log_external_commands=1 That's not what I asked for ;) check_external_commands must be set to 1. I'll bet it's 0. Enabling this causes nagios to create the external command pipe (nagios.cmd) and watch for input there (including passive check results which are external commands). -- Marc -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] nsca results don't reach nagios
On Wed, Jan 6, 2010 at 10:08 PM, Marc Powell m...@ena.com wrote: Hi Pankaj. Please always respond on-list so that others learn and benefit from your experience. More below -- Thanks. value of check_external_commands? log_external_commands=1 That's not what I asked for ;) check_external_commands must be set to 1. I'll bet it's 0. Enabling this causes nagios to create the external command pipe (nagios.cmd) and watch for input there (including passive check results which are external commands). Yes That's right. removing the file and setting check_external_commands to 1 solved the problem. Thank you very much. Cheers! P. -- ( 2b || ! 2b ) -- This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA strange behaviour
Hello, problem solved, it was indeed a version problem (minor number).. redhat has released a new nsca client/server package, removing a patch, and that made it incompatible with previous version. See: http://www.mail-archive.com/nagios-users@lists.sourceforge.net/msg19402.html and related patch: http://cvs.fedoraproject.org/viewvc/EL-5/nsca/nsca-increase_max_plugin_output_length.patch?revision=1.1view=markupsortby=log Best regards, C. On Tue, 8 Dec 2009 10:52:50 -0600 Marc Powell m...@ena.com wrote: On Dec 8, 2009, at 4:40 AM, Cedric Jeanneret wrote: - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. On another hand, it shows me: [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. What do they show? What you've quoted here doesn't come from NSCA (that I can tell from a simple grep). -- nsca-2.7.2]# grep -r 'are fresh' * nsca-2.7.2]# I'm sure that comes from nagios via nagios.log, not NSCA. Are you sure you're looking in the right place? It's typically in /var/log/messages. How are you running nsca? daemon mode or via inetd? If inetd, is inetd rejecting the connection? -- Marc -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Cédric Jeanneret | System Administrator 021 619 10 32| Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL signature.asc Description: PGP signature -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA strange behaviour
Hello again I've made some other tests: - I changed encryption algo on server01 and client22, restarted nagiosnsca on server01, restarted nagios on client22. - let it run like that, checking logs for something. As I thought, nsca begins to output lot of error regarding version client and/or encryption method and/or password. Nice. - I put back the right encryption method on server01 but NOT on client22. I should have seen a pattern like Received invalid packet but nothing. I forced some status update from client22: for i in $(seq 1000); do echo -n 'host '; /usr/local/bin/submit_ochp $(hostname -f) UP Host is up; sleep 2; done Nothing. TCPDump shows me traffic, in and out for both hosts... Last test, I stopped nagiosnsca on server01, removed nsca.dump objects.cache retention.dat files, and start again nscanagios If we let the fact that all my hosts are pending, my client22 doesn't push any status... nothing in nsca logs. Any other idea ? It's like the nsca daemon ignore (without any output) client22 queries. and client22 doesn't know about it. Best regards, C. -- Cédric Jeanneret | System Administrator 021 619 10 32| Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL signature.asc Description: PGP signature -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA strange behaviour
Hello, I'm having troubles with NSCA. What we have : - about 47 passive hosts - about 220 passive services Versions : all are redhat servers, with: - NSCA 2.7.2 (latest one) - Nagios 3.1.2 We have a single nagios aggregator, which collect all NSCA status from the other hosts. What's happening: a host was reinstalled yesterday (say client22), and now it seems NSCA daemon on the aggregator (say server01) doesn't seem to collect data. What I've done: - tcpdump on both client22 and server01, both show me traffic between them, on NSCA default port (5667) - checked iptables rules, all is ok (as tcpdump shows me traffic, that's a confirmation) - trying to push status by hand from client22 to server01; ALL packets are sent successfully 1 data packet(s) sent to host successfully.. I've done this with a loop like that: for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) UP 'Host is up'; sleep 2; done - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. On another hand, it shows me: [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. I really don't know where to find a solution, neither where is the real problem. We have another network with about 200 passive hosts and over 350 passive services, and it works fine. The only differences are : - the working network is debian-only - the working network's NSCA server doesn't do anything else than central nagios server. server01 does some other stuff, like syslog server and collectd server... maybe there's a bottleneck in there, but I can't be sure about that. Does anyone of you have an idea ? Thank you in advance. Best regards, C. -- Cédric Jeanneret | System Administrator 021 619 10 32| Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL signature.asc Description: PGP signature -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA strange behaviour
Do all of your clients fail, or just the new one? Greg Pangrazio pangr...@gmail.com On Tue, Dec 8, 2009 at 4:40 AM, Cedric Jeanneret cedric.jeanne...@camptocamp.com wrote: Hello, I'm having troubles with NSCA. What we have : - about 47 passive hosts - about 220 passive services Versions : all are redhat servers, with: - NSCA 2.7.2 (latest one) - Nagios 3.1.2 We have a single nagios aggregator, which collect all NSCA status from the other hosts. What's happening: a host was reinstalled yesterday (say client22), and now it seems NSCA daemon on the aggregator (say server01) doesn't seem to collect data. What I've done: - tcpdump on both client22 and server01, both show me traffic between them, on NSCA default port (5667) - checked iptables rules, all is ok (as tcpdump shows me traffic, that's a confirmation) - trying to push status by hand from client22 to server01; ALL packets are sent successfully 1 data packet(s) sent to host successfully.. I've done this with a loop like that: for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) UP 'Host is up'; sleep 2; done - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. On another hand, it shows me: [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. I really don't know where to find a solution, neither where is the real problem. We have another network with about 200 passive hosts and over 350 passive services, and it works fine. The only differences are : - the working network is debian-only - the working network's NSCA server doesn't do anything else than central nagios server. server01 does some other stuff, like syslog server and collectd server... maybe there's a bottleneck in there, but I can't be sure about that. Does anyone of you have an idea ? Thank you in advance. Best regards, C. -- Cédric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA strange behaviour
Hello, As far as I can see, only new one. Even if they are just reinstalled (client22 was in nagios config before it was reinstalled). Best regards, C. On Tue, 8 Dec 2009 08:08:03 -0600 Greg Pangrazio pangr...@gmail.com wrote: Do all of your clients fail, or just the new one? Greg Pangrazio pangr...@gmail.com On Tue, Dec 8, 2009 at 4:40 AM, Cedric Jeanneret cedric.jeanne...@camptocamp.com wrote: Hello, I'm having troubles with NSCA. What we have : - about 47 passive hosts - about 220 passive services Versions : all are redhat servers, with: - NSCA 2.7.2 (latest one) - Nagios 3.1.2 We have a single nagios aggregator, which collect all NSCA status from the other hosts. What's happening: a host was reinstalled yesterday (say client22), and now it seems NSCA daemon on the aggregator (say server01) doesn't seem to collect data. What I've done: - tcpdump on both client22 and server01, both show me traffic between them, on NSCA default port (5667) - checked iptables rules, all is ok (as tcpdump shows me traffic, that's a confirmation) - trying to push status by hand from client22 to server01; ALL packets are sent successfully 1 data packet(s) sent to host successfully.. I've done this with a loop like that: for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) UP 'Host is up'; sleep 2; done - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. On another hand, it shows me: [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. I really don't know where to find a solution, neither where is the real problem. We have another network with about 200 passive hosts and over 350 passive services, and it works fine. The only differences are : - the working network is debian-only - the working network's NSCA server doesn't do anything else than central nagios server. server01 does some other stuff, like syslog server and collectd server... maybe there's a bottleneck in there, but I can't be sure about that. Does anyone of you have an idea ? Thank you in advance. Best regards, C. -- Cédric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Cédric Jeanneret | System Administrator 021 619 10 32| Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL signature.asc Description: PGP signature -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA strange behaviour
It sounds like there is something that changed with the re-install. Is the IP address of the system the same? Did you pick the same encryption type in the nsca config? Can you diff the nsca config with a working host? Greg Pangrazio pangr...@gmail.com On Tue, Dec 8, 2009 at 8:14 AM, Cedric Jeanneret cedric.jeanne...@camptocamp.com wrote: Hello, As far as I can see, only new one. Even if they are just reinstalled (client22 was in nagios config before it was reinstalled). Best regards, C. On Tue, 8 Dec 2009 08:08:03 -0600 Greg Pangrazio pangr...@gmail.com wrote: Do all of your clients fail, or just the new one? Greg Pangrazio pangr...@gmail.com On Tue, Dec 8, 2009 at 4:40 AM, Cedric Jeanneret cedric.jeanne...@camptocamp.com wrote: Hello, I'm having troubles with NSCA. What we have : - about 47 passive hosts - about 220 passive services Versions : all are redhat servers, with: - NSCA 2.7.2 (latest one) - Nagios 3.1.2 We have a single nagios aggregator, which collect all NSCA status from the other hosts. What's happening: a host was reinstalled yesterday (say client22), and now it seems NSCA daemon on the aggregator (say server01) doesn't seem to collect data. What I've done: - tcpdump on both client22 and server01, both show me traffic between them, on NSCA default port (5667) - checked iptables rules, all is ok (as tcpdump shows me traffic, that's a confirmation) - trying to push status by hand from client22 to server01; ALL packets are sent successfully 1 data packet(s) sent to host successfully.. I've done this with a loop like that: for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) UP 'Host is up'; sleep 2; done - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. On another hand, it shows me: [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. I really don't know where to find a solution, neither where is the real problem. We have another network with about 200 passive hosts and over 350 passive services, and it works fine. The only differences are : - the working network is debian-only - the working network's NSCA server doesn't do anything else than central nagios server. server01 does some other stuff, like syslog server and collectd server... maybe there's a bottleneck in there, but I can't be sure about that. Does anyone of you have an idea ? Thank you in advance. Best regards, C. -- Cédric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Cédric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA strange behaviour
Hello again, In fact, configuration files are dployed via puppet(http://reductivelabs.com/trac/puppet/wiki), so all files (should be) are the same. I'll check it, but as puppet runs on every hosts, they all should have the same files. IP addresses are the same (fixed IP, fixed ports). I'll check files once again. If anyone has another idea... Thank you. Best regards, C. On Tue, 8 Dec 2009 08:31:59 -0600 Greg Pangrazio pangr...@gmail.com wrote: It sounds like there is something that changed with the re-install. Is the IP address of the system the same? Did you pick the same encryption type in the nsca config? Can you diff the nsca config with a working host? Greg Pangrazio pangr...@gmail.com On Tue, Dec 8, 2009 at 8:14 AM, Cedric Jeanneret cedric.jeanne...@camptocamp.com wrote: Hello, As far as I can see, only new one. Even if they are just reinstalled (client22 was in nagios config before it was reinstalled). Best regards, C. On Tue, 8 Dec 2009 08:08:03 -0600 Greg Pangrazio pangr...@gmail.com wrote: Do all of your clients fail, or just the new one? Greg Pangrazio pangr...@gmail.com On Tue, Dec 8, 2009 at 4:40 AM, Cedric Jeanneret cedric.jeanne...@camptocamp.com wrote: Hello, I'm having troubles with NSCA. What we have : - about 47 passive hosts - about 220 passive services Versions : all are redhat servers, with: - NSCA 2.7.2 (latest one) - Nagios 3.1.2 We have a single nagios aggregator, which collect all NSCA status from the other hosts. What's happening: a host was reinstalled yesterday (say client22), and now it seems NSCA daemon on the aggregator (say server01) doesn't seem to collect data. What I've done: - tcpdump on both client22 and server01, both show me traffic between them, on NSCA default port (5667) - checked iptables rules, all is ok (as tcpdump shows me traffic, that's a confirmation) - trying to push status by hand from client22 to server01; ALL packets are sent successfully 1 data packet(s) sent to host successfully.. I've done this with a loop like that: for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) UP 'Host is up'; sleep 2; done - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. On another hand, it shows me: [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. I really don't know where to find a solution, neither where is the real problem. We have another network with about 200 passive hosts and over 350 passive services, and it works fine. The only differences are : - the working network is debian-only - the working network's NSCA server doesn't do anything else than central nagios server. server01 does some other stuff, like syslog server and collectd server... maybe there's a bottleneck in there, but I can't be sure about that. Does anyone of you have an idea ? Thank you in advance. Best regards, C. -- Cédric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Cédric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Cédric Jeanneret | System Administrator 021 619 10 32| Camptocamp SA
Re: [Nagios-users] NSCA strange behaviour
I just rsync-ed a complet config from a working host, then: for file in $(grep -lr working-client *); do sed -i 's/working-client/client22/g' $i done ... and it doesn't work any better. More over, puppet doesn't want to change anything... I'm stuck... :( On Tue, 8 Dec 2009 08:31:59 -0600 Greg Pangrazio pangr...@gmail.com wrote: It sounds like there is something that changed with the re-install. Is the IP address of the system the same? Did you pick the same encryption type in the nsca config? Can you diff the nsca config with a working host? Greg Pangrazio pangr...@gmail.com On Tue, Dec 8, 2009 at 8:14 AM, Cedric Jeanneret cedric.jeanne...@camptocamp.com wrote: Hello, As far as I can see, only new one. Even if they are just reinstalled (client22 was in nagios config before it was reinstalled). Best regards, C. On Tue, 8 Dec 2009 08:08:03 -0600 Greg Pangrazio pangr...@gmail.com wrote: Do all of your clients fail, or just the new one? Greg Pangrazio pangr...@gmail.com On Tue, Dec 8, 2009 at 4:40 AM, Cedric Jeanneret cedric.jeanne...@camptocamp.com wrote: Hello, I'm having troubles with NSCA. What we have : - about 47 passive hosts - about 220 passive services Versions : all are redhat servers, with: - NSCA 2.7.2 (latest one) - Nagios 3.1.2 We have a single nagios aggregator, which collect all NSCA status from the other hosts. What's happening: a host was reinstalled yesterday (say client22), and now it seems NSCA daemon on the aggregator (say server01) doesn't seem to collect data. What I've done: - tcpdump on both client22 and server01, both show me traffic between them, on NSCA default port (5667) - checked iptables rules, all is ok (as tcpdump shows me traffic, that's a confirmation) - trying to push status by hand from client22 to server01; ALL packets are sent successfully 1 data packet(s) sent to host successfully.. I've done this with a loop like that: for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) UP 'Host is up'; sleep 2; done - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. On another hand, it shows me: [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. I really don't know where to find a solution, neither where is the real problem. We have another network with about 200 passive hosts and over 350 passive services, and it works fine. The only differences are : - the working network is debian-only - the working network's NSCA server doesn't do anything else than central nagios server. server01 does some other stuff, like syslog server and collectd server... maybe there's a bottleneck in there, but I can't be sure about that. Does anyone of you have an idea ? Thank you in advance. Best regards, C. -- Cédric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Cédric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Cédric Jeanneret | System Administrator 021 619 10 32| Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL signature.asc Description: PGP signature
Re: [Nagios-users] NSCA strange behaviour
check openssl versions and compatibility. On Tue, Dec 8, 2009 at 2:13 PM, Cedric Jeanneret cedric.jeanne...@camptocamp.com wrote: Hello again, In fact, configuration files are dployed via puppet( http://reductivelabs.com/trac/puppet/wiki), so all files (should be) are the same. I'll check it, but as puppet runs on every hosts, they all should have the same files. IP addresses are the same (fixed IP, fixed ports). I'll check files once again. If anyone has another idea... Thank you. Best regards, C. On Tue, 8 Dec 2009 08:31:59 -0600 Greg Pangrazio pangr...@gmail.com wrote: It sounds like there is something that changed with the re-install. Is the IP address of the system the same? Did you pick the same encryption type in the nsca config? Can you diff the nsca config with a working host? Greg Pangrazio pangr...@gmail.com On Tue, Dec 8, 2009 at 8:14 AM, Cedric Jeanneret cedric.jeanne...@camptocamp.com wrote: Hello, As far as I can see, only new one. Even if they are just reinstalled (client22 was in nagios config before it was reinstalled). Best regards, C. On Tue, 8 Dec 2009 08:08:03 -0600 Greg Pangrazio pangr...@gmail.com wrote: Do all of your clients fail, or just the new one? Greg Pangrazio pangr...@gmail.com On Tue, Dec 8, 2009 at 4:40 AM, Cedric Jeanneret cedric.jeanne...@camptocamp.com wrote: Hello, I'm having troubles with NSCA. What we have : - about 47 passive hosts - about 220 passive services Versions : all are redhat servers, with: - NSCA 2.7.2 (latest one) - Nagios 3.1.2 We have a single nagios aggregator, which collect all NSCA status from the other hosts. What's happening: a host was reinstalled yesterday (say client22), and now it seems NSCA daemon on the aggregator (say server01) doesn't seem to collect data. What I've done: - tcpdump on both client22 and server01, both show me traffic between them, on NSCA default port (5667) - checked iptables rules, all is ok (as tcpdump shows me traffic, that's a confirmation) - trying to push status by hand from client22 to server01; ALL packets are sent successfully 1 data packet(s) sent to host successfully.. I've done this with a loop like that: for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) UP 'Host is up'; sleep 2; done - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. On another hand, it shows me: [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. I really don't know where to find a solution, neither where is the real problem. We have another network with about 200 passive hosts and over 350 passive services, and it works fine. The only differences are : - the working network is debian-only - the working network's NSCA server doesn't do anything else than central nagios server. server01 does some other stuff, like syslog server and collectd server... maybe there's a bottleneck in there, but I can't be sure about that. Does anyone of you have an idea ? Thank you in advance. Best regards, C. -- Cédric Jeanneret | System Administrator 021 619 10 32| Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Cédric Jeanneret | System Administrator 021 619 10 32| Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. :::
Re: [Nagios-users] NSCA strange behaviour
Hello Marcel, well, same on server01 and client22... working-server has an earlier one, but it works on this. NSCA doesn't show me any encryption error (encryption method and passphrase are correct on both ends) :/ Regards, C. On Tue, 8 Dec 2009 14:34:43 -0200 Marcel mits...@gmail.com wrote: check openssl versions and compatibility. On Tue, Dec 8, 2009 at 2:13 PM, Cedric Jeanneret cedric.jeanne...@camptocamp.com wrote: Hello again, In fact, configuration files are dployed via puppet( http://reductivelabs.com/trac/puppet/wiki), so all files (should be) are the same. I'll check it, but as puppet runs on every hosts, they all should have the same files. IP addresses are the same (fixed IP, fixed ports). I'll check files once again. If anyone has another idea... Thank you. Best regards, C. On Tue, 8 Dec 2009 08:31:59 -0600 Greg Pangrazio pangr...@gmail.com wrote: It sounds like there is something that changed with the re-install. Is the IP address of the system the same? Did you pick the same encryption type in the nsca config? Can you diff the nsca config with a working host? Greg Pangrazio pangr...@gmail.com On Tue, Dec 8, 2009 at 8:14 AM, Cedric Jeanneret cedric.jeanne...@camptocamp.com wrote: Hello, As far as I can see, only new one. Even if they are just reinstalled (client22 was in nagios config before it was reinstalled). Best regards, C. On Tue, 8 Dec 2009 08:08:03 -0600 Greg Pangrazio pangr...@gmail.com wrote: Do all of your clients fail, or just the new one? Greg Pangrazio pangr...@gmail.com On Tue, Dec 8, 2009 at 4:40 AM, Cedric Jeanneret cedric.jeanne...@camptocamp.com wrote: Hello, I'm having troubles with NSCA. What we have : - about 47 passive hosts - about 220 passive services Versions : all are redhat servers, with: - NSCA 2.7.2 (latest one) - Nagios 3.1.2 We have a single nagios aggregator, which collect all NSCA status from the other hosts. What's happening: a host was reinstalled yesterday (say client22), and now it seems NSCA daemon on the aggregator (say server01) doesn't seem to collect data. What I've done: - tcpdump on both client22 and server01, both show me traffic between them, on NSCA default port (5667) - checked iptables rules, all is ok (as tcpdump shows me traffic, that's a confirmation) - trying to push status by hand from client22 to server01; ALL packets are sent successfully 1 data packet(s) sent to host successfully.. I've done this with a loop like that: for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) UP 'Host is up'; sleep 2; done - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. On another hand, it shows me: [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. I really don't know where to find a solution, neither where is the real problem. We have another network with about 200 passive hosts and over 350 passive services, and it works fine. The only differences are : - the working network is debian-only - the working network's NSCA server doesn't do anything else than central nagios server. server01 does some other stuff, like syslog server and collectd server... maybe there's a bottleneck in there, but I can't be sure about that. Does anyone of you have an idea ? Thank you in advance. Best regards, C. -- Cédric Jeanneret | System Administrator 021 619 10 32| Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Cédric Jeanneret | System Administrator 021 619 10 32| Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL
Re: [Nagios-users] NSCA strange behaviour
On Dec 8, 2009, at 4:40 AM, Cedric Jeanneret wrote: - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. On another hand, it shows me: [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. What do they show? What you've quoted here doesn't come from NSCA (that I can tell from a simple grep). -- nsca-2.7.2]# grep -r 'are fresh' * nsca-2.7.2]# I'm sure that comes from nagios via nagios.log, not NSCA. Are you sure you're looking in the right place? It's typically in /var/log/messages. How are you running nsca? daemon mode or via inetd? If inetd, is inetd rejecting the connection? -- Marc -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA strange behaviour
Hello Marc, Indeed, the are fresh comes from nagios.log. ok, done. NSCA is running in daemon mode, iptables is opened for nsca port, and connections can go through it (tcpdump shows it to me, in both directions). Setting debug=1 in nsca.cfg seems to do nothing more in /var/log/messages (redhat server). I just see down hosts passing through (results for ... are stalled - forcing immedia check...). I set up debug for nagios itself, but it really seems to be a problem at NSCA level. Regards, C. On Tue, 8 Dec 2009 10:52:50 -0600 Marc Powell m...@ena.com wrote: On Dec 8, 2009, at 4:40 AM, Cedric Jeanneret wrote: - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. On another hand, it shows me: [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. What do they show? What you've quoted here doesn't come from NSCA (that I can tell from a simple grep). -- nsca-2.7.2]# grep -r 'are fresh' * nsca-2.7.2]# I'm sure that comes from nagios via nagios.log, not NSCA. Are you sure you're looking in the right place? It's typically in /var/log/messages. How are you running nsca? daemon mode or via inetd? If inetd, is inetd rejecting the connection? -- Marc -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Cédric Jeanneret | System Administrator 021 619 10 32| Camptocamp SA cedric.jeanne...@camptocamp.com | PSE-A / EPFL signature.asc Description: PGP signature -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA strange behaviour
On Dec 8, 2009, at 11:19 AM, Cedric Jeanneret wrote: Hello Marc, Indeed, the are fresh comes from nagios.log. ok, done. NSCA is running in daemon mode, iptables is opened for nsca port, and connections can go through it (tcpdump shows it to me, in both directions). Setting debug=1 in nsca.cfg seems to do nothing more in /var/log/messages (redhat server). I just see down hosts passing through (results for ... are stalled - forcing immedia check...). There will be lots of output. I'm pretty sure that NSCA logs to syslog's 'daemon' facility using the 'debug','err', and 'info' priorities. Find out what file those are being logged to in /etc/syslog.conf. You're not going to get much traction on this issue until you can see that NSCA output. You could also try grepping your log files for 'Listening for connections' to find where it's going (if it it's currently being logged by syslog). -- Marc -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA strange behaviour
Oh, thank you. I'll do that tomorrow and see what can be used here. On Tue, Dec 8, 2009 at 7:09 PM, Marc Powell m...@ena.com wrote: On Dec 8, 2009, at 11:19 AM, Cedric Jeanneret wrote: Hello Marc, Indeed, the are fresh comes from nagios.log. ok, done. NSCA is running in daemon mode, iptables is opened for nsca port, and connections can go through it (tcpdump shows it to me, in both directions). Setting debug=1 in nsca.cfg seems to do nothing more in /var/log/messages (redhat server). I just see down hosts passing through (results for ... are stalled - forcing immedia check...). There will be lots of output. I'm pretty sure that NSCA logs to syslog's 'daemon' facility using the 'debug','err', and 'info' priorities. Find out what file those are being logged to in /etc/syslog.conf. You're not going to get much traction on this issue until you can see that NSCA output. You could also try grepping your log files for 'Listening for connections' to find where it's going (if it it's currently being logged by syslog). -- Marc -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] nsca binary for tru64 unix
Hello, I am inplementing Nagios on a Linux server and would like to get the NSCA_SEND for Tru64 UNIX 5.1b. Is there such a beast? Thank you, Anthony Gonzales -- Come build with us! The BlackBerryreg; Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9#45;12, 2009. Register now#33; http://p.sf.net/sfu/devconf___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA speed problem
Have you considered OCP_daemon? http://wiki.nagios.org/index.php/OCP_Daemon -Original Message- From: d...@chatham.org [mailto:d...@chatham.org] Sent: Tuesday, September 08, 2009 1:00 PM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] NSCA speed problem I have a Nagios setup that is monitoring ~ 1000 hosts and ~ 13,000 services. The active checks are run on a Sun box with 128 CPUs/cores. Since it appeared that status.cgi could only be single threaded, it meant that the Sun box was slow in putting a page together, so all checks were forwarded to a fast Intel machine which puts together the page in about 2 seconds instead of about 16 on the SPARC. However, NSCA is now slowing the process, either on the sending or the receiving end. There are only two NSCA processes running, so I suspect that this is the problem. I can think of a number of alternatives. One would be to load up ndoutils, which looks like a fine solution, but I'm a but under the gun here and I'd really like to find something that works quickly. An alternative might be to use syslog to get the data from one machine to another. Any ideas, suggestions? --- --- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null This email message is intended for the use of the person to whom it has been sent, and may contain information that is confidential or legally protected. If you are not the intended recipient or have received this message in error, you are not authorized to copy, distribute, or otherwise use this message or its attachments. Please notify the sender immediately by return e-mail and permanently delete this message and any attachments. Verio, Inc. makes no warranty that this email is error or virus free. Thank you. -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA speed problem
I have a Nagios setup that is monitoring ~ 1000 hosts and ~ 13,000 services. The active checks are run on a Sun box with 128 CPUs/cores. Since it appeared that status.cgi could only be single threaded, it meant that the Sun box was slow in putting a page together, so all checks were forwarded to a fast Intel machine which puts together the page in about 2 seconds instead of about 16 on the SPARC. However, NSCA is now slowing the process, either on the sending or the receiving end. There are only two NSCA processes running, so I suspect that this is the problem. I can think of a number of alternatives. One would be to load up ndoutils, which looks like a fine solution, but I'm a but under the gun here and I'd really like to find something that works quickly. An alternative might be to use syslog to get the data from one machine to another. Any ideas, suggestions? -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA speed problem
Try OCP Daemon (http://wiki.nagios.org/index.php/OCP_Daemon) --- On Tue, 9/8/09, d...@chatham.org d...@chatham.org wrote: From: d...@chatham.org d...@chatham.org Subject: [Nagios-users] NSCA speed problem To: nagios-users@lists.sourceforge.net Date: Tuesday, September 8, 2009, 2:59 PM I have a Nagios setup that is monitoring ~ 1000 hosts and ~ 13,000 services. The active checks are run on a Sun box with 128 CPUs/cores. Since it appeared that status.cgi could only be single threaded, it meant that the Sun box was slow in putting a page together, so all checks were forwarded to a fast Intel machine which puts together the page in about 2 seconds instead of about 16 on the SPARC. However, NSCA is now slowing the process, either on the sending or the receiving end. There are only two NSCA processes running, so I suspect that this is the problem. I can think of a number of alternatives. One would be to load up ndoutils, which looks like a fine solution, but I'm a but under the gun here and I'd really like to find something that works quickly. An alternative might be to use syslog to get the data from one machine to another. Any ideas, suggestions? -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA
Firstly many thanks to all that helped me, I finally got distributed servers up and running. It seems that there is a lot of discussion about distributed clusters. Is there a current discussion forum linked to development for this, and how do I join it I would like to return some input and help make future versions of nagios even better Cheers Nick -Original Message- From: Marc Powell [mailto:m...@ena.com] Sent: Saturday, July 11, 2009 2:04 To: Nagios Users Mail-list Subject: Re: [Nagios-users] Re NSCA On Jul 10, 2009, at 11:12 AM, Nick Price wrote: Hi All Ok I can see it coming into the central nagios logs. Trouble was to do with encryption/decryption and passwords How do I get the data into the central nagios and therefore into the central nagios web interface. I'm not sure what and how to write the central server configs for the passive results coming in from the remote server. The Distributed Monitoring section of the documentation goes into pretty specific detail. On the central server, you essentially just need to make sure that passive checks are enabled program-wide and for the services you are submitting results for, that nagios is checking for external commands and that NSCA is configured to write to nagios' external command file. You must have a matching host{} and service{} definition for every passive result that you will be submitting. -- Marc -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA
On Jul 14, 2009, at 11:16 AM, Nick Price wrote: Firstly many thanks to all that helped me, I finally got distributed servers up and running. It seems that there is a lot of discussion about distributed clusters. Not especially, that I've seen. Is there a current discussion forum linked to development for this, and how do I join it Nothing beyond the general nagios-devel list that I am aware of -- http://wiki.nagios.org/index.php/Mail_Lists I would like to return some input and help make future versions of nagios even better Better nagios is always better =) -- Marc -- Enter the BlackBerry Developer Challenge This is your chance to win up to $100,000 in prizes! For a limited time, vendors submitting new applications to BlackBerry App World(TM) will have the opportunity to enter the BlackBerry Developer Challenge. See full prize details at: http://p.sf.net/sfu/Challenge ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA - Command file does not exist
Do you have Linux SE or AppArmor that might block access for nsca? Otherwise, I'm pretty much out of ideas :-( Joan Tugores wrote: Manually feed works!! but stopstart nagios not solves the problem. Kevin Keane escribió: Yes, this looks good. Besides, I only just noticed that you did the ls -lah * as user nagios, so we now have double confirmation that this works. Another question: what happens when you manually feed a command into nagios.cmd? echo xxx /usr/local/nagios/var/rw/nagios.cmd It should basically succeed without any message. If you look in your nagios log files, you'll probably see an invalid command xxx being logged. Finally, you could try stopping nagios, deleting the nagios.cmd file if it stayed around, and restarting nagios. It should get recreated. This just in case it is somehow messed up. Joan Tugores wrote: drwxr-xr-x 15 root root 4,0K abr 29 13:07 /usr drwxr-xr-x 14 root root 4,0K abr 29 13:58 /usr/local drwxrwxr-x 9 nagios nagiosgrb 4,0K abr 29 14:16 /usr/local/nagios drwxrwxr-x 5 nagios nagiosgrb 4,0K may 11 11:07 /usr/local/nagios/var drwxrwxr-x 2 nagios nagiosgrb 4,0K may 8 13:22 /usr/local/nagios/var/rw I think is all ok. Kevin Keane escribió: How about ls -lahd /usr/local/nagios/var/rw /usr/local/nagios/var /usr/local/nagios /usr/local /usr They should all at a minimum have X permission for either the nagios user or the group or for world. ? Joan Tugores wrote: [nag...@dcserverbd2 rw]$ ls -lah * prw-rw 1 nagios nagiosgrb0 may 8 13:22 nagios.cmd -rwxrwxr-x 1 nagios nagiosgrb 4,0K may 11 09:20 nsca.dump seems to be well. Kevin Keane escribió: Does the nsca user or group have access to every single directory along the path? Easy way to try: su - nagios cd /usr/local/nagios/var/rw ls -lah * Note: the above won't work if user nagios has /bin/false as a shell. In that case, try su -s /bin/bash - nagios It should list nagios.cmd (and probably some other files) Joan Tugores wrote: I think that not using chroot jail, this is my nsca.cfg: pid_file=/var/run/nsca.pid server_port=5667 #server_address=192.168.1.1 nsca_user=nagios nsca_group=nagiosgrb #nsca_chroot=/usr/local/nagios/var/rw/ debug=1 command_file=/usr/local/nagios/var/rw/nagios.cmd alternate_dump_file=/usr/local/nagios/var/rw/nsca.dump aggregate_writes=0 append_to_file=0 max_packet_age=30 #password= decryption_method=1 Kevin Keane escribió: Does nsca run in a chroot jail? I'm not sure if it can do that. If that's the case, nsca may think that /usr/local/nagios is really the root of the file system, and anything above that directory is essentially hidden. In that case, you'd have to change the nsca.cfg configuration to use /var/rw/nagios.cmd (or strip out whatever the root of the chroot jail is). Joan Tugores wrote: Hi all, When I send a packet to ncsa server this not processed. In /var/log/messages: Command file '/usr/local/nagios/var/rw/nagios.cmd' does not exist,attempting to use alternate dump file '/usr/local/nagios/var/rw/nsca.dump' for output But nagios.cmd exists and the permissions are good. prw-rw 1 nagios nagiosgrb0 may 5 09:59 nagios.cmd Regards, Joan -- -- -- -- Kevin Keane Owner The NetTech Find the Uncommon: Expert Solutions for a Network You Never Have to Think About Office: 866-642-7116 http://www.4nettech.com This e-mail and attachments, if any, may contain confidential and/or proprietary information. Please be advised that the unauthorized use or disclosure of the information is strictly prohibited. The information herein is intended only for use by the intended recipient(s) named above. If you have received this transmission in error, please notify the sender immediately and permanently delete the e-mail and any copies, printouts or attachments thereof. -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA - Command file does not exist
How about ls -lahd /usr/local/nagios/var/rw /usr/local/nagios/var /usr/local/nagios /usr/local /usr They should all at a minimum have X permission for either the nagios user or the group or for world. ? Joan Tugores wrote: [nag...@dcserverbd2 rw]$ ls -lah * prw-rw 1 nagios nagiosgrb0 may 8 13:22 nagios.cmd -rwxrwxr-x 1 nagios nagiosgrb 4,0K may 11 09:20 nsca.dump seems to be well. Kevin Keane escribió: Does the nsca user or group have access to every single directory along the path? Easy way to try: su - nagios cd /usr/local/nagios/var/rw ls -lah * Note: the above won't work if user nagios has /bin/false as a shell. In that case, try su -s /bin/bash - nagios It should list nagios.cmd (and probably some other files) Joan Tugores wrote: I think that not using chroot jail, this is my nsca.cfg: pid_file=/var/run/nsca.pid server_port=5667 #server_address=192.168.1.1 nsca_user=nagios nsca_group=nagiosgrb #nsca_chroot=/usr/local/nagios/var/rw/ debug=1 command_file=/usr/local/nagios/var/rw/nagios.cmd alternate_dump_file=/usr/local/nagios/var/rw/nsca.dump aggregate_writes=0 append_to_file=0 max_packet_age=30 #password= decryption_method=1 Kevin Keane escribió: Does nsca run in a chroot jail? I'm not sure if it can do that. If that's the case, nsca may think that /usr/local/nagios is really the root of the file system, and anything above that directory is essentially hidden. In that case, you'd have to change the nsca.cfg configuration to use /var/rw/nagios.cmd (or strip out whatever the root of the chroot jail is). Joan Tugores wrote: Hi all, When I send a packet to ncsa server this not processed. In /var/log/messages: Command file '/usr/local/nagios/var/rw/nagios.cmd' does not exist,attempting to use alternate dump file '/usr/local/nagios/var/rw/nsca.dump' for output But nagios.cmd exists and the permissions are good. prw-rw 1 nagios nagiosgrb0 may 5 09:59 nagios.cmd Regards, Joan -- -- Kevin Keane Owner The NetTech Find the Uncommon: Expert Solutions for a Network You Never Have to Think About Office: 866-642-7116 http://www.4nettech.com This e-mail and attachments, if any, may contain confidential and/or proprietary information. Please be advised that the unauthorized use or disclosure of the information is strictly prohibited. The information herein is intended only for use by the intended recipient(s) named above. If you have received this transmission in error, please notify the sender immediately and permanently delete the e-mail and any copies, printouts or attachments thereof. -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA - Command file does not exist
Yes, this looks good. Besides, I only just noticed that you did the ls -lah * as user nagios, so we now have double confirmation that this works. Another question: what happens when you manually feed a command into nagios.cmd? echo xxx /usr/local/nagios/var/rw/nagios.cmd It should basically succeed without any message. If you look in your nagios log files, you'll probably see an invalid command xxx being logged. Finally, you could try stopping nagios, deleting the nagios.cmd file if it stayed around, and restarting nagios. It should get recreated. This just in case it is somehow messed up. Joan Tugores wrote: drwxr-xr-x 15 root root 4,0K abr 29 13:07 /usr drwxr-xr-x 14 root root 4,0K abr 29 13:58 /usr/local drwxrwxr-x 9 nagios nagiosgrb 4,0K abr 29 14:16 /usr/local/nagios drwxrwxr-x 5 nagios nagiosgrb 4,0K may 11 11:07 /usr/local/nagios/var drwxrwxr-x 2 nagios nagiosgrb 4,0K may 8 13:22 /usr/local/nagios/var/rw I think is all ok. Kevin Keane escribió: How about ls -lahd /usr/local/nagios/var/rw /usr/local/nagios/var /usr/local/nagios /usr/local /usr They should all at a minimum have X permission for either the nagios user or the group or for world. ? Joan Tugores wrote: [nag...@dcserverbd2 rw]$ ls -lah * prw-rw 1 nagios nagiosgrb0 may 8 13:22 nagios.cmd -rwxrwxr-x 1 nagios nagiosgrb 4,0K may 11 09:20 nsca.dump seems to be well. Kevin Keane escribió: Does the nsca user or group have access to every single directory along the path? Easy way to try: su - nagios cd /usr/local/nagios/var/rw ls -lah * Note: the above won't work if user nagios has /bin/false as a shell. In that case, try su -s /bin/bash - nagios It should list nagios.cmd (and probably some other files) Joan Tugores wrote: I think that not using chroot jail, this is my nsca.cfg: pid_file=/var/run/nsca.pid server_port=5667 #server_address=192.168.1.1 nsca_user=nagios nsca_group=nagiosgrb #nsca_chroot=/usr/local/nagios/var/rw/ debug=1 command_file=/usr/local/nagios/var/rw/nagios.cmd alternate_dump_file=/usr/local/nagios/var/rw/nsca.dump aggregate_writes=0 append_to_file=0 max_packet_age=30 #password= decryption_method=1 Kevin Keane escribió: Does nsca run in a chroot jail? I'm not sure if it can do that. If that's the case, nsca may think that /usr/local/nagios is really the root of the file system, and anything above that directory is essentially hidden. In that case, you'd have to change the nsca.cfg configuration to use /var/rw/nagios.cmd (or strip out whatever the root of the chroot jail is). Joan Tugores wrote: Hi all, When I send a packet to ncsa server this not processed. In /var/log/messages: Command file '/usr/local/nagios/var/rw/nagios.cmd' does not exist,attempting to use alternate dump file '/usr/local/nagios/var/rw/nsca.dump' for output But nagios.cmd exists and the permissions are good. prw-rw 1 nagios nagiosgrb0 may 5 09:59 nagios.cmd Regards, Joan -- -- -- Kevin Keane Owner The NetTech Find the Uncommon: Expert Solutions for a Network You Never Have to Think About Office: 866-642-7116 http://www.4nettech.com This e-mail and attachments, if any, may contain confidential and/or proprietary information. Please be advised that the unauthorized use or disclosure of the information is strictly prohibited. The information herein is intended only for use by the intended recipient(s) named above. If you have received this transmission in error, please notify the sender immediately and permanently delete the e-mail and any copies, printouts or attachments thereof. -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA - Command file does not exist
Does the nsca user or group have access to every single directory along the path? Easy way to try: su - nagios cd /usr/local/nagios/var/rw ls -lah * Note: the above won't work if user nagios has /bin/false as a shell. In that case, try su -s /bin/bash - nagios It should list nagios.cmd (and probably some other files) Joan Tugores wrote: I think that not using chroot jail, this is my nsca.cfg: pid_file=/var/run/nsca.pid server_port=5667 #server_address=192.168.1.1 nsca_user=nagios nsca_group=nagiosgrb #nsca_chroot=/usr/local/nagios/var/rw/ debug=1 command_file=/usr/local/nagios/var/rw/nagios.cmd alternate_dump_file=/usr/local/nagios/var/rw/nsca.dump aggregate_writes=0 append_to_file=0 max_packet_age=30 #password= decryption_method=1 Kevin Keane escribió: Does nsca run in a chroot jail? I'm not sure if it can do that. If that's the case, nsca may think that /usr/local/nagios is really the root of the file system, and anything above that directory is essentially hidden. In that case, you'd have to change the nsca.cfg configuration to use /var/rw/nagios.cmd (or strip out whatever the root of the chroot jail is). Joan Tugores wrote: Hi all, When I send a packet to ncsa server this not processed. In /var/log/messages: Command file '/usr/local/nagios/var/rw/nagios.cmd' does not exist,attempting to use alternate dump file '/usr/local/nagios/var/rw/nsca.dump' for output But nagios.cmd exists and the permissions are good. prw-rw 1 nagios nagiosgrb0 may 5 09:59 nagios.cmd Regards, Joan -- Kevin Keane Owner The NetTech Find the Uncommon: Expert Solutions for a Network You Never Have to Think About Office: 866-642-7116 http://www.4nettech.com This e-mail and attachments, if any, may contain confidential and/or proprietary information. Please be advised that the unauthorized use or disclosure of the information is strictly prohibited. The information herein is intended only for use by the intended recipient(s) named above. If you have received this transmission in error, please notify the sender immediately and permanently delete the e-mail and any copies, printouts or attachments thereof. -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA - Network server bind failure
This means exactly what the error message says: that there is already another program using port 5667. Odds are that nsca is already running on your computer; maybe it got started as a daemon on system startup. Joan Tugores wrote: Hi all, When I try to put up the daemon nsca with the command /usr/local/nagios/bin/nsca -c /usr/local/nagios/etc/nsca.cfg in /var/log/messages appears Network server bind failure (98: Address already in use) the port 5667 is configured and netstat -na show tcp 0 00.0.0.0:56670.0.0.0:* LISTEN. Please Help. Thanks. -- -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Kevin Keane Owner The NetTech Find the Uncommon: Expert Solutions for a Network You Never Have to Think About Office: 866-642-7116 http://www.4nettech.com This e-mail and attachments, if any, may contain confidential and/or proprietary information. Please be advised that the unauthorized use or disclosure of the information is strictly prohibited. The information herein is intended only for use by the intended recipient(s) named above. If you have received this transmission in error, please notify the sender immediately and permanently delete the e-mail and any copies, printouts or attachments thereof. -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA - Network server bind failure
Nsca not running "ps -ef | grep nsca" not show process nsca. Kevin Keane escribi: This means exactly what the error message says: that there is already another program using port 5667. Odds are that nsca is already running on your computer; maybe it got started as a daemon on system startup. Joan Tugores wrote: Hi all, When I try to put up the daemon nsca with the command "/usr/local/nagios/bin/nsca -c /usr/local/nagios/etc/nsca.cfg" in "/var/log/messages" appears "Network server bind failure (98: Address already in use)" the port 5667 is configured and netstat -na show "tcp 0 00.0.0.0:56670.0.0.0:* LISTEN". Please Help. Thanks. -- -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA - Network server bind failure
Try netstat -ltunp | grep 5667 And respond to the group, please. Joan Tugores wrote: Nsca not running ps -ef | grep nsca not show process nsca. Kevin Keane escribió: This means exactly what the error message says: that there is already another program using port 5667. Odds are that nsca is already running on your computer; maybe it got started as a daemon on system startup. Joan Tugores wrote: Hi all, When I try to put up the daemon nsca with the command /usr/local/nagios/bin/nsca -c /usr/local/nagios/etc/nsca.cfg in /var/log/messages appears Network server bind failure (98: Address already in use) the port 5667 is configured and netstat -na show tcp 0 00.0.0.0:56670.0.0.0:* LISTEN. Please Help. Thanks. -- -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Kevin Keane Owner The NetTech Find the Uncommon: Expert Solutions for a Network You Never Have to Think About Office: 866-642-7116 http://www.4nettech.com This e-mail and attachments, if any, may contain confidential and/or proprietary information. Please be advised that the unauthorized use or disclosure of the information is strictly prohibited. The information herein is intended only for use by the intended recipient(s) named above. If you have received this transmission in error, please notify the sender immediately and permanently delete the e-mail and any copies, printouts or attachments thereof. -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA - Command file does not exist
Hi all, When I send a packet to ncsa server this not processed. In /var/log/messages: Command file '/usr/local/nagios/var/rw/nagios.cmd' does not exist,attempting to use alternate dump file '/usr/local/nagios/var/rw/nsca.dump' for output But nagios.cmd exists and the permissions are good. prw-rw 1 nagios nagiosgrb0 may 5 09:59 nagios.cmd Regards, Joan -- The NEW KODAK i700 Series Scanners deliver under ANY circumstances! Your production scanning environment may not be a perfect world - but thanks to Kodak, there's a perfect scanner to get the job done! With the NEW KODAK i700 Series Scanner you'll get full speed at 300 dpi even with all image processing features enabled. http://p.sf.net/sfu/kodak-com ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA 2.7.2 (core Nagios 3.0.3) filters HTML-special characters in event messages
Dear list, *From our old Nagios system:* we had NSCA 2.7.2 and core *Nagios 3.0.RC2*instance: we had *no* *problem* with NSCA and special HTML characters like Fex.: in nagios.log: iframe src=http://xxx; width=400 height=400 scrolling=auto frameborder=1/iframe Note: nagios.cfg option: illegal_macro_output_chars=`~$|' *From our new Nagios system:**:* we have NSCA 2.7.2 and core *Nagios 3.0.3 *instance: we had problem with NSCA and special HTML characters like Fex.: in nagios.log: iframe src=http://xxx width=400 height=400 scrolling=auto frameborder=1 /iframe Note: nagios.cfg option: illegal_macro_output_chars=`~$|' We had tried to change option: illegal_macro_output_chars=`~$|' But it didnt help. Any hints/links/suggestion are always welcome with best regards, Thanks - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA and Latency
My Environment: 3 x Dell 2950 Dual DualCore and 8 GB of RAM One system runs checks against our Linux servers One runs checks against our Windows servers We are running SLES10 update 3 Both systems use nsca to send their check results to a third server that displays the service checks for our operators. All three systems are on the same vlan but separate cisco switchs. I am running nsca in daemon mode on the central server with this command /usr/local/nagios/bin/nsca -c /usr/local/nagios/etc/nsca.cfg -daemon Nsca.cfg is as follows: pid_file=/var/run/nsca.pidserver_port=5667#server_address=192.168.1.1nsc a_user=nagiosnsca_group=nagios#nsca_chroot=/var/run/nagios/rwdebug=1comm and_file=/usr/local/nagios/var/rw/nagios.cmdalternate_dump_file=/usr/loc al/nagios/var/rw/nsca.dumpaggregate_writes=1append_to_file=1max_packet_a ge=300password=xxdecryption_method=14 I just set the aggregate and append options to try and fix the problem they were not set before either way the results are the same. Ok so on the 2 servers doing the checks Everything runs fine even with the OCSP running my send_service_check_results script. My script is pretty much straight out of the book. #!/bin/sh# Arguments:# $1 = Hostname of the host (using the $HOSTNAME$ macro)# $2 = Service description of the service (using the $SERVICEDESC$ macro)# $3 = Service status id of the service (using the $SERVICESTATUSID$ macro)# $4 = Output of the Service Check (using the $SERVICEOUTPUT$ macro)/bin/echo $1,$2,$3,N3 - $4 | /usr/local/nagios/libexec/send_nsca -H 10.10.129.37 -c /usr/local/nagios/etc/send_nsca.cfg -d , Like I said everything is fine on the 2 servers even with OCSP on. Between the 2 servers we are running about 10k service checks, latency is very low just a few seconds. However if I turn on the NSCA Deamon on the central server my latency creeps up to about 1500+ seconds with in an hour and just gets worse from there on both remotes. The checks that should run every 5 minutes on the 2 remote servers end up running every few hours or less. The central server is doing 0 active checks. I set debug mode and that proved to provide very little insight into the problem. CPU and Mem stats are both very low on all three server. The same thing can be said for the network, network utilization is less than 2% and there are no errors on the interfaces. Overall hardware utilization is 10% or less on these three systems. So my question is has anyone had this kind of problem with NSCA? What am I missing? Should I be batching my service checks on the remote servers? Should I be using xinetd for NSCA instead of deamon mode? Thanks Brady - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA and Latency
NSCA just doesn't scale well within Nagios. You will need to try something like the OCP Daemon mentioned here: http://www.nagioscommunity.org/wiki/index.php/OCP_Daemon I believe Andreas Ericsson has also written a broker module for NSCA. It is apparently still in its testing/alpha stages so you would have to contact that person directly. Jonathan From: Maxwell,Brady [mailto:[EMAIL PROTECTED] Sent: Thursday, October 23, 2008 8:42 AM To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] NSCA and Latency My Environment: 3 x Dell 2950 Dual DualCore and 8 GB of RAM One system runs checks against our Linux servers One runs checks against our Windows servers We are running SLES10 update 3 Both systems use nsca to send their check results to a third server that displays the service checks for our operators. All three systems are on the same vlan but separate cisco switchs. I am running nsca in daemon mode on the central server with this command /usr/local/nagios/bin/nsca -c /usr/local/nagios/etc/nsca.cfg -daemon Nsca.cfg is as follows: pid_file=/var/run/nsca.pidserver_port=5667#server_address=192.168.1.1nsc a_user=nagiosnsca_group=nagios#nsca_chroot=/var/run/nagios/rwdebug=1comm and_file=/usr/local/nagios/var/rw/nagios.cmdalternate_dump_file=/usr/loc al/nagios/var/rw/nsca.dumpaggregate_writes=1append_to_file=1max_packet_a ge=300password=xxdecryption_method=14 I just set the aggregate and append options to try and fix the problem they were not set before either way the results are the same. Ok so on the 2 servers doing the checks Everything runs fine even with the OCSP running my send_service_check_results script. My script is pretty much straight out of the book. #!/bin/sh# Arguments:# $1 = Hostname of the host (using the $HOSTNAME$ macro)# $2 = Service description of the service (using the $SERVICEDESC$ macro)# $3 = Service status id of the service (using the $SERVICESTATUSID$ macro)# $4 = Output of the Service Check (using the $SERVICEOUTPUT$ macro)/bin/echo $1,$2,$3,N3 - $4 | /usr/local/nagios/libexec/send_nsca -H 10.10.129.37 -c /usr/local/nagios/etc/send_nsca.cfg -d , Like I said everything is fine on the 2 servers even with OCSP on. Between the 2 servers we are running about 10k service checks, latency is very low just a few seconds. However if I turn on the NSCA Deamon on the central server my latency creeps up to about 1500+ seconds with in an hour and just gets worse from there on both remotes. The checks that should run every 5 minutes on the 2 remote servers end up running every few hours or less. The central server is doing 0 active checks. I set debug mode and that proved to provide very little insight into the problem. CPU and Mem stats are both very low on all three server. The same thing can be said for the network, network utilization is less than 2% and there are no errors on the interfaces. Overall hardware utilization is 10% or less on these three systems. So my question is has anyone had this kind of problem with NSCA? What am I missing? Should I be batching my service checks on the remote servers? Should I be using xinetd for NSCA instead of deamon mode? Thanks Brady This email message is intended for the use of the person to whom it has been sent, and may contain information that is confidential or legally protected. If you are not the intended recipient or have received this message in error, you are not authorized to copy, distribute, or otherwise use this message or its attachments. Please notify the sender immediately by return e-mail and permanently delete this message and any attachments. Verio, Inc. makes no warranty that this email is error or virus free. Thank you. - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA and Latency
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Jonathan Call schrieb: I believe Andreas Ericsson has also written a broker module for NSCA. It is apparently still in its testing/alpha stages so you would have to contact that person directly. Or you give it a direct try and feed Andreas with some scaling information: http://git.op5.org/git/?p=nagios/pnsca.git;a=summary Jonathan *From:* Maxwell,Brady [mailto:[EMAIL PROTECTED] *Sent:* Thursday, October 23, 2008 8:42 AM *To:* nagios-users@lists.sourceforge.net *Subject:* [Nagios-users] NSCA and Latency My Environment: 3 x Dell 2950 Dual DualCore and 8 GB of RAM One system runs checks against our Linux servers One runs checks against our Windows servers We are running SLES10 update 3 Both systems use nsca to send their check results to a third server that displays the service checks for our operators. All three systems are on the same vlan but separate cisco switchs. I am running nsca in daemon mode on the central server with this command /usr/local/nagios/bin/nsca -c /usr/local/nagios/etc/nsca.cfg –daemon Nsca.cfg is as follows: pid_file=/var/run/nsca.pidserver_port=5667#server_address=192.168.1.1nsca_user=nagiosnsca_group=nagios#nsca_chroot=/var/run/nagios/rwdebug=1command_file=/usr/local/nagios/var/rw/nagios.cmdalternate_dump_file=/usr/local/nagios/var/rw/nsca.dumpaggregate_writes=1append_to_file=1max_packet_age=300password=xxdecryption_method=14 I just set the aggregate and append options to try and fix the problem they were not set before either way the results are the same. Ok so on the 2 servers doing the checks…. Everything runs fine even with the OCSP running my send_service_check_results script. My script is pretty much straight out of the book. #!/bin/sh# Arguments:# $1 = Hostname of the host (using the $HOSTNAME$ macro)# $2 = Service description of the service (using the $SERVICEDESC$ macro)# $3 = Service status id of the service (using the $SERVICESTATUSID$ macro)# $4 = Output of the Service Check (using the $SERVICEOUTPUT$ macro)/bin/echo $1,$2,$3,N3 - $4 | /usr/local/nagios/libexec/send_nsca -H 10.10.129.37 -c /usr/local/nagios/etc/send_nsca.cfg -d , Like I said everything is fine on the 2 servers even with OCSP on. Between the 2 servers we are running about 10k service checks, latency is very low just a few seconds. However if I turn on the NSCA Deamon on the central server my latency creeps up to about 1500+ seconds with in an hour and just gets worse from there on both remotes. The checks that should run every 5 minutes on the 2 remote servers end up running every few hours or less. The central server is doing 0 active checks. I set debug mode and that proved to provide very little insight into the problem. CPU and Mem stats are both very low on all three server. The same thing can be said for the network, network utilization is less than 2% and there are no errors on the interfaces. Overall hardware utilization is 10% or less on these three systems. So my question is has anyone had this kind of problem with NSCA? What am I missing? Should I be batching my service checks on the remote servers? Should I be using xinetd for NSCA instead of deamon mode? Thanks Brady This email message is intended for the use of the person to whom it has been sent, and may contain information that is confidential or legally protected. If you are not the intended recipient or have received this message in error, you are not authorized to copy, distribute, or otherwise use this message or its attachments. Please notify the sender immediately by return e-mail and permanently delete this message and any attachments. Verio, Inc. makes no warranty that this email is error or virus free. Thank you. - This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100url=/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -BEGIN PGP SIGNATURE- Version: GnuPG v1.4.8 (Darwin) iEYEARECAAYFAkkA14gACgkQlI0PwfxLQjlRgQCdGcEJtZOA9j/zDGBmlTi/FNdm OEUAn0QR2LqsriHHfjnhSWUSKgwAycHX =MW1U -END PGP SIGNATURE
Re: [Nagios-users] NSCA not receiving or send_nsca not sending
Von: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Im Auftrag von Anthony Montibello Gesendet: Mittwoch, 25. Juni 2008 05:17 An: Jason Cc: nagios-users@lists.sourceforge.net Betreff: Re: [Nagios-users] NSCA not receiving or send_nsca not sending There was a post on the time syncronization of NSCA just 2 weeks ago, the following is a copy of the email (I did not verify that this works) TOny SNIP Thanks guys! That would explain why the messages are not being received from the Windows machine as the time is, in fact, later than the time on the server. However, it does not make sense to me that it starts working when I reboot the Windows machine or in the case of the Linux machine I restart the nsca daemon. At any rate, I will give it a go. Regards, Jim Mohr - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA not receiving or send_nsca not se nding
Thanks guys! That would explain why the messages are not being received from the Windows machine as the time is, in fact, later than the time on the server. However, it does not make sense to me that it starts working when I reboot the Windows machine or in the case of the Linux machine I restart the nsca daemon. At any rate, I will give it a go. Regards, Jim Mohr In any case, the logs at Nagios server that i mentioned earlier should give u info on the source of the problem. If u dont receive logs at all, then most probably the client is unable to connect to the nsca daemon - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA not receiving or send_nsca not sending
Hi All! We're running Nagios 3.0.1 on RedHat ES 5.1, and have a problem where after a while no new messages are being received by NSCA. Sometimes it is from a Windows 2003 machine (send_nsca 2.4). Currently, this is the only Windows machine that is sending messages to the Nagios server, so I cannot tell if there is a generic problem with Windows. However, there have been occassions when no new messages can be sent from another machine (Linux). When the Linux machines have problems, the solution is to restart the NSCA daemon. However, with Windows there were a few dozen send_nsca processes and restarting the NSCA daemon did not help. Rather than killing all of the send_nsca processes individually, I rebooted the Windows machine and that worked. Perhaps one of the problems is the Linux machine running the send_nsca is the Nagios server. (I did try it from a different machine and no messages were sent either) It might seem sort of wierd doing things like that, but there is a reason. We need to check the state and usage of volumes on a NetApp. For all of the real important volumes there is a service defined. However, we also what to know if any of the others filesystems have problems or are getting full. So, we have one active service that runs a script to check *all* of the filesystems. The status is then send to the server (the same machine) using send_nsca. This is done whether there is a service defined or not because the script has no way of telling if there is a real service (although I guess I could build something into the script). If there is a real service, then the status is set. If not, the message is swollowed by Nagios. To avoid the problem of too many send_nsca connections, I thought about writing something directly to the cmd-file rather than going through send_nsca. I also thought about having two scripts. One that checks for a single filesystem and would be an active check. The second would check all of the filesystems that do not have their own service and then would send just a single message using send_nsca. Suggestions, ideas, comments, etc are all welcome. Regards, Jim Mohr - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA not receiving or send_nsca not se nding
eems to me that it is a time synchronization issue. NSCA only works when the time between the client and server is sync veri accurately. (even 30 secs diff will cause packets to be dropped). Check your logs at your Nagios server for any errors like dropping packet due to future timestamp. (/var/log/message, /var/log/messages, /var/log/syslog) If the problem exists, sync client's time with the server and set the client's sync interval (default for windows is to sync once per week) or u can get a ntp server - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA not receiving or send_nsca not sending
There was a post on the time syncronization of NSCA just 2 weeks ago, the following is a copy of the email (I did not verify that this works) TOny dateFri, Jun 13, 2008 at 10:44 AM subjectRe: [Nagios-users] Dropping packet with future timestamp, NSCA mailing listnagios-users.lists.sourceforge.net Filter messages from this mailing list mailed-bylists.sourceforge.net hide details Jun 13 (12 days ago) Reply Just change the source: ---8--- /* check the timestamp in the packet */ packet_time=(time_t)ntohl(receive_packet.timestamp); time(current_time); //if(packet_timecurrent_time){ //syslog(LOG_ERR,Dropping packet with future //timestamp.); ///*return;*/ // close(sock); //if(mode==SINGLE_PROCESS_DAEMON) //return; //else //do_exit(STATE_OK); //} //else{ packet_age= abs ( (unsigned long)(current_time-packet_time) ); syslog(LOG_ERR,Time diff: %lu seconds, packet_age); if(max_packet_age0 (packet_agemax_packet_age)){ syslog(LOG_ERR,Dropping packet with stale timestamp - packet was %lu seconds old or new.,packet_age); /*return;*/ close(sock); if(mode==SINGLE_PROCESS_DAEMON) return; else do_exit(STATE_OK); } //} ---8--- If you want, you can see now the Time diff in the syslog. Workes for me. Perhaps that should be an option in the next release! Chris - Hide quoted text - On Do, Jun 12, 2008 at 06:08:10 +0200, Vinogratzky wrote: Hi list, I have some passive checks with NSCA and my clients are a few seconds ahead. The server rejects the package with: ---8--- nsca[10986]: Dropping packet with future timestamp. ---8--- max_packet_age=0 doesn't help here. Any hints? (No, I'm not able to sync the Clients with ntp.) Thx, Chris On Tue, Jun 24, 2008 at 9:20 PM, Jason [EMAIL PROTECTED] wrote: eems to me that it is a time synchronization issue. NSCA only works when the time between the client and server is sync veri accurately. (even 30 secs diff will cause packets to be dropped). Check your logs at your Nagios server for any errors like dropping packet due to future timestamp. (/var/log/message, /var/log/messages, /var/log/syslog) If the problem exists, sync client's time with the server and set the client's sync interval (default for windows is to sync once per week) or u can get a ntp server - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] nsca and comments questions..
So I have 2 servers server1 server2 - - server1 lives inside my network and server2 lives in a DMZ. All of my customers login to server2 in order to see the status of the networks I monitor. The two servers replicate the information with nsca. It seems that when a systems goes down it replicates perfectly but when I comment on it the comments or anything do not go. Also does server2 have to ping all the systems as well? They it currently is pinging everything along with server1. What did I do wrong? Thanks everyone. - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] nsca and comments questions..
On Jun 5, 2008, at 10:56 AM, matt wells wrote: So I have 2 servers server1 server2 - - server1 lives inside my network and server2 lives in a DMZ. All of my customers login to server2 in order to see the status of the networks I monitor. The two servers replicate the information with nsca. One sends to the other or both send to each other? It seems that when a systems goes down it replicates perfectly but when I comment on it the comments or anything do not go. That's correct. Event handlers, which I assume you are using to call send_nsca, only fire on check results. There's no built-in way to replicate comments, acknowledgements, or any other CGI initiated action (except forced checks). Also does server2 have to ping all the systems as well? They it currently is pinging everything along with server1. Not if you disable active checks or set the check period to none. -- Marc - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] nsca and comments questions..
Please always reply on list so that others now, or in the future, can benefit from your experience. You'll also have better luck finding out your options since I only have my personal experiences to draw on. On Jun 5, 2008, at 1:04 PM, matt wells wrote: Server1 sends to server2 only. One way communications. You mention There's no built-in way to replicate comments, acknowledgements, or any other CGI initiated action (except forced checks). Is there another way to do this? Nothing currently exists that I am aware of but there are certainly ways to do it. - create a client/daemon similar to NSCA that can send arbitrary external commands to the nagios external command pipe on remote machines - write an event-broker module that intercepts those events and sends appropriate commands to the above client or - tail -f nagios.log looking for those events and send appropriate commands to the above client. The client/daemon might be as simple as an authorized_keys ssh (untested) -- ssh [EMAIL PROTECTED] 'echo [timestamp] EXTERNALCOMMAND;options / usr/local/nagios/var/rw/nagios.cmd' I'd probably opt for calling a script, that the key was limited to running, that accepted the external command and options as variables then wrote to the pipe. -- Marc - Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA and long output
As it turns out, getting long output working was a little easier than I anticipated. The UI turns \n into BR. So I'm able to bundle the long output into the standard plugin output. All I had to do in the end was pass $LONGSERVICEOUTPUT$ with $SERVICEOUTPUT$ like this: define command{ command_namesubmit_check_result command_line /usr/local/nagios/libexec/eventhandlers/submit_check_result $HOSTNAME$ '$SERVICEDESC$' $SERVICESTATE$ '$SERVICEOUTPUT$' '$LONGSERVICEOUTPUT$' } Then I copied submit_check_result script from the Distributed Monitoring Doc that's on the web, and modified it accordingly: ...snip... case $3 in OK) return_code=0 ;; WARNING) return_code=1 ;; CRITICAL) return_code=2 ;; UNKNOWN) return_code=-1 ;; esac if [[ $5 ]] then LONGOUT=$4\n$5 else LONGOUT=$4 fi printf %s\t%s\t%s\t%s\n $1 $2 $return_code $LONGOUT | /usr/local/nagios/bin/send_nsca -H myhost.com -c /usr/local/nagios/etc/send_nsca.cfg ...snip... Just don't ask me how to turn this back into $LONGSERVICEOUTPUT$ once it's on the master Nagios server. Aaron Mills Systems Administrator Return Path, Inc. http://www.returnpath.net [EMAIL PROTECTED] - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA and long output
What I'm really trying to get is multi-line output. Nagios/NRPE supports this (v3.x) e.g.: TEXT OUTPUT | OPTIONAL PERFDATA LONG TEXT LINE 1 LONG TEXT LINE 2 This works for NRPE, but not with NSCA, since NSCA can only take a single line as an argument. What I'm trying to achieve is sending process monitoring data back passively like so: PROCS OK: Proc1: 3 processes running Proc2: 1 process running Proc3: 1 process running Any ideas? -Aaron Aaron Mills Systems Administrator Return Path, Inc. http://www.returnpath.net [EMAIL PROTECTED] From: Mike Hamrick [EMAIL PROTECTED] Subject: Re: [Nagios-users] NSCA and long output To: Nagios Users Mailinglist nagios-users@lists.sourceforge.net Message-ID: [EMAIL PROTECTED] Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Hi Aaron, You wrote: I've been trying to figure out if this is possible for a while. I'm using NRPE and $LONGHOSTOUTPUT$ for a number of tests, which is great, except for passive monitoring. We have several data centers that run their own Nagios boxes and then ship the data back to the master Nagios server via NSCA. The problem is that I can't get NSCA to utilize the $LONGHOSTOUTPUT$ - this is kind of critical for things like log file checks, etc. With NSCA this data doesn't get passed. Looking at the NSCA sources, common.h has: #define MAX_PLUGINOUTPUT_LENGTH 512 I'm guessing that's the issue right there. The first thing I'd try is to bump that up to 4096, and recompile send_nsca and nsca. I haven't looked very carefully at the source or tried this myself, but it seems like a good place to start. Mike - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA and long output
On May 28, 2008, at 11:54 AM, Aaron Mills wrote: What I'm really trying to get is multi-line output. Nagios/NRPE supports this (v3.x) e.g.: TEXT OUTPUT | OPTIONAL PERFDATA LONG TEXT LINE 1 LONG TEXT LINE 2 This works for NRPE, but not with NSCA, since NSCA can only take a single line as an argument. What I'm trying to achieve is sending process monitoring data back passively like so: PROCS OK: Proc1: 3 processes running Proc2: 1 process running Proc3: 1 process running Any ideas? AFAIK, you're in uncharted territory. Based on what I've seen, the longer output support has only really been applied to active commands but the documentation does indicate that support for passive results is there. NSCA hasn't been updated to handle multi-line output as you've seen nor have I seen any talk that it's coming soon. I also expect that you might see problems using the normal mechanism of writing the check results to the external command _pipe_ due to OS size limitations there (~1K typically). I expect that would only be a problem with large data returns (nagios internally supports $OUTPUT$ + $LONGOUTPUT$ of about ~4K). At this point I think you're going to have to roll your own check result return mechanism that either writes appropriately sized commands to the external pipe or larger ones to a file and uses the external command PROCESS_FILE to reap the results. -- Marc - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA and long output
All, I've been trying to figure out if this is possible for a while. I'm using NRPE and $LONGHOSTOUTPUT$ for a number of tests, which is great, except for passive monitoring. We have several data centers that run their own Nagios boxes and then ship the data back to the master Nagios server via NSCA. The problem is that I can't get NSCA to utilize the $LONGHOSTOUTPUT$ - this is kind of critical for things like log file checks, etc. With NSCA this data doesn't get passed. Has anyone run into this problem before? If so, is there an easy way around the problem other than strictly using NRPE and active monitoring? Thanks. -Aaron Aaron Mills Systems Administrator Return Path, Inc. http://www.returnpath.net [EMAIL PROTECTED] mailto:[EMAIL PROTECTED] - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA and long output
Hi Aaron, You wrote: I've been trying to figure out if this is possible for a while. I'm using NRPE and $LONGHOSTOUTPUT$ for a number of tests, which is great, except for passive monitoring. We have several data centers that run their own Nagios boxes and then ship the data back to the master Nagios server via NSCA. The problem is that I can't get NSCA to utilize the $LONGHOSTOUTPUT$ - this is kind of critical for things like log file checks, etc. With NSCA this data doesn't get passed. Looking at the NSCA sources, common.h has: #define MAX_PLUGINOUTPUT_LENGTH 512 I'm guessing that's the issue right there. The first thing I'd try is to bump that up to 4096, and recompile send_nsca and nsca. I haven't looked very carefully at the source or tried this myself, but it seems like a good place to start. Mike - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA processes hanging around
No one seems to have replied to this post. So I thought I may as well reply to myself :) I think I have some more information about the cause of my issue and maybe someone else can confirm or refute my suggestion. Whilst running the Nagios beta versions we were experiencing memory leaks (as documented in the change logs). Rather than roll back to 2.9 I was happy to script a restart of Nagios daily and wait for the problem to be fixed. Now I have removed the restarting of Nagios I no longer get any nsca processes hanging around. So can anyone confirm the behaviour of NSCA when nagios is shutting down or has shutdown? As always, any and all help is appreciated. Thanks, Gareth. -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Gareth Watson Sent: 17 March 2008 09:03 To: nagios-users@lists.sourceforge.net Subject: [Nagios-users] NSCA processes hanging around Hello, Over time I notice a number (a large number, ~500) of nsca processes hanging around. I used netstat to see what they were up to and they where all in the TIME_WAIT status. Reading on the internet I think I understand what this means. Linux is not closing the socket as it is waiting for any delayed traffic on the network to appear. I have no problem with this as my first thought would be that I will see a slow shift in the process numbers as sockets are eventually closed and opened by new requests. This is not, however, the behaviour I observed! The processes would wait indefinitely. Now, clutching at straws I think I have managed to fix the problem. I removed the REUSE flag from the nsca xinetd configuration: # default: on # description: NSCA service nsca { #Commented out the line below in the vain hope it would fix nsca's wagon #FLAGS = REUSE socket_type = stream wait = no user = nagios group = nagios server = /usr/local/nagios/bin/nsca server_args = -c /usr/local/nagios/etc/nsca.cfg --inetd log_on_failure += USERID disable = no } Again some research on the net has told me a little bit about the REUSE flag but I wanted to gather the opinions from those with much more knowledge than I. Therefore, can anyone tell me if this is a reasonable thing to do? Have I made a grievous error without even knowing? Has anyone else experienced this behaviour? I really appreciate anyone who has taken the time to read this and would love some feedback if you get the chance. Many thanks, Gareth Watson - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA processes hanging around
Hello, Over time I notice a number (a large number, ~500) of nsca processes hanging around. I used netstat to see what they were up to and they where all in the TIME_WAIT status. Reading on the internet I think I understand what this means. Linux is not closing the socket as it is waiting for any delayed traffic on the network to appear. I have no problem with this as my first thought would be that I will see a slow shift in the process numbers as sockets are eventually closed and opened by new requests. This is not, however, the behaviour I observed! The processes would wait indefinitely. Now, clutching at straws I think I have managed to fix the problem. I removed the REUSE flag from the nsca xinetd configuration: # default: on # description: NSCA service nsca { #Commented out the line below in the vain hope it would fix nsca's wagon #FLAGS = REUSE socket_type = stream wait = no user = nagios group = nagios server = /usr/local/nagios/bin/nsca server_args = -c /usr/local/nagios/etc/nsca.cfg --inetd log_on_failure += USERID disable = no } Again some research on the net has told me a little bit about the REUSE flag but I wanted to gather the opinions from those with much more knowledge than I. Therefore, can anyone tell me if this is a reasonable thing to do? Have I made a grievous error without even knowing? Has anyone else experienced this behaviour? I really appreciate anyone who has taken the time to read this and would love some feedback if you get the chance. Many thanks, Gareth Watson - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA problem
On Feb 15, 2008, at 6:07 AM, Giulio Botto wrote: Hi all, We have a distributed Nagios network that monitors about a hundred hosts and the relative services on our customer's networks. Most of them have Nagios servers which send check results using NSCA. On the master server Nagios 2.10 collects data through the NSCA daemon. As services and hosts increas in number so do checks and it appears NSCA daemon is having some problems. We have seen up to several hundred daemons hang on the server and Please post the output of /path/to/nagios -s /path/to/nagios.cfg. You'll probably also want to adjust the value of command_check_interval in nagios.cfg if you haven't changed it from the default of 1 minute. -- Marc - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA problem
Please always respond on-list so that others can help and learn from your experience. -Original Message- From: Giulio Botto [mailto:[EMAIL PROTECTED] Sent: Friday, February 15, 2008 8:02 AM To: Marc Powell Subject: Re: [Nagios-users] NSCA problem In the meantime I realized the central Nagios is still a 2.7 while NSCA is 2.7.2. At the moment upgrading this specific Nagios is not an option. It doesn't matter. They're not tied in that manner. command_check_interval=-1 Did you just change this or is this what it was? # nagios -s /etc/nagios/nagios.cfg Nagios 2.7 Copyright (c) 1999-2007 Ethan Galstad (http://www.nagios.org) Last Modified: 01-19-2007 License: GPL Projected scheduling information for host and service checks is listed below. This information assumes that you are going to start running Nagios with your current config files. HOST SCHEDULING INFORMATION --- Total hosts: 105 Total scheduled hosts: 0 Host inter-check delay method: SMART Average host check interval: 0.00 sec Host inter-check delay: 0.00 sec Max host check spread: 30 min First scheduled check: N/A Last scheduled check:N/A SERVICE SCHEDULING INFORMATION --- Total services: 398 Total scheduled services: 124 Service inter-check delay method: SMART Average service check interval: 224.03 sec Inter-check delay: 1.81 sec Interleave factor method: SMART Average services per host: 3.79 Service interleave factor: 2 Max service check spread: 30 min First scheduled check: Fri Feb 15 15:00:18 2008 Last scheduled check: Fri Feb 15 15:04:00 2008 CHECK PROCESSING INFORMATION Service check reaper interval: 10 sec Max concurrent service checks: Unlimited PERFORMANCE SUGGESTIONS --- I have no suggestions - things look okay. This is all good. You don't have a lot of services at all. How frequently are you sending results? On my own systems I'm easily processing at least 13 results/sec. I know that there are others doing more but there is some point at which nagios can't keep up. Are there other problems with the machine? Disk or memory issues? Do you have embedded perl or performance data processing enabled? -- Marc - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA problem
Marc Powell wrote: Please always respond on-list so that others can help and learn from your experience. Yep, sorry: that's why I later forwarded my reply to the list. -Original Message- From: Giulio Botto [mailto:[EMAIL PROTECTED] Sent: Friday, February 15, 2008 8:02 AM To: Marc Powell Subject: Re: [Nagios-users] NSCA problem In the meantime I realized the central Nagios is still a 2.7 while NSCA is 2.7.2. At the moment upgrading this specific Nagios is not an option. It doesn't matter. They're not tied in that manner. command_check_interval=-1 Did you just change this or is this what it was? I changed this some months ago, at least 4 I'd say. [...] PERFORMANCE SUGGESTIONS --- I have no suggestions - things look okay. This is all good. You don't have a lot of services at all. How frequently are you sending results? On my own systems I'm easily processing at least 13 results/sec. I know that there are others doing more but there is some point at which nagios can't keep up. The problem we have seems to lay in the number of nsca daemons on the master machine. Also I do not understand why running nsca with the --single option only processes the first message it receives and discards the rest. Are there other problems with the machine? Disk or memory issues? Do you have embedded perl or performance data processing enabled? CPU and RAM seem ok up until something goes badly wrong and nagios starts forking and allocating more memory, then swap. We have embedded perl and no data processing. -- Giulio Botto -- [EMAIL PROTECTED] PGP fingerprint = 1979 A78A 8F82 DB5E 55E9 D6D6 6AB6 0BA9 FDB7 6789 - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA problem
Marc Powell wrote: -Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of Giulio Botto Sent: Friday, February 15, 2008 9:54 AM To: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] NSCA problem Marc Powell wrote: This is all good. You don't have a lot of services at all. How frequently are you sending results? On my own systems I'm easily processing at least 13 results/sec. I know that there are others doing more but there is some point at which nagios can't keep up. The problem we have seems to lay in the number of nsca daemons on the master machine. The number would increase if the NSCA daemons were unable to write to the external command pipe. That could be because the results are coming in faster than nagios is processing them. The command pipe is only going to hold about 4K of data then block until it's cleared. If I read the docs correctly external_command_buffer_slots=4096 hold 4096 commands in the queue before it starts blocking. That's the value we have at the moment. Also I do not understand why running nsca with the --single option only processes the first message it receives and discards the rest. Nor do I. I run with -s and haven't experienced that problem. Running nsca-2.1 here. Have you tried putting nsca into debug mode and monitoring that? Running strace on the process would be informative as well. Will do over the weekend: it's something we still haven't had time to do. I did have a similar problem years ago on a machine with a failing disk. The failures seemed harmless but ended up causing regular backlogs of NSCA processes as you indicate. Fixing the disk problem resolved the issue. That's why I asked. I see how this could cause problems especially since the server is also a mail content filter, but disks are hardware RAID5 on a Dell Perc controller monitored by Nagios and they appear fine. Thanks, -- Giulio Botto -- [EMAIL PROTECTED] PGP fingerprint = 1979 A78A 8F82 DB5E 55E9 D6D6 6AB6 0BA9 FDB7 6789 - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA problem
I see how this could cause problems especially since the server is also a mail content filter, but disks are hardware RAID5 on a Dell Perc controller monitored by Nagios and they appear fine. If you are sending a lot of stand-alone NSCA results, it is only natural that you will start to get delays and hanging processes at some point (before you reach ~1000/checks minute at current hardware). But this should materialize itself also at the sending servers - is latency there OK still? Anyway, if this is the case, solution is to bundle up nsca checks and only send a batch every 10-20 seconds. Should reduce load and latency both at the sending and receiving server. Best regards, Steffen Poulsen - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA problem
-Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of Giulio Botto Sent: Friday, February 15, 2008 9:54 AM To: nagios-users@lists.sourceforge.net Subject: Re: [Nagios-users] NSCA problem Marc Powell wrote: This is all good. You don't have a lot of services at all. How frequently are you sending results? On my own systems I'm easily processing at least 13 results/sec. I know that there are others doing more but there is some point at which nagios can't keep up. The problem we have seems to lay in the number of nsca daemons on the master machine. The number would increase if the NSCA daemons were unable to write to the external command pipe. That could be because the results are coming in faster than nagios is processing them. The command pipe is only going to hold about 4K of data then block until it's cleared. Also I do not understand why running nsca with the --single option only processes the first message it receives and discards the rest. Nor do I. I run with -s and haven't experienced that problem. Running nsca-2.1 here. Have you tried putting nsca into debug mode and monitoring that? Running strace on the process would be informative as well. Are there other problems with the machine? Disk or memory issues? Do you have embedded perl or performance data processing enabled? CPU and RAM seem ok up until something goes badly wrong and nagios starts forking and allocating more memory, then swap. We have embedded perl and no data processing. I did have a similar problem years ago on a machine with a failing disk. The failures seemed harmless but ended up causing regular backlogs of NSCA processes as you indicate. Fixing the disk problem resolved the issue. That's why I asked. -- Marc - This SF.net email is sponsored by: Microsoft Defy all challenges. Microsoft(R) Visual Studio 2008. http://clk.atdmt.com/MRT/go/vse012070mrt/direct/01/ ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
[Nagios-users] NSCA Confusing host check results as service checks
Original msg: Warning: Passive check result was received for service '0' on host 'server1', but the service could not be found! NSCA daemon makes the host/service check determination based simply on the number of arguments. If you take a remote nagios instance which is reporting passive host/service checks, you can bump the debugging_level=384 and see something like this: [1196258852.155446] [256.1] [pid=20619] Running command '/usr/local/libexec/nagios/send_host_check fbsd05 0 'PING OK - Packet loss = 0%, RTA = 0.67 ms''... In this case, there are the proper number of arguments: 3 But NSCA recieving this msg is seeing 4 arguments. A closer look at /usr/local/libexec/nagios/send_host_check reveals why: #!/bin/sh /bin/echo $1,$2,$3 | /usr/local/sbin/send_nsca -H master.tld \ -c /usr/local/etc/nagios/send_nsca.cfg -d , This means that depending on how shit is escaped within: - Nagios on remote node - /bin/sh - send_nsca binary Therefore, a $3 value of: 'PING OK - Packet loss = 0%, RTA = 0.67 ms' May or may not be truncated into three values. I'm going to track it down further. ~BAS -- Forwarded message -- Date: Mon, 19 Nov 2007 13:52:21 -0500 (EST) From: Brian A. Seklecki [EMAIL PROTECTED] To: [EMAIL PROTECTED] Cc: nagios-users@lists.sourceforge.net, [EMAIL PROTECTED] Subject: RE: [Nagios-users] nsca PROCESS_HOST_CHECK_RESULT problem Bratislav: Did you ever resolve these bizaare PROCESS_HOST_CHECK_RESULT / PROCESS_SERVICE_CHECK_RESULT errors? I'm having trouble debugging them as well ~BAS If I simulate host down I got HOST_CHECK from nagios.log on remote host: EXTERNAL COMMAND: PROCESS_HOST_CHECK_RESULT;server1;1;PING CRITICAL - Packet loss = 100% but regularly, it is triggered like it is service check, and PROCESS_HOST_CHECK_RESULT is not performed at all. [1175605177] EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;server1;0;0; RTA = 0.18 ms [1175605177] Warning: Passive check result was received for service '0' on host 'server1', but the service could not be found! l8* -lava (Brian A. Seklecki - Pittsburgh, PA, USA) http://www.spiritual-machines.org/ Guilty? Yeah. But he knows it. I mean, you're guilty. You just don't know it. So who's really in jail? ~Maynard James Keenan - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA Confusing host check results as service checks
It looks like the problem is caused by using , as the delimiter. There is a , in the status string PING OK... which would provide NSCA with the fouth argument, and make it a service result rather than a host result. -- Bob Sloane, University of Kansas Computer Center, Lawrence, KS, 66045 Email:[EMAIL PROTECTED] http://www.ku.edu/home/sloane Phone:(785)864-0444 -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Brian A. Seklecki Sent: Wednesday, November 28, 2007 10:07 AM To: [EMAIL PROTECTED]; nagios-users@lists.sourceforge.net; [EMAIL PROTECTED] Subject: [Nagios-users] NSCA Confusing host check results as service checks Original msg: Warning: Passive check result was received for service '0' on host 'server1', but the service could not be found! NSCA daemon makes the host/service check determination based simply on the number of arguments. If you take a remote nagios instance which is reporting passive host/service checks, you can bump the debugging_level=384 and see something like this: [1196258852.155446] [256.1] [pid=20619] Running command '/usr/local/libexec/nagios/send_host_check fbsd05 0 'PING OK - Packet loss = 0%, RTA = 0.67 ms''... In this case, there are the proper number of arguments: 3 But NSCA recieving this msg is seeing 4 arguments. A closer look at /usr/local/libexec/nagios/send_host_check reveals why: #!/bin/sh /bin/echo $1,$2,$3 | /usr/local/sbin/send_nsca -H master.tld \ -c /usr/local/etc/nagios/send_nsca.cfg -d , This means that depending on how shit is escaped within: - Nagios on remote node - /bin/sh - send_nsca binary Therefore, a $3 value of: 'PING OK - Packet loss = 0%, RTA = 0.67 ms' May or may not be truncated into three values. I'm going to track it down further. ~BAS -- Forwarded message -- Date: Mon, 19 Nov 2007 13:52:21 -0500 (EST) From: Brian A. Seklecki [EMAIL PROTECTED] To: [EMAIL PROTECTED] Cc: nagios-users@lists.sourceforge.net, [EMAIL PROTECTED] Subject: RE: [Nagios-users] nsca PROCESS_HOST_CHECK_RESULT problem Bratislav: Did you ever resolve these bizaare PROCESS_HOST_CHECK_RESULT / PROCESS_SERVICE_CHECK_RESULT errors? I'm having trouble debugging them as well ~BAS If I simulate host down I got HOST_CHECK from nagios.log on remote host: EXTERNAL COMMAND: PROCESS_HOST_CHECK_RESULT;server1;1;PING CRITICAL - Packet loss = 100% but regularly, it is triggered like it is service check, and PROCESS_HOST_CHECK_RESULT is not performed at all. [1175605177] EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;server1;0;0; RTA = 0.18 ms [1175605177] Warning: Passive check result was received for service '0' on host 'server1', but the service could not be found! l8* -lava (Brian A. Seklecki - Pittsburgh, PA, USA) http://www.spiritual-machines.org/ Guilty? Yeah. But he knows it. I mean, you're guilty. You just don't know it. So who's really in jail? ~Maynard James Keenan -- --- SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA Confusing host check results as service checks
On Wed, 28 Nov 2007, Sloane, Robert Raymond wrote: It looks like the problem is caused by using , as the delimiter. There is a , in the status string PING OK... which would provide Well, yea, but the question is how to address it. It really seems like /usr/local/libexec/nagios/send_{host,service}_check is superfluous. Its a lot of extra forking and pipeing on an already deadly inefficient system. Of course, one cheap hack deserves another and for the time being it seems that: /bin/echo $1//$2//$3 | /usr/local/sbin/send_nsca -H master \ -c /usr/local/etc/nagios/send_nsca.cfg -d // Using // as a delimeter is not likely to show up in any macro of plugin output (Except for maybe Samba? fuck screw that) ~BAS - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null
Re: [Nagios-users] NSCA Confusing host check results as service checks
-Original Message- From: [EMAIL PROTECTED] [mailto:nagios-users- [EMAIL PROTECTED] On Behalf Of Brian A. Seklecki Sent: Wednesday, November 28, 2007 10:26 AM To: nagios-users@lists.sourceforge.net; Sloane, Robert Raymond Subject: Re: [Nagios-users] NSCA Confusing host check results as service checks On Wed, 28 Nov 2007, Sloane, Robert Raymond wrote: It looks like the problem is caused by using , as the delimiter. There is a , in the status string PING OK... which would provide Well, yea, but the question is how to address it. Where did you get a send_nsca that expects comma-delimited input? The standard send_nsca expects tab-delimited input so this problem doesn't arise. -- Marc - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Nagios-users mailing list Nagios-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null