Re: [Nagios-users] problem with nsca

2006-02-15 Thread basile au siris

Marc Powell wrote:

 


-Original Message-
From: [EMAIL PROTECTED] [mailto:nagios-users-
[EMAIL PROTECTED] On Behalf Of basile au siris
Sent: Tuesday, February 14, 2006 8:29 AM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] problem with nsca

i install nagios and nsca on Fedora core 4
after a short time i have many process nsca and it seems it never
   


decrease
 


idea ?
   



Make sure you have proper permissions on the external command file (per
the documentation on External Commands)


Yes


. Make sure that Nagios is set to
check for external commands frequently enough (command_check_interval).
 


every 15 s


Try turning on NSCA debug and watch your messages log file for
interesting output.


just have nsca logs from checks and nsca : handling connection


If you have SELinux enabled, make sure that it's not
interfering with NSCA's ability to write to the external command file;
look for 'avc denied' messages related to the command file in
/var/log/messages.
 


selinux is disable
and in one hour i have 600 nsca process
what can i do
thanks
basile


--
Marc


---
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null
 





---
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] problem with nsca

2006-02-14 Thread basile au siris
i install nagios and nsca on Fedora core 4  
after a short time i have many process nsca and it seems it never decrease

idea ?
thanks
basile


---
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] nsca / distributed monitoring result problem

2006-01-16 Thread basile au siris

hi
maybe i have the same problem
i have distributed monitoring and the central server sometimes freeze 
and i just

have to reboot it ( electric )
i suspect nsca ( or hardware problem ) because sometimes i note there 
are many ( 50 )

nsca process and if i restart it all become normal again
hope we solve our problem
basile


Chris Goosen wrote:


Hello all..

I am running my nagios central server on an HP 2.4ghz with 512mb ram.

At present, I am monitoring 65 hosts with approx. 400 services.

After a reboot, everything works perfectly, but the longer my server 
runs, the more sluggish it gets and eventually the nsca processes 
consume all the memory and the server stops responding. What also 
happens it that I start getting hosts that are reported as down even 
though they have the correct ping response.. the error says “PLUGIN 
TIMED OUT after 10 seconds”


Here is an example of what I mean:

Host State Information

Host Status:



DOWN

Status Information:



CRITICAL - Plugin timed out after 10 seconds

Last Status Check:



01-16-2006 12:06:28

Status Data Age:



0d 0h 2m 57s

Last State Change:



01-16-2006 10:20:44

Current State Duration:



0d 1h 48m 41s

Last Host Notification:



01-16-2006 10:20:44

Current Notification Number:



2

Is This Host Flapping?



N/A



OK 01-16-2006 12:05:47 63d 19h 30m 59s 1/3 PING OK - Packet loss = 0%, 
RTA = 0.42 ms


I assume that these are related and that the lack of memory caused 
this problem, would an upgrade to from nagios 1.2 to nagios 1.3 fix 
this? If so, what is the best way to perform that upgrade?


my /etc/xinetd.d/nsca file :
# default: on
# description: NSCA
service nsca
{
flags = REUSE
socket_type = stream
wait = no
user = nagios
group = nagios
server = /usr/sbin/nsca
server_args = -c /home/e-smith/nagios/nsca.cfg --inetd
cps = 9000 30
instances = UNLIMITED
log_on_failure += USERID
disable = no
only_from = ip1, ip2, ip3, etc..
}

command_check_interval= -1

System info:

SME server 6.01 (2.4.20-18.7, i686)

Perl v5.6.1

Apache/1.3.27

Nagios 1.2

Any advice would be great… thanks.

Chris





---
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_idv37&alloc_id865&op=click
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue.
::: Messages without supporting info will risk being sent to /dev/null


Re: [Nagios-users] problem check_traceroute

2006-01-12 Thread basile au siris

it s not round robin problem
i thinks it s router problem because i have same error on many routers 
on the route

i try with 72.14.238.124 ,  64.233.175.246 and i have same error
why is there -- sometimes between two ip address ?
it seems to failed the first time there is --
basile


Tedman Eng wrote:


You'll want to traceroute by IP rather than by name.  Google uses DNS round
robin, so each time you traceroute, you'll get a different IP, and likely a
different path.  You may also want to notify Google that you're using their
resources for your automated tests.

 


-Original Message-----
From: basile au siris [mailto:[EMAIL PROTECTED]
Sent: Wednesday, January 11, 2006 2:54 AM
To: nagios-users@lists.sourceforge.net
Subject: [Nagios-users] problem check_traceroute


hi
i do check_traceroute -r and obtain a route
i do check_traceroute -R this route and have errors
can someone explain me what it means ?
here are logs

./check_traceroute.pl  -r   -H www.google.fr
Ok. Traceroute to host "www.google.fr" via route 
"195.220.107.253-195.221.127.97-195.221.127.110-193.51.181.102

-193.51.180.158--193.51.179.1--193.51.179.34-194.68.129.242-80
.231.79.42-80.231.64.25--80.231.66.4-80.231.66.50-72.14.238.12
8-72.14.238.119-64.233.175.246-66.249.94.46-66.249.94.54-66.24
9.93.99".


./check_traceroute.pl  -R 
"195.220.107.253-195.221.127.97-195.221.127.110-193.51.181.102

-193.51.180.158--193.51.179.1--193.51.179.34-194.68.129.242-80
.231.79.42-80.231.64.25--80.231.66.4-80.231.66.50-72.14.238.12
8-72.14.238.119-64.233.175.246-66.249.94.46-66.249.94.54-66.24
9.93.99"   
-H www.google.fr
Failed. Last 2 routers () repeated, 
"195.220.107.253-195.221.127.97-195.221.127.110-193.51.181.102

-193.51.180.158--".

traceroute to www.l.google.com (66.249.93.99), 30 hops max, 
38 byte packets

1  XX-MSFC-1 (195.220.107.253)  1.064 ms  0.802 ms  0.776 ms
2  cr-odeon.rap.prd.fr (195.221.127.97)  1.898 ms  1.653 ms  1.357 ms
3  195.221.127.110 (195.221.127.110)  2.153 ms  2.075 ms  1.569 ms
4  jussieu-g0-1-165.cssi.renater.fr (193.51.181.102)  1.639 
ms  2.288 
ms  1.532 ms
5  nri-c-pos2-0.cssi.renater.fr (193.51.180.158)  2.181 ms  
1.914 ms  
2.421 ms

MPLS Label=273 CoS=5 TTL=1 S=0
6  nri-a-gi1-0-0-11.cssi.renater.fr (193.51.179.1)  3.028 ms 
2.687 ms  
2.550 ms

MPLS Label=27 CoS=5 TTL=1 S=0
7  193.51.179.34 (193.51.179.34)  2.596 ms  1.934 ms  1.573 ms
8  TELEGLOBE-FRANCE-INTERNATIONAL.sfinx.tm.fr 
(194.68.129.242)  2.810 
ms  2.464 ms  2.751 ms
9  if-8-0.core1.PG1-Paris.teleglobe.net (80.231.79.42)  
2.369 ms  2.647 
ms  3.451 ms
10  if-6-0.core1.FR1-Frankfurt.teleglobe.net (80.231.64.25)  
12.886 ms  
12.652 ms  12.210 ms

MPLS Label=17 CoS=5 TTL=1 S=0
11  Vlan2.msfc1.FR1-Frankfurt.teleglobe.net (80.231.66.4)  12.801 ms  
13.139 ms  12.335 ms
12  Vlan108.msfc1.FR1-Frankfurt.teleglobe.net (80.231.66.50)  
12.124 ms  
12.111 ms  14.045 ms
13  72.14.238.128 (72.14.238.128)  13.207 ms  12.388 ms 72.14.238.124 
(72.14.238.124)  12.277 ms

14  72.14.238.119 (72.14.238.119)  16.894 ms  16.596 ms  16.491 ms
15  64.233.175.246 (64.233.175.246)  20.153 ms  18.976 ms  18.996 ms
16  66.249.94.46 (66.249.94.46)  21.722 ms  19.380 ms 216.239.43.42 
(216.239.43.42)  19.112 ms

17  66.249.94.54 (66.249.94.54)  30.050 ms  24.097 ms  30.946 ms
18  66.249.93.99 (66.249.93.99)  19.916 ms  20.407 ms  19.316 ms

thanks
basile



---
This SF.net email is sponsored by: Splunk Inc. Do you grep 
through log files

for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  
DOWNLOAD SPLUNK!

http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS 
when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null


   





---
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null


[Nagios-users] problem check_traceroute

2006-01-11 Thread basile au siris

hi
i do check_traceroute -r and obtain a route
i do check_traceroute -R this route and have errors
can someone explain me what it means ?
here are logs

./check_traceroute.pl  -r   -H www.google.fr
Ok. Traceroute to host "www.google.fr" via route 
"195.220.107.253-195.221.127.97-195.221.127.110-193.51.181.102-193.51.180.158--193.51.179.1--193.51.179.34-194.68.129.242-80.231.79.42-80.231.64.25--80.231.66.4-80.231.66.50-72.14.238.128-72.14.238.119-64.233.175.246-66.249.94.46-66.249.94.54-66.249.93.99".



./check_traceroute.pl  -R 
"195.220.107.253-195.221.127.97-195.221.127.110-193.51.181.102-193.51.180.158--193.51.179.1--193.51.179.34-194.68.129.242-80.231.79.42-80.231.64.25--80.231.66.4-80.231.66.50-72.14.238.128-72.14.238.119-64.233.175.246-66.249.94.46-66.249.94.54-66.249.93.99"   
-H www.google.fr
Failed. Last 2 routers () repeated, 
"195.220.107.253-195.221.127.97-195.221.127.110-193.51.181.102-193.51.180.158--".


traceroute to www.l.google.com (66.249.93.99), 30 hops max, 38 byte packets
1  XX-MSFC-1 (195.220.107.253)  1.064 ms  0.802 ms  0.776 ms
2  cr-odeon.rap.prd.fr (195.221.127.97)  1.898 ms  1.653 ms  1.357 ms
3  195.221.127.110 (195.221.127.110)  2.153 ms  2.075 ms  1.569 ms
4  jussieu-g0-1-165.cssi.renater.fr (193.51.181.102)  1.639 ms  2.288 
ms  1.532 ms
5  nri-c-pos2-0.cssi.renater.fr (193.51.180.158)  2.181 ms  1.914 ms  
2.421 ms

MPLS Label=273 CoS=5 TTL=1 S=0
6  nri-a-gi1-0-0-11.cssi.renater.fr (193.51.179.1)  3.028 ms  2.687 ms  
2.550 ms

MPLS Label=27 CoS=5 TTL=1 S=0
7  193.51.179.34 (193.51.179.34)  2.596 ms  1.934 ms  1.573 ms
8  TELEGLOBE-FRANCE-INTERNATIONAL.sfinx.tm.fr (194.68.129.242)  2.810 
ms  2.464 ms  2.751 ms
9  if-8-0.core1.PG1-Paris.teleglobe.net (80.231.79.42)  2.369 ms  2.647 
ms  3.451 ms
10  if-6-0.core1.FR1-Frankfurt.teleglobe.net (80.231.64.25)  12.886 ms  
12.652 ms  12.210 ms

MPLS Label=17 CoS=5 TTL=1 S=0
11  Vlan2.msfc1.FR1-Frankfurt.teleglobe.net (80.231.66.4)  12.801 ms  
13.139 ms  12.335 ms
12  Vlan108.msfc1.FR1-Frankfurt.teleglobe.net (80.231.66.50)  12.124 ms  
12.111 ms  14.045 ms
13  72.14.238.128 (72.14.238.128)  13.207 ms  12.388 ms 72.14.238.124 
(72.14.238.124)  12.277 ms

14  72.14.238.119 (72.14.238.119)  16.894 ms  16.596 ms  16.491 ms
15  64.233.175.246 (64.233.175.246)  20.153 ms  18.976 ms  18.996 ms
16  66.249.94.46 (66.249.94.46)  21.722 ms  19.380 ms 216.239.43.42 
(216.239.43.42)  19.112 ms

17  66.249.94.54 (66.249.94.54)  30.050 ms  24.097 ms  30.946 ms
18  66.249.93.99 (66.249.93.99)  19.916 ms  20.407 ms  19.316 ms

thanks
basile



---
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
___
Nagios-users mailing list
Nagios-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null