RE: Mon Server Goes Foobar, help!

2004-01-12 Thread Gary Richardson








Hey All,



Im doing some more research into
this. A ran d:Profile for perl and found that 96% of the time is spent in
Sys::Syslog::_syslog_send_socket. Is this normal?



Thanks.







-Original Message-
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On
Behalf Of Gary Richardson
Sent: Friday, January 09, 2004
8:09 PM
To: [EMAIL PROTECTED]
Subject: Mon Server Goes Foobar,
help!



Hey,



I have a mon server that has been
running fine for a few months. All of a sudden it is doing crazy things. We are
using mon.cgi for reporting. It is now timing out 9 out of 10 times. When you
telnet to the mon port and try to issue commands, sometimes it hangs for a long
time and others it hangs for 10 seconds.



Running top shows all of the
monitors going off at the same time instead of the normal random
intervals. I have a feeling this is related.



I have a feeling that a perl module
got upgraded in the background and is causing this problem. There havent
been any configuration changes since before Christmas.



Has anyone experienced this or
something similar before?



Thanks. 






___
mon mailing list
[EMAIL PROTECTED]
http://linux.kernel.org/mailman/listinfo/mon


RE: Mon Server Goes Foobar, help!

2004-01-12 Thread David Nolan


--On Monday, January 12, 2004 12:09 PM -0800 Gary Richardson 
[EMAIL PROTECTED] wrote:

I'm doing some more research into this. A ran --d:Profile for perl and
found that 96% of the time is spent in Sys::Syslog::_syslog_send_socket.
Is this normal?
Sounds like your syslog server may be having problems.  But then I've never 
run Mon in the Profiler, so I don't know if thats really abnormal.

Try disabling syslog's from Mon and see if that helps.

The other situation in which I've seen Mon have problems like yours is when 
an alert script is hanging.  But I don't have that problem any more, 
because I've long since patched my copy of Mon to handle fork alerts 
cleanly, and clean them up during the normal child processing code.

-David Nolan
Network Software Developer
Computing Services
Carnegie Mellon University
___
mon mailing list
[EMAIL PROTECTED]
http://linux.kernel.org/mailman/listinfo/mon


RE: Mon Server Goes Foobar, help!

2004-01-12 Thread Eric Sorenson
On Mon, 12 Jan 2004, Gary Richardson wrote:

 I'm doing some more research into this. A ran -d:Profile for perl and found
 that 96% of the time is spent in Sys::Syslog::_syslog_send_socket. Is this
 normal?

Maybe the output logfile is set to fsync-on-write. This is the (unfortunate)
default for some syslogs.  Try prepending the filename with a '-' to turn it
off, like:

local1.*-/var/log/mon.log

NB not all syslogs support this, check your local man pages for details.

-- 
  Eric Sorenson - Systems / Network Administrator, MIS - Transmeta Corporation


___
mon mailing list
[EMAIL PROTECTED]
http://linux.kernel.org/mailman/listinfo/mon


RE: Mon Server Goes Foobar, help!

2004-01-12 Thread Gary Richardson
For the mailing list archives, it turns out there was a routing problem
preventing our name servers from reaching our ISP's authoritative name
servers. This stopped us from getting reverse DNS for our internet address
space which caused our syslog server (which was functioning as a syslog
collector) to block. Mon would then wait on the block.

Thanks for all the help guys. Without it, it would have been a long time
before I found the problem.

Out.

-Original Message-
From: Eric Sorenson [mailto:[EMAIL PROTECTED] 
Sent: Monday, January 12, 2004 12:45 PM
To: Gary Richardson
Cc: [EMAIL PROTECTED]
Subject: RE: Mon Server Goes Foobar, help!

On Mon, 12 Jan 2004, Gary Richardson wrote:

 I'm doing some more research into this. A ran -d:Profile for perl and
found
 that 96% of the time is spent in Sys::Syslog::_syslog_send_socket. Is this
 normal?

Maybe the output logfile is set to fsync-on-write. This is the (unfortunate)
default for some syslogs.  Try prepending the filename with a '-' to turn it
off, like:

local1.*-/var/log/mon.log

NB not all syslogs support this, check your local man pages for details.

-- 
  Eric Sorenson - Systems / Network Administrator, MIS - Transmeta
Corporation




___
mon mailing list
[EMAIL PROTECTED]
http://linux.kernel.org/mailman/listinfo/mon


Mon Server Goes Foobar, help!

2004-01-09 Thread Gary Richardson








Hey,



I have a mon server that has been running fine for a few
months. All of a sudden it is doing crazy things. We are using mon.cgi for
reporting. It is now timing out 9 out of 10 times. When you telnet to the mon
port and try to issue commands, sometimes it hangs for a long time and others
it hangs for 10 seconds.



Running top shows all of the monitors going off at the same
time instead of the normal random intervals. I have a feeling
this is related.



I have a feeling that a perl module got upgraded in the
background and is causing this problem. There havent been any
configuration changes since before Christmas.



Has anyone experienced this or something similar before?



Thanks. 






___
mon mailing list
[EMAIL PROTECTED]
http://linux.kernel.org/mailman/listinfo/mon