Dirk,

Thanks for looking at this :-)

The check that follows the last one in the log is another URL check for a
different web site on the same web server. Both checks look for basic web
site content and, if the content is missing, fail and send an SMTP message
and an SAMessenger message.

I've looked at the logs on the web server in question and there were indeed
errors reported at that time that would have caused it to stop responding.
(Time to patch it up to date)

The SMTP server(s) used for alert messages are on different servers.

All I did to make SA continue was reboot the web server in question.

Steve

-----Original Message-----
From: Dirk Bulinckx [mailto:[EMAIL PROTECTED]]
Sent: 29 November 2002 09:43
To: [EMAIL PROTECTED]
Subject: RE: [SA-list] A failed check locks up Servers Alive


Also looking at the log it seems that SA did do the URL test and saw the
problem (timeout).  So the blocking is caused by something that is done
after that test. (could be the alerts or the next test for example).
Also did you do something on the SA machine to make SA continue to work?
The web server that was rebooted, was that also the SMTP server that should
have been used for the alerts?  What alerts do you use on the URL check
entry?


dirk.



-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of Steve Davis
Sent: Fri Nov 29 10:31 AM
To: '[EMAIL PROTECTED]'
Subject: [SA-list] A failed check locks up Servers Alive
Importance: High


Hi,

For some reason a website check that timed out last night completely locked
up Servers Alive. Basically the web server failed, fine. Servers Alive
detected this (it is in the log file) but then stopped functioning. It didn't
send any alerts either. It should have at least sent an SMTP message.

We rebooted the web server this morning and as soon as it came up, Servers
Alive sprang back to life?!

We're running Servers Alive v3.3.1132 on Windows NT 4, SP6a.

Why did this happen and how can I stop it?

cheers

Steve

This is a clip from the log file (these are all private servers):

29 November 2002 03:14:00 Free space on \\lisa\d$ is : 11,754.23 MBytes
29 November 2002 03:14:06 Free space on \\marge\c$ is : 1,742.48 MBytes
29 November 2002 03:14:07 Free space on \\marge\d$ is : 9,501.67 MBytes
29 November 2002 03:14:20 Free space on \\homer\c$ is : 948.16 MBytes
29 November 2002 03:14:21 Free space on \\homer\d$ is : 12,902.37 MBytes
29 November 2002 03:14:28 Free space on \\poseidon\c$ is : 4,084.38 MBytes
29 November 2002 03:14:29 Free space on \\poseidon\d$ is : 17,120.24 MBytes
29 November 2002 03:14:29 Not checking host poseidon because of checking schedule
29 November 2002 03:14:31 Free space on \\hephaestus\c$ is : 18,066.18 MBytes
29 November 2002 03:14:32 Free space on \\hephaestus\d$ is : 19,004.25 MBytes
29 November 2002 03:14:32 Free space on \\hephaestus\e$ is : 38,255.43 MBytes
29 November 2002 03:24:35 Check cycle starts ( 1571- 1572)
29 November 2002 03:24:42 Free space on \\zeus\c$ is : 2,121.94 MBytes
29 November 2002 03:24:48 Free space on \\lisa\c$ is : 1,311.96 MBytes
29 November 2002 03:24:49 Free space on \\lisa\d$ is : 11,754.23 MBytes
29 November 2002 03:24:55 Free space on \\marge\c$ is : 1,742.48 MBytes
29 November 2002 03:24:56 Free space on \\marge\d$ is : 9,501.42 MBytes
29 November 2002 03:25:16 URL check (http://my.ebulletins.co.uk/) failed due to Timeout.
29 November 2002 08:46:55 Perfmon (\\homer\Processor(0)\% Processor Time) down
29 November 2002 08:46:56 ERR: connect to host failed:  53
29 November 2002 08:46:56 Can't do perfmon check, due to authentication problem
29 November 2002 08:46:56 Perfmon (\\homer\Processor(1)\% Processor Time) down
29 November 2002 08:46:56 ERR: connect to host failed:  53

It's usual for Servers Alive to fail the first round of checks after a
server has been rebooted. The second check went through fine with everything
working.
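
For anyone who wants to spot this kind of stall in their own log, here is a
rough Python sketch (not part of Servers Alive) that reports long silences
between log entries. It assumes the timestamp layout shown in the clip above
and English month names; the names parse_stamp and find_gaps and the
30-minute threshold are made up for illustration. Lines without a leading
timestamp (such as entries wrapped by the mail client) are skipped.

```python
from datetime import datetime, timedelta

# Assumed layout of the leading timestamp, e.g. "29 November 2002 03:25:16".
STAMP_FORMAT = "%d %B %Y %H:%M:%S"


def parse_stamp(line):
    """Return the leading timestamp of a log line, or None if it has none."""
    parts = line.split()
    if len(parts) < 4:
        return None
    try:
        return datetime.strptime(" ".join(parts[:4]), STAMP_FORMAT)
    except ValueError:
        return None


def find_gaps(lines, threshold=timedelta(minutes=30)):
    """Yield (previous, current) timestamps that are further apart than threshold."""
    previous = None
    for line in lines:
        stamp = parse_stamp(line)
        if stamp is None:
            continue  # wrapped continuation line or other text
        if previous is not None and stamp - previous > threshold:
            yield previous, stamp
        previous = stamp


# A few lines copied from the clip above, including one wrapped entry.
sample = [
    r"29 November 2002 03:24:56 Free space on \\marge\d$ is : 9,501.42 MBytes",
    r"29 November 2002 03:25:16 URL check (http://my.ebulletins.co.uk/) failed due",
    r"to Timeout.",
    r"29 November 2002 08:46:55 Perfmon (\\homer\Processor(0)\% Processor Time) down",
]

for before, after in find_gaps(sample):
    print(f"No log entries between {before} and {after} ({after - before})")
```

The threshold should be comfortably longer than the normal check cycle (about
ten minutes in the clip above), otherwise every cycle shows up as a gap.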
---------------------------------------------------------------------------------------------------------
This email and any files transmitted with it are confidential and
intended solely for the use of the individual or entity to whom
they are addressed.

If you have received this email in error please notify the
originator of the message. This footer also confirms that this
email message has been scanned for the presence of known computer viruses.

Any views expressed in this message are those of the individual
sender, except where the sender specifies and with authority,
states them to be the views of Electronic Media Ltd.


To unsubscribe from a list, send a mail message to [EMAIL PROTECTED]
With the following in the body of the message:
   unsubscribe SAlive



