Thanks Willy I prodded my hosts and they fixed it at their end.
Regards Richard On 2 Dec 2010, at 06:54, Willy Tarreau <w...@1wt.eu> wrote: > Hi Richard, > > On Tue, Nov 30, 2010 at 01:10:01PM +0000, Richard Shaw wrote: >> >> Hi, >> >> I've got two identical web servers physically close and on the same subnet, >> one of them shows a significantly higher number of server chk's (5320) than >> the other (0) and is periodically marked, very briefly (1-2s) as down. >> >> name,chk,dwn,dwntime >> w1,0,0,0s >> w2,5320,0,2s >> >> The configs for both are as follows >> >> server w1 neww1:80 weight 10 maxconn 120 cookie w1 check inter 2000 rise 1 >> fall 3 >> server w2 neww2:80 weight 10 maxconn 120 cookie w2 check inter 2000 rise 1 >> fall 3 >> >> and the check i'm performing is option httpchk /check.txt >> >> My understanding is that the 5320 is the number of httpchk's HAProxy has >> recorded as fail, is this correct ? > > Yes, that's correct. The 2s downtime with zero down transitions imply that > the server was detected as down upon start up. > > If both servers are the same, it is very likely that one of them has a faulty > network card or that the switch it's connected to has a problem on its port. > > Willy >