I have an LVS-DR cluster which has been running for seven months without a hitch. Recently, the cluster started to timeout on the majority of connections. Some connections were passed through to a real server and processed. I have tried for a week to figure out what happened. What I found was that one real server out of five is connecting and servicing the client request. The other four real servers have the HTTP connection stuck in the SYN_RECV state until it times out (60 seconds).
In summary, I have seven CentOS 6.4 servers (kernel 2.6.32-358.18.1.el6.x86_64). Two servers are configured as load balancers (a primary and a backup) and five real servers. I have setup LVS-DR using IPTables. The servers have a public IP bound to a NIC device and an internal VLAN bound to a second NIC. The VIP is configured on the real servers local loopback (lo:0) device. The /etc/sysconfig/ha/lvs.cf was setup properly and everything was running successfully for seven months. We installed new versions of our software for the web service we are running. Nothing network related. All five real servers were updated the same way. I am comparing the one working real server from the four that are not working. So far I have found nothing. Any ideas on trouble shooting points? -- Best Regards, Bruce _______________________________________________ Please read the documentation before posting - it's available at: http://www.linuxvirtualserver.org/ LinuxVirtualServer.org mailing list - lvs-users@LinuxVirtualServer.org Send requests to lvs-users-requ...@linuxvirtualserver.org or go to http://lists.graemef.net/mailman/listinfo/lvs-users