We are running into an issue where the active CAM becomes unavailable and HA does not fail over to the standby CAM. We find out about the problem because users can no longer log in through their CAS server: they get the login page, but the login never succeeds. Behind the scenes, this happens because the active CAM is no longer processing authentication requests. Once we receive the problem report, attempts to open the web GUI on both the service IP and the direct IP of the active CAM fail. Failover does not happen on its own; we have to trigger it manually, usually by dropping the network connection on the active (but faulty) CAM. A service restart on the faulty CAM restores it to normal operation. So far the logs have not shown anything useful about the failure. This has occurred under 4.5.0 and 4.5.1.
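For anyone who wants to catch this state before users report it, here is a minimal sketch of an external check that probes the web GUI on both addresses and labels the result. It is not a Cisco tool. The function names and the example IPs are hypothetical, and it assumes the GUI answers HTTPS on port 443 with a certificate the monitoring host may not trust.

```python
import http.client
import ssl


def probe_https(host, port=443, timeout=5.0):
    """Return True if an HTTPS GET / gets a non-5xx response within the timeout."""
    # Assumption: the appliance may present an untrusted certificate,
    # so certificate verification is disabled for this probe only.
    ctx = ssl.create_default_context()
    ctx.check_hostname = False
    ctx.verify_mode = ssl.CERT_NONE
    conn = http.client.HTTPSConnection(host, port, timeout=timeout, context=ctx)
    try:
        conn.request("GET", "/")
        return conn.getresponse().status < 500
    except (OSError, http.client.HTTPException):
        # Covers refused connections, timeouts, TLS errors and bad responses.
        return False
    finally:
        conn.close()


def classify(service_ok, direct_ok):
    """Map the two probe results to a short label (labels are hypothetical)."""
    if service_ok and direct_ok:
        return "healthy"
    if not service_ok and not direct_ok:
        # Matches the symptom described above: GUI dead on both addresses.
        return "active-cam-unresponsive"
    if service_ok:
        # Service IP answers but this node does not, e.g. after a failover.
        return "direct-ip-unreachable"
    return "service-ip-unreachable"


def check_cam(service_ip, direct_ip):
    """Probe both addresses and return the classification label."""
    return classify(probe_https(service_ip), probe_https(direct_ip))
```

Calling `check_cam("192.0.2.10", "192.0.2.11")` (placeholder addresses) from a scheduled job and alerting on `"active-cam-unresponsive"` would flag the condition, but it only detects the hang; it does not trigger failover.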
Has anyone seen this type of issue?
