I have a cluster of about 6 nodes, half of which suddenly cannot connect to my ambari-server at https://<ambari-server>:8440. The others can connect and heartbeat without an issue.
I noticed that running openssl s_client -connect <host>:8440 also fails on the affected machines, but works on the others. My initial thought is that the ambari-server and agent certs have diverged, and the agent cert needs to be re-signed. I know that during host registration the server signs the client cert. However, I am registering my hosts manually; is that still the case?

Roshan
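
To compare the working and failing nodes more systematically, the openssl s_client check above can be approximated in a short script. This is only a rough sketch: the function name probe_tls and the status labels are made up for illustration, and certificate verification is deliberately switched off on the assumption that the goal is just to see whether the TCP connection or the TLS handshake is the step that fails.

```python
import socket
import ssl


def probe_tls(host, port=8440, timeout=5.0):
    """Try a TLS handshake against host:port and report which stage failed.

    Returns a (status, detail) tuple where status is one of:
      "ok"        - handshake completed; detail is the negotiated TLS version
      "tls_error" - TCP connected but the TLS handshake failed
      "tcp_error" - the TCP connection itself failed (refused, timeout, DNS)
    """
    # Assumption: we only care whether the handshake completes, so the
    # server certificate is not verified here.
    context = ssl.create_default_context()
    context.check_hostname = False
    context.verify_mode = ssl.CERT_NONE

    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            with context.wrap_socket(sock, server_hostname=host) as tls:
                return ("ok", tls.version())
    except ssl.SSLError as exc:
        # Must be caught before OSError, since SSLError is a subclass of it.
        return ("tls_error", str(exc))
    except OSError as exc:
        return ("tcp_error", str(exc))
```

Running probe_tls("<ambari-server>") from a good node and a bad node should show whether the bad nodes fail at the network level ("tcp_error") or during the handshake ("tls_error"). It does not present a client certificate, so it cannot by itself confirm or rule out a problem with the agent cert.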
