So after some debugging with Simone on irc, we've determined that the issue
is the agent timing out trying to communicate with the broker. The problem
is that we have no idea why.

Thread-942::INFO::2016-07-21
09:19:51,934::listener::134::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(setup)
Connection established Thread-942::INFO::2016-07-21
09:19:51,936::listener::186::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(handle)
Connection closed Thread-943::INFO::2016-07-21
09:19:53,099::listener::134::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(setup)
Connection established Thread-943::INFO::2016-07-21
09:19:53,554::listener::186::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(handle)
Connection closed

Thread-135::DEBUG::2016-07-21
09:47:34,941::util::69::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(socket_readline)
socket_readline in blocking mode Thread-135::DEBUG::2016-07-21
09:47:34,941::util::99::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(socket_readline)
Connection closed while reading from socket


So I tried to reinstall instead:

- host -> maintenance
- host removed from cluster
- yum remove ovirt\*
- yum install ovirt-hosted-engine-setup
- hosted-engine --deploy
  - chose new node id
  - reused same name/hostname

Once the host activated, it went right back to the same state.

I'm open to any suggestions to get me back on track. The engine is at
3.6.7, but functioning hosts are still at 3.5.x. Should I try to upgrade
the engine and a host to 4.0.x? I had planned on having a stable 3.6 system
for a few days before trying to jump to 4.0. Or is there some way to go
back to 3.5?


Robert

-- 
Senior Software Engineer @ Parsons

Attachment: pgpayyu94ifIL.pgp
Description: OpenPGP digital signature

_______________________________________________
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Reply via email to