Dan Kenigsberg has submitted this change and it was merged.

Change subject: After failing to connect to supervdsm more than 3 times,
vdsm panics
......................................................................


After failing to connect to supervdsm more than 3 times, vdsm panics

Due to a race between the old supervdsm instance and the new one after
prepareForShutdown, the socket is sometimes removed after the new
supervdsm has started listening on it.
The _pokeParent thread unlinks the socket when it detects that vdsm is
dead. This can take longer than it takes vdsm to start up and launch the
new supervdsm instance. The unlink then removes the socket file, and vdsm
can no longer communicate with supervdsm.
When the communication fails, vdsm calls panic and restarts itself, which
starts supervdsm again as needed.
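
The sequence above can be illustrated with a small standalone sketch. It
is not the code from vdsm/supervdsm.py. The helper names
(connect_with_retries, simulate_race), the socket file name and the
on_panic callback are hypothetical and exist only for this illustration.
The limit of 3 attempts is taken from the subject line. A late unlink of
the socket path makes every new connection attempt fail, after which the
caller gives up and reports a panic.

```python
import os
import socket
import tempfile

MAX_ATTEMPTS = 3  # assumption: mirrors "more than 3 times" in the subject


def connect_with_retries(path, attempts=MAX_ATTEMPTS, on_panic=None):
    """Try to connect to a Unix socket; call on_panic if every try fails."""
    for _ in range(attempts):
        client = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
        try:
            client.connect(path)
            return client
        except OSError:
            # The socket file is missing or nobody is listening on it.
            client.close()
    if on_panic is not None:
        on_panic("cannot reach %s after %d attempts" % (path, attempts))
    return None


def simulate_race():
    """Replay the described ordering inside a single process."""
    workdir = tempfile.mkdtemp()
    path = os.path.join(workdir, "svdsm.sock")  # hypothetical file name

    # The new instance starts listening on the socket path.
    new_server = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    new_server.bind(path)
    new_server.listen(1)

    # Before the late cleanup runs, a client can still connect.
    early = connect_with_retries(path)
    connected_before = early is not None
    if early is not None:
        early.close()

    # The old instance's delayed cleanup removes a file it no longer owns.
    os.unlink(path)

    # From now on every attempt fails, so the panic callback fires once.
    panics = []
    late = connect_with_retries(path, on_panic=panics.append)

    new_server.close()
    os.rmdir(workdir)
    return connected_before, late, panics
```

In this sketch the listener is still alive after the unlink, but no new
client can find it through the file system, which matches the symptom
described above: vdsm cannot communicate with supervdsm until it is
restarted.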

Change-Id: Iafe112893a76686edd2949d4f40b734646fd74df
Bug-Id: https://bugzilla.redhat.com/show_bug.cgi?id=910005
Signed-off-by: Yaniv Bronhaim <[email protected]>
---
M vdsm/supervdsm.py
1 file changed, 8 insertions(+), 1 deletion(-)

Approvals:
  Yaniv Bronhaim: Verified
  Saggi Mizrahi: Looks good to me, but someone else must approve
  Dan Kenigsberg: Looks good to me, approved


--
To view, visit http://gerrit.ovirt.org/11932
To unsubscribe, visit http://gerrit.ovirt.org/settings

Gerrit-MessageType: merged
Gerrit-Change-Id: Iafe112893a76686edd2949d4f40b734646fd74df
Gerrit-PatchSet: 6
Gerrit-Project: vdsm
Gerrit-Branch: master
Gerrit-Owner: Yaniv Bronhaim <[email protected]>
Gerrit-Reviewer: Barak Azulay <[email protected]>
Gerrit-Reviewer: Dan Kenigsberg <[email protected]>
Gerrit-Reviewer: Saggi Mizrahi <[email protected]>
Gerrit-Reviewer: Yaniv Bronhaim <[email protected]>
Gerrit-Reviewer: oVirt Jenkins CI Server
_______________________________________________
vdsm-patches mailing list
[email protected]
https://lists.fedorahosted.org/mailman/listinfo/vdsm-patches
