Greetings, I've written up a brief document entitled "STONITH Deathmatch Explained (and Some Hints for Resource Agent Authors and Systems Engineers)":
http://ourobengr.com/ha It's a description of causes of STONITH deathmatch in Heartbeat/Pacemaker HA clusters, where two nodes continually shoot each other, thus rendering the system less available than a non-HA system would be. Hopefully publishing this will save at least a few people from some of the pain myself and a couple of others experienced last year, in particular when trying to debug resource agents that were misbehaving in unexpected ways. Comments, feedback, etc. welcome. Thanks, Tim _______________________________________________ Pacemaker mailing list Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker