Re: [Linux-HA] suicide behavior for each process

Alan Robertson Thu, 06 Dec 2007 16:27:57 -0800

Junko IKEDA wrote:

Hi,


I found a rule something like this:
When any of the following processes is killed, the system reboots.
* ccm
* cib
* lrmd
* crmd
* pengine
* tengine

These processes are restarted when they are killed.
* FIFO
* media (e.g. write/read bcast)
* stonithd
* attrd
* mgmtd
* respawn (e.g. pingd)

If mcp is killed, Heartbeat2 stops.
(However, lrmd and mgmtd might remain running...)

Is there a policy on which behavior is desired for each process: restarting
the process or rebooting the system?
I think it wouldn't hurt if the death of any process triggered a reboot.
It's simple.

Heartbeat has always restarted "client" processes until recently, and restarted itself when its own processes died. The reboot action is certainly simple, but if the recovery works, then it's certainly more gentle.

I recently changed it so that when the "media" processes die, we restart them. Certain kinds of temporary hardware and administrator malfunctions most commonly cause them to mess up, and Lars specifically asked that they not die in this case.

The FIFO process is certainly easy to restart, so I just added it to restart (it's only used in R1 configurations).

If our strategy works (and I think it does) then I think I like "soft/safe" recovery when it is not too complicated.
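
As a rough summary of the behavior described in this thread, the per-process recovery actions could be written as a lookup table. This is only an illustrative Python sketch, not Heartbeat code: the names `Recovery`, `RECOVERY_POLICY`, and `recovery_for` are hypothetical, and the keys are lowercased labels taken from the report quoted above.

```python
from enum import Enum


class Recovery(Enum):
    """Hypothetical labels for what happens when a process dies."""
    REBOOT = "reboot the system"
    RESTART = "restart the process"
    SHUTDOWN = "heartbeat stops"


# Behavior as reported in this thread, not read from the Heartbeat source.
RECOVERY_POLICY = {
    # Death of these was observed to reboot the node.
    "ccm": Recovery.REBOOT,
    "cib": Recovery.REBOOT,
    "lrmd": Recovery.REBOOT,
    "crmd": Recovery.REBOOT,
    "pengine": Recovery.REBOOT,
    "tengine": Recovery.REBOOT,
    # These were observed to be restarted ("soft/safe" recovery).
    "fifo": Recovery.RESTART,
    "media": Recovery.RESTART,    # e.g. write/read bcast
    "stonithd": Recovery.RESTART,
    "attrd": Recovery.RESTART,
    "mgmtd": Recovery.RESTART,
    "respawn": Recovery.RESTART,  # e.g. pingd
    # Killing mcp was observed to stop Heartbeat2 as a whole.
    "mcp": Recovery.SHUTDOWN,
}


def recovery_for(process_name: str) -> Recovery:
    """Look up the reported recovery action; raises KeyError if unknown."""
    return RECOVERY_POLICY[process_name.lower()]


if __name__ == "__main__":
    for name in ("cib", "FIFO", "mcp"):
        print(f"{name}: {recovery_for(name).value}")
```

Keeping the policy in one table would make it easy to see at a glance which processes get the gentle restart and which still trigger a reboot.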


--
    Alan Robertson <[EMAIL PROTECTED]>

"Openness is the foundation and preservative of friendship... Let me claim from you at all times your undisguised opinions." - William Wilberforce

_______________________________________________
Linux-HA mailing list
Linux-HA@lists.linux-ha.org
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
