On 12/12/12 17:51, Lars Marowsky-Bree wrote:
> On 2012-12-11T12:53:39, David Vossel <dvos...@redhat.com> wrote:
> 
> Excellent progress!
> 
> Just one aspect caught my eye:
> 
>>> - on-fail defaults "restart-container" for most actions,
>>>
>>>   except for stop op (Not sure what it means if a stop fails. A
>>>   nagios
>>> daemon cannot be terminated? Should it always return success?) ,
>>
>> A nagios "stop" action should always return success.  The nagio's agent 
>> doesn't even need a stop function, the lrmd can know to treat  a "stop" as a 
>> (no-op for stop) + (cancel all recurring actions).  In this case if the 
>> nagios agent doesn't stop successfully,  it is because of an lrmd failure 
>> which should result in a fencing action i'd imagine.
> 
> That's something that, IMHO, shouldn't be handled by the container
> abstraction, but - like you say - by the LRM/class code.
> 
> I think on-fail="restart-container" makes sense even for stop. If
> "stop" can't technically fail for a given class, even better. But it
> could mean that we actually need to stop some monitoring daemon or
> whatever.
> 
> The other logic might be to set it to "ignore", which would also work
> for me (even if a bit less obviously).
Makes sense. Now on-fail for stop op defaults to "restart-container":

https://github.com/gao-yan/pacemaker/commits/container

And yes, internally, failed stop operations are ignored so that the
container can be restarted, given that there's a mandatory order.

> 
> But really I'd not want to make "oh let's just skip stop for contained
> resources" here ;-)
> 
>>> - Failures of resources count against container's
>> What happens if someone wants to clear the container's failcount? Do we need 
>> to add some logic to go in and clear all the child resource's failures as 
>> well to make this happen correctly?
> 
> That appears to make sense.
Will do that.

Regards,
  Gao,Yan
-- 
Gao,Yan <y...@suse.com>
Software Engineer
China Server Team, SUSE.

_______________________________________________
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org

Reply via email to