Richard Pieri wrote:
> Edward Ned Harvey (blu) wrote:
>> An active system will notice mysqld died, recognize that it's not
>> supposed to do that right now, and restart it.
> 
> Which is a stupid way to run in production. There's a reason why the
> daemon died. That reason needs to be identified so that corrective steps
> can be taken. Blind restarts can obfuscate this information, can cause
> damage to data, and can exacerbate existing damage.

Not to say your points are invalid, but Netflix would disagree with you.
They created a testing tool that intentionally kills random services on
their production systems just to test that automated recovery works
correctly.

 -Tom

-- 
Tom Metro
The Perl Shop, Newton, MA, USA
"Predictable On-demand Perl Consulting."
http://www.theperlshop.com/
_______________________________________________
Discuss mailing list
Discuss@blu.org
http://lists.blu.org/mailman/listinfo/discuss

Reply via email to