[Discuss] automatic daemon restarts

Edward Ned Harvey (blu) blu at nedharvey.com
Tue Sep 16 07:42:51 EDT 2014


> From: discuss-bounces+blu=nedharvey.com at blu.org [mailto:discuss-
> bounces+blu=nedharvey.com at blu.org] On Behalf Of Tom Metro
> 
> Richard Pieri wrote:
> > Edward Ned Harvey (blu) wrote:
> >> An active system will notice mysqld died, recognize that it's not
> >> supposed to do that right now, and restart it.
> >
> > Which is a stupid way to run in production. There's a reason why the
> > daemon died. That reason needs to be identified so that corrective steps
> > can be taken. Blind restarts can obfuscate this information, can cause
> > damage to data, and can exacerbate existing damage.
> 
> Not to say your points are invalid, but Netflix would disagree with you.
> They created a testing tool that intentionally kills random services on
> their production systems just to test that automated recovery works
> correctly.

I would rather receive notification that a production service was *restarted* rather than *is down*

Richard wants to say that's stupid.  I not only disagree, I think Richard's position is insulting and ignorantly one-sided.



More information about the Discuss mailing list