On Oct 30, 2006, at 22:10 , Luke Kanies wrote:

Luke Crawford wrote:
Most of the places I have worked, most of the 'fires' were caused by an admin mistake. Even now that my job is to put out other people's fires, configuration errors are close to half the problems I encounter.

If your sysadmins are still logging into machines and making manual changes, or they've got a bunch of hand-rolled scripts that do who- knows-what and aren't centrally version controlled, it's no wonder that there are so many bugs. If you are using current best practice and you're still having lots of problems, your admins are incompetent and should be fired.

"Logic is an organized way of going wrong with confidence." We've had a few instances of changes being tested and rolled out via CM, and breaking a significant number of machines because the testing missed some corner cases. The automated tools can actually make this worse by turning it into a global meltdown.

Sysadmins are human, and CM tools are only as good as the humans using them.

(This does not mean don't use them, but it does go along with starting small so the inevitable mistakes while learning are localized. You're far more likely to cause a global meltdown with cfengine etc. if you're a novice jumping in with both feet.)

--
brandon s. allbery    [linux,solaris,freebsd,perl]     [EMAIL PROTECTED]
system administrator [openafs,heimdal,too many hats] [EMAIL PROTECTED]
electrical and computer engineering, carnegie mellon university    KF8NH



_______________________________________________
lssconf-discuss mailing list
lssconf-discuss@inf.ed.ac.uk
http://lists.inf.ed.ac.uk/mailman/listinfo/lssconf-discuss

Reply via email to