SMARTD is what your looking for! We also have several servers that are running mcelog[1] for hardware failure monitoring. Backup is still important even if you have these tools!
[1]http://www.cyberciti.biz/tips/linux-server-predicting-hardware-failure.html -- Greek Ordono myppa: launchpad.net/~grexk/+archive/ppa --- On Fri, 9/24/10, Red Sancho <[email protected]> wrote: From: Red Sancho <[email protected]> Subject: Re: [plug] Disk preventive maintenance To: "Philippine Linux Users' Group (PLUG) Technical Discussion List" <[email protected]> Date: Friday, 24 September, 2010, 8:39 PM that's a long shot up sell but worth trying, we've already identified that risk of not having a backup policy but from where we stand right now adding cost to a losing business isnt gonna go far up the ladder... anyways, is it possible to prevent or at least detect an FS failure looming? This is what happend... after joining the company i found out that most of the servers are running on debian etch and i brought this to the attention of my boss and he agreed to perform upgrades to the majority of the etch servers. after performing the distro upgrade and rebooting, our sdb1 didn't mount and all of the files went into lost+found, i also got "missing journal for /dev/sdb1" from messages. i tried to mount it only to find that all of the files are now in lost+found, so i unmounted it and ran fsck (probably a bad thing). Now im wondering what could have caused this error in the fs. also one question, is it ok to run fsck read-only on a mounted device? Tia From: eric pareja <[email protected]> To: Philippine Linux Users' Group (PLUG) Technical Discussion List <[email protected]> Sent: Fri, September 24, 2010 8:11:25 PM Subject: Re: [plug] Disk preventive maintenance You will still need to perform backup to avoid data loss when migrating in any case. You have to define a backup policy first, your mechanism merely implements the policy. Data loss is more expensive than backup. You will have to integrate backup costs into your operations. On Fri, Sep 24, 2010 at 8:05 PM, Red Sancho <[email protected]> wrote: > thanks for pointing that out, currently there are no backup mechanism in > place for the "file servers" and since they are holding around 2TB of data > per server it would take a lot of time to backup all of the servers and also > the files are constantly changing... -- Eric Manuel Pareja ([email protected]) LPIC-2, NCLP | PGP/GPG Key 0xB82E42D9 Coordinator for Technology - National Telehealth Center University of the Philippines Manila Senior Linux Trainer - International Open Source Network - ASEAN+3 Ang mundo ay aklat, at iisang pahina lamang ang nababasa ng hindi naglalakbay. - San Agustin わかよたれぞ つねならむ _________________________________________________ Philippine Linux Users' Group (PLUG) Mailing List http://lists.linux.org.ph/mailman/listinfo/plug Searchable Archives: http://archives.free.net.ph -----Inline Attachment Follows----- _________________________________________________ Philippine Linux Users' Group (PLUG) Mailing List http://lists.linux.org.ph/mailman/listinfo/plug Searchable Archives: http://archives.free.net.ph
_________________________________________________ Philippine Linux Users' Group (PLUG) Mailing List http://lists.linux.org.ph/mailman/listinfo/plug Searchable Archives: http://archives.free.net.ph

