Dmitry, Look into cluster/system monitoring tools: nagios and ganglia are two to start with. - Aaron
On Tue, Feb 3, 2009 at 9:53 AM, Dmitry Pushkarev <u...@stanford.edu> wrote: > Dear hadoop users, > > > > Recently I have had a number of drive failures that slowed down processes a > lot until they were discovered. It is there any easy way or tool, to check > HDD performance and see if there any IO errors? > > Currently I wrote a simple script that looks at /var/log/messages and greps > everything abnormal for /dev/sdaX. But if you have better solution I'd > appreciate if you share it. > > > > --- > > Dmitry Pushkarev > > +1-650-644-8988 > > > >