Are there any standard or recommended scripts for the mapred.healthChecker
options in the mapred-site.xml configuration file for a linux box?
-Jeff
Mike,
To do this for the more general case of creating N map jobs with each job
receiving the one record i, n, where i ranges from 0 to n-1, I wrote
an InputFormat, InputSplit, and RecordReader Hadoop class. The sample code
is here http://goo.gl/npKfP. I think I wrote those for Hadoop 0.19, so
)
to achieve scaling. In such cases a low bandwidth network will impede
scaling. Bryan Duxbury has a nice blog post about networking a Hadoop
cluster here http://goo.gl/uVeoM.
More concisely, I would say that Hadoop scales on clusters with networks
that scale (up to ~4000 nodes).
--
Jeff Kubina