Hi.

Just wondering if anyone has found some good Linux settings for
I/O-intensive workloads with Hadoop.

Since most Hadoop use cases are I/O-bound, and since Hadoop uses the
network heavily, I guess that the TCP buffers and other kernel buffers
should be tuned (even for CPU-bound loads).
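
Before changing any buffer settings it probably helps to see what a node
currently uses. Below is a minimal Python sketch for that, assuming a
Linux box where sysctl values are exposed under /proc/sys. The four keys
are standard Linux sysctls; the helper names (sysctl_path, parse_values,
read_sysctl) are made up for this example, and no particular values are
being recommended.

```python
from pathlib import Path

# Standard Linux sysctl keys that control socket buffer sizes.
KEYS = [
    "net.core.rmem_max",   # upper limit for a socket receive buffer (bytes)
    "net.core.wmem_max",   # upper limit for a socket send buffer (bytes)
    "net.ipv4.tcp_rmem",   # TCP receive buffer: min, default, max (bytes)
    "net.ipv4.tcp_wmem",   # TCP send buffer: min, default, max (bytes)
]


def sysctl_path(key, root="/proc/sys"):
    """Map a dotted sysctl key to its file under /proc/sys (hypothetical helper)."""
    return Path(root) / key.replace(".", "/")


def parse_values(text):
    """Turn the whitespace-separated file content into a list of ints."""
    return [int(tok) for tok in text.split()]


def read_sysctl(key, root="/proc/sys"):
    """Return the current values for key, or None if it cannot be read."""
    try:
        return parse_values(sysctl_path(key, root).read_text())
    except (OSError, ValueError):
        return None


if __name__ == "__main__":
    for key in KEYS:
        print(key, "=", read_sysctl(key))
```

Running it on each node gives a baseline to compare against after any
change made with sysctl -w or in /etc/sysctl.conf.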

I would also guess that one should choose an I/O scheduler such as
deadline or perhaps cfq.
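
To see which scheduler each disk is using right now, something like the
following Python sketch might do. It assumes the usual sysfs layout, where
/sys/block/<device>/queue/scheduler lists the available schedulers with
the active one in square brackets (for example "noop deadline [cfq]").
The function names are made up for this example, and the set of
schedulers offered depends on the kernel version.

```python
from pathlib import Path


def parse_scheduler(text):
    """Split a scheduler line into (active, available).

    The kernel marks the active scheduler with square brackets,
    e.g. "noop deadline [cfq]".
    """
    active = None
    available = []
    for name in text.split():
        if name.startswith("[") and name.endswith("]"):
            name = name[1:-1]
            active = name
        available.append(name)
    return active, available


def read_scheduler(device, root="/sys/block"):
    """Return (active, available) for a block device, or None if unreadable."""
    path = Path(root) / device / "queue" / "scheduler"
    try:
        return parse_scheduler(path.read_text())
    except OSError:
        return None


if __name__ == "__main__":
    root = Path("/sys/block")
    devices = sorted(p.name for p in root.iterdir()) if root.is_dir() else []
    for dev in devices:
        print(dev, read_scheduler(dev))
```

Switching is done by writing one of the listed names to the same file as
root; the script above only reads and changes nothing.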

We will use many of the tricks found here:
http://www.gluster.org/docs/index.php/Guide_to_Optimizing_GlusterFS

Kind regards

//Marcus

-- 
Marcus Herou CTO and co-founder Tailsweep AB
+46702561312
[EMAIL PROTECTED]
http://www.tailsweep.com/
http://blogg.tailsweep.com/
