Re: Optimal setup for a test problem

2010-04-13 Thread Andrew Nguyen
Correction, they are 100Mbps NICs... iperf shows that we're getting about 95 Mbits/sec from one node to another. On Apr 12, 2010, at 1:05 PM, Andrew Nguyen wrote: @Todd: I do need the sorting behavior, eventually. However, I'll try it with zero reduce jobs to see. @Alex: Yes, I

Re: Optimal setup for a test problem

2010-04-13 Thread alex kamil
Andrew, here are some tips for Hadoop runtime config: http://cloudepr.blogspot.com/2009/09/cluster-facilities-hardware-and.html Also, here are some results from my cluster (using 1GE NICs, fiber), Dell 5500, 24GB, 8-core (16 hypervised), JBOD. I saw slightly better numbers on a different 4-node

Re: Optimal setup for a test problem

2010-04-13 Thread Andrew Nguyen
Good to know... The problem is that I'm in an academic environment that needs a lot of convincing regarding new computational technologies. I need to show proven benefit before getting the funds to actually implement anything. These servers were the best I could come up with for this

Optimal setup for a test problem

2010-04-12 Thread Andrew Nguyen
Hello, I recently set up a 5-node cluster (1 master, 4 slaves) and am looking to use it to process high volumes of patient physiologic data. As an initial exercise to gain a better understanding, I have attempted to run the following problem (which isn't the type of problem that Hadoop was

Re: Optimal setup for a test problem

2010-04-12 Thread alex kamil
Andrew, I would also suggest running the DFSIO benchmark to isolate I/O-related issues:

    hadoop jar hadoop-0.20.2-test.jar TestDFSIO -write -nrFiles 10 -fileSize 1000
    hadoop jar hadoop-0.20.2-test.jar TestDFSIO -read -nrFiles 10 -fileSize 1000

There are additional tests specific to MapReduce - run

Re: Optimal setup for a test problem

2010-04-12 Thread Andrew Nguyen
I don't think you can :-). Sorry, they are 100Mbps NICs... I get 95Mbit/sec from one node to another with iperf. Should I still be expecting such dismal performance with just 100Mbps? On Apr 12, 2010, at 1:31 PM, Todd Lipcon wrote: On Mon, Apr 12, 2010 at 1:05 PM, Andrew Nguyen
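
For a rough sense of what the 95 Mbit/sec iperf figure implies, here is a back-of-the-envelope sketch. It assumes decimal megabytes and a single link, and it ignores protocol overhead, disk speed, and HDFS replication traffic. The function names and the 1000 MB example size are illustrative, not taken from the thread.

```python
def mbit_to_mbyte_per_s(mbit_per_s: float) -> float:
    """Convert megabits per second to megabytes per second (decimal units)."""
    return mbit_per_s / 8.0


def transfer_seconds(size_mb: float, link_mbit_per_s: float) -> float:
    """Lower bound on the time to push size_mb megabytes through one link."""
    return size_mb / mbit_to_mbyte_per_s(link_mbit_per_s)


if __name__ == "__main__":
    measured = 95.0  # Mbit/s, the iperf figure quoted in the thread
    example_size_mb = 1000.0  # arbitrary example payload, an assumption
    print(f"{mbit_to_mbyte_per_s(measured):.3f} MB/s")
    print(f"{transfer_seconds(example_size_mb, measured):.1f} s per {example_size_mb:.0f} MB")
```

Under these assumptions the link tops out just under 12 MB/s, so any job that shuffles or replicates data between nodes is likely to be bounded by the network well before CPU becomes the limit.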

Re: Optimal setup for a test problem

2010-04-12 Thread Andrew Nguyen
I guess my question below can be rephrased as, What are the absolute minimum hardware requirements for me to still see 'better-than-a-single-machine' performance? Thanks! On Apr 12, 2010, at 1:45 PM, Andrew Nguyen wrote: I don't think you can :-). Sorry, they are 100Mbps NICs... I get