Hi Brad, how many taskstrackers did you have on each node in both cases? Thanks, Prashant
Sent from my iPhone On Dec 13, 2011, at 9:42 AM, Brad Sarsfield <b...@bing.com> wrote: > Praveenesh, > > Your question is not naïve; in fact, optimal hardware design can ultimately > be a very difficult question to answer on what would be "better". If you made > me pick one without much information I'd go for more machines. But... > > It all depends; and there is no right answer.... :) > > More machines > +May run your workload faster > +Will give you a higher degree of reliability protection from node / > hardware / hard drive failure. > +More aggregate IO capabilities > - capex / opex may be higher than allocating more cores > More cores > +May run your workload faster > +More cores may allow for more tasks to run on the same machine > +More cores/tasks may reduce network contention and increase increasing > task to task data flow performance. > > Notice "May run your workload faster" is in both; as it can be very workload > dependant. > > My Experience: > I did a recent experiment and found that given the same number of cores (64) > with the exact same network / machine configuration; > A: I had 8 machines with 8 cores > B: I had 28 machines with 2 cores (and 1x8 core head node) > > B was able to outperform A by 2x using teragen and terasort. These machines > were running in a virtualized environment; where some of the IO capabilities > behind the scenes were being regulated to 400Mbps per node when running in > the 2 core configuration vs 1Gbps on the 8 core. So I would expect the > non-throttled scenario to work even better. > > ~Brad > > > -----Original Message----- > From: praveenesh kumar [mailto:praveen...@gmail.com] > Sent: Monday, December 12, 2011 8:51 PM > To: common-user@hadoop.apache.org > Subject: More cores Vs More Nodes ? > > Hey Guys, > > So I have a very naive question in my mind regarding Hadoop cluster nodes ? > > more cores or more nodes - Shall I spend money on going from 2-4 core > machines, or spend money on buying more nodes less core eg. say 2 machines of > 2 cores for example? > > Thanks, > Praveenesh >