Hello all.

I've just tested MapReduce in C++ against the Java version.

I did run WordCount included in 0.14.1 release version on a 1 node Hadoop
cluster (Pentium D with 2GB of RAM).
There were 2 input files (one 4.5MB file + one 36MB file).
I also did take Combiner out of Java version WordCount MapReduce, as there
was no Combiner used for C++ version.

The result is.... as many of you have guessed, Java version won the race big
time. Java version was about 4 times quicker.

Here is more detailed result.


Java Version
C++ Version     Ratio (Java : C++)  Total Time Taken
 89 364     1 : 4  Longest Time Taken for Map
 41 83     1 : 2  Longest Time Taken for Reduce
 58 264     1 : 4.5

Any guess or idea on how to improve the performance of C++ MapReduce?

Taeho

-- 
Taeho Kang [tkang.blogspot.com]

Reply via email to