Re: Parallelism of sorts

Doug Cutting Mon, 05 May 2008 11:29:40 -0700

Brice Arnould wrote:

I was asking myself if it could be a good idea to parallelize some of the
alogorithms of Hadoop, such as MergeSorter, for the case a single job of
run on a multicore system.

One can already exploit parallelism on a multicore system by using"pseudo-distributed" mode and increasingmapred.tasktracker.map.tasks.maximum andmapred.tasktracker.reduce.tasks.maximum.

LocalRunner should also someday be enhanced to run multiple maps andreduces in separate threads, which would be more efficient, sinceintermediate data would not need to travel through the loopback networkinterface. But I don't see an urgent case for making the sort codeitself multi-threaded, since MapReduce itself performs parallel sorting.


Doug

Re: Parallelism of sorts

Reply via email to