Sorry, this mail was intended for somebody else. Please disregard.
On 2/25/09 2:33 PM, "Jothi Padmanabhan" <joth...@yahoo-inc.com> wrote:

> Just to clarify -- setting test.build.data on the command line to point to
> some arbitrary directory in /tmp should work:
>
> ant -Dtestcase=TestMapReduceLocal -Dtest.output=yes -Dtest.build.data=/tmp/foo test-core
>
> Jothi
>
>
> On 2/25/09 10:53 AM, "Jothi Padmanabhan" <joth...@yahoo-inc.com> wrote:
>
>> If you had set the number of reduce tasks to 0, you should not see the
>> "reduce > sort" phase. How did you set the number of reducers?
>> You could do that by calling
>>
>> job.setNumReduceTasks(0);
>>
>> Jothi
>>
>>
>> On 2/25/09 10:34 AM, "Vadim Zaliva" <kroko...@gmail.com> wrote:
>>
>>> On Thu, Feb 7, 2008 at 10:07, Owen O'Malley <o...@yahoo-inc.com> wrote:
>>>
>>>> Setting it to 0 skips all of the buffering, sorting, merging, and
>>>> shuffling.
>>>> It passes the objects straight from the mapper to the output format, which
>>>> writes them straight to HDFS.
>>>
>>> I just tried to set the number of reduce tasks to 0, but the Job Tracker
>>> shows a reduce task working, doing "reduce > sort". I have a big data set
>>> and it takes a while. It would be good to find a way to skip it.
>>>
>>> Vadim
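
For anyone reading the thread later, here is a rough sketch of the difference Owen describes between a job with zero reducers and one with reducers. It is a toy model in plain Java, not Hadoop code: the class MapOnlySketch, its map and run methods, and the word-count style mapper are all made up for illustration, and a single sorted map stands in for the whole buffer/sort/merge/shuffle stage. In a real job the switch is the job.setNumReduceTasks(0) call quoted above.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Toy model only: illustrates the data flow, uses no Hadoop classes.
final class MapOnlySketch {

    // Hypothetical mapper: emits one (word, 1) pair per whitespace-separated token.
    static List<Map.Entry<String, Integer>> map(List<String> lines) {
        List<Map.Entry<String, Integer>> out = new ArrayList<>();
        for (String line : lines) {
            for (String word : line.trim().split("\\s+")) {
                if (!word.isEmpty()) {
                    out.add(new AbstractMap.SimpleImmutableEntry<>(word, 1));
                }
            }
        }
        return out;
    }

    // Runs the toy job and returns the lines that would be written as output.
    // Any positive numReduceTasks is treated as one reducer (no partitioning).
    static List<String> run(List<String> lines, int numReduceTasks) {
        List<Map.Entry<String, Integer>> mapped = map(lines);
        List<String> output = new ArrayList<>();

        if (numReduceTasks == 0) {
            // Map-only: records go to the output in the order the mapper
            // emitted them, with no sorting or grouping by key.
            for (Map.Entry<String, Integer> e : mapped) {
                output.add(e.getKey() + "\t" + e.getValue());
            }
            return output;
        }

        // With reducers: group and sort by key (the TreeMap stands in for the
        // sort/merge/shuffle stage), then reduce each group by summing.
        TreeMap<String, Integer> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> e : mapped) {
            grouped.merge(e.getKey(), e.getValue(), Integer::sum);
        }
        for (Map.Entry<String, Integer> e : grouped.entrySet()) {
            output.add(e.getKey() + "\t" + e.getValue());
        }
        return output;
    }

    public static void main(String[] args) {
        List<String> input = Arrays.asList("b a", "a");
        System.out.println("0 reducers: " + run(input, 0)); // unsorted, ungrouped
        System.out.println("1 reducer:  " + run(input, 1)); // sorted and summed
    }
}
```

With zero reducers the mapper's records come out unsorted and ungrouped, which is why no "reduce > sort" phase should appear when the setting has actually taken effect.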