job.setOutputFormatClass(NullOutputFormat.class);

2014-07-01 Thread Chris MacKenzie
Hi, What is the anticipated usage of the above with the new api ? Is there another way to remove the empty part-r files When using it with MultipleOutputs to remove empty part-r files I have no output ;O) Regards, Chris MacKenzie http://www.chrismackenziephotography.co.uk/

Re: job.setOutputFormatClass(NullOutputFormat.class);

2014-07-01 Thread M. Dale
NullOutputFormat does not generate any output. Good for jobs where counters or some other I/O are your output (for example, http://stackoverflow.com/questions/12707726/run-a-hadoop-job-without-output-file). From Tom White's book it sounds like

Re: job.setOutputFormatClass(NullOutputFormat.class);

2014-07-01 Thread Shahab Yunus
To get rid of empty *part files while using MultipleOutputs in the new API, LazyOutputFormat class' static method should be used to set the output format. Details are here at the official Java docs for MultipleOutputs :

Re: job.setOutputFormatClass(NullOutputFormat.class);

2014-07-01 Thread Chris MacKenzie
Hi Markus And Shahab, Thanks for getting back to me, I really appreciate it. LazyOutputFormat did the trick. I tried NUllOutputFormat (job.setOutputFormatClass(NullOutputFormat.class);) before writing to the group but was getting an empty folder. I looked at LazyOutputFormat, in fact, my mos