Hi,
What is the anticipated usage of the above with the new api ? Is there
another way to remove the empty part-r files
When using it with MultipleOutputs to remove empty part-r files I have no
output ;O)
Regards,
Chris MacKenzie
http://www.chrismackenziephotography.co.uk/
NullOutputFormat does not generate any output. Good for jobs where
counters or some other I/O are your output (for example,
http://stackoverflow.com/questions/12707726/run-a-hadoop-job-without-output-file).
From Tom White's book it sounds like
To get rid of empty *part files while using MultipleOutputs in the new API,
LazyOutputFormat class' static method should be used to set the output
format.
Details are here at the official Java docs for MultipleOutputs :
Hi Markus And Shahab,
Thanks for getting back to me, I really appreciate it. LazyOutputFormat did
the trick. I tried NUllOutputFormat
(job.setOutputFormatClass(NullOutputFormat.class);) before writing to the
group but was getting an empty folder.
I looked at LazyOutputFormat, in fact, my mos