On 09/06/2011 01:57 AM, Niels Basjes wrote:
Hi,
In the past i've had the same situation where I needed the data for
debugging. Back then I chose to create a second job with simply
SequenceFileInputFormat, IdentityMapper, IdentityReducer and finally
TextOutputFormat.
In my situation that worked great for my purpose.
I did similar at my last job, but rather than writing a 2nd map/reduce
job for this, we just wrote a simple command line app that used the
Hadoop Java API to dump the contents of the binary file as text (JSON)
to the console.
HTH,
DR