On 09/06/2011 01:57 AM, Niels Basjes wrote:
Hi,

In the past I've had the same situation where I needed the data for
debugging. Back then I chose to create a second job with simply
SequenceFileInputFormat, IdentityMapper, IdentityReducer and finally
TextOutputFormat.

That worked well for my purpose.

I did something similar at my last job, but rather than writing a second MapReduce job for this, we wrote a simple command-line app that used the Hadoop Java API to dump the contents of the binary file as text (JSON) to the console.
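
For illustration, here is a rough sketch of the text-rendering half of such a tool: one JSON object per record, written to the console. It is not the original app. The names RecordDumper, escape and toJsonLine are hypothetical, the sample records are made up, and the sketch assumes keys and values have already been converted to strings (for example via toString() on the Writables). In a real tool the loop would be fed by a Hadoop SequenceFile reader rather than an in-memory map.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper: renders key/value records as one JSON object per line.
public class RecordDumper {

    // Escape a string so it can sit inside a JSON string literal.
    static String escape(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n");  break;
                case '\r': sb.append("\\r");  break;
                case '\t': sb.append("\\t");  break;
                default:
                    if (c < 0x20) {
                        // Remaining control characters use the \\uXXXX form.
                        sb.append(String.format("\\u%04x", (int) c));
                    } else {
                        sb.append(c);
                    }
            }
        }
        return sb.toString();
    }

    // Render a single record as a one-line JSON object.
    static String toJsonLine(String key, String value) {
        return "{\"key\":\"" + escape(key) + "\",\"value\":\"" + escape(value) + "\"}";
    }

    public static void main(String[] args) {
        // Stand-in data (assumption); a real tool would iterate over the
        // records read from the binary file instead.
        Map<String, String> records = new LinkedHashMap<>();
        records.put("user-1", "clicked \"buy\"");
        records.put("user-2", "line one\nline two");

        for (Map.Entry<String, String> e : records.entrySet()) {
            System.out.println(toJsonLine(e.getKey(), e.getValue()));
        }
    }
}
```

Printing one object per line keeps the output easy to grep or pipe into other tools while debugging.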

HTH,

DR
