Hello list,
I have a task where I have compare the entries of a big sequencefile
with the entries of many small sequencefiles.
Basically you could describe it like this:
for entry in bigSequenceFile:
for file in listOfSmallFiles:
for entry2 in file:
compare(entry, entry2)
My job is dying during a map task write. This happened in enough task
to kill the job although most tasks succeeded -
Any ideas as to where to start diagnosing the problem
Caused by: java.lang.NullPointerException
at
org.apache.hadoop.io.compress.BZip2Codec$BZip2CompressionOutputStrea