MapFileOutputFormat and SequenceFileOutputFormat use incorrect key/value
classes in map/reduce tasks
----------------------------------------------------------------------------------------------------
Key: HADOOP-920
URL: https://issues.apache.org/jira/browse/HADOOP-920
Project: Hadoop
Issue Type: Bug
Components: mapred
Affects Versions: 0.11.0
Reporter: Andrzej Bialecki
Fix For: 0.11.0
Let's assume a job uses different key/value class for the output of map tasks
and for the final output of reduce tasks.
When executing map tasks classes returned from JobConf.getMapOutputKeyClass() /
getMapOutputValueClass() should be used, and when executing reduce tasks
classes returned from JobConf.gtOutputKeyClass() / getOutputValueClass() should
be used.
Currently both map and reduce tasks will use
getMapOutputKeyClass/getMapOutputValueClass when using MapFileOutputFormat, or
they will always use getOutputKeyClassgetOutputValueClass when using
SequenceFileOutputFormat. This causes exceptions, because Mapper / Reducer
implementations will output different key/value classes than expected.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.