Re: [jira] Updated: (HADOOP-1535) Wrong comparator used to merge files in Reduce phase

Nigel Daley Thu, 28 Jun 2007 09:16:20 -0700

Hi Vivek,

Can you include a unit test for this fix?


On Jun 28, 2007, at 2:40 AM, Vivek Ratan (JIRA) wrote:

[ https://issues.apache.org/jira/browse/HADOOP-1535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Vivek Ratan updated HADOOP-1535:
--------------------------------

    Attachment: 1535_01.patch
We use the comparator returned by JobConf.getOutputKeyComparator()for the sort/merge phases of Map and Reduce. We use the comparatorreturned by JobConf.getOutputValueGroupingComparator() for theiterator across values for a given key. See 1535_01.patch.
Wrong comparator used to merge files in Reduce phase
----------------------------------------------------

                Key: HADOOP-1535
URL: https://issues.apache.org/jira/browse/HADOOP-1535
            Project: Hadoop
         Issue Type: Bug
         Components: mapred
   Affects Versions: 0.12.3, 0.13.0
           Reporter: Vivek Ratan
           Assignee: Vivek Ratan
            Fix For: 0.14.0

        Attachments: 1535_01.patch
As per the fix for HADOOP-485, we allow users to optionallyprovide a different comparator to group values when calling theuser's Reduce function. Devaraj and I were looking at the codeyesterday and we found that in ReduceTask.java, we use the user-supplied comparator to merge the output files from the Map tasks(we use the user-supplied comparator when creating a newSequenceFile.Sorter object). This is incorrect as the comparatorused to merge Map output files should be the same as that used tocreate those files in the Map phase. The user-supplied comparatorfor grouping values should be used only in the iterator passed tothe user's Reduce function (which is done correctly in the code).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Re: [jira] Updated: (HADOOP-1535) Wrong comparator used to merge files in Reduce phase

Reply via email to