[ 
https://issues.apache.org/jira/browse/MAPREDUCE-479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jiaqi Tan updated MAPREDUCE-479:
--------------------------------

    Release Note: Adds Reduce Attempt ID to ClientTrace log messages, and adds 
Reduce Attempt ID to HTTP query string sent to mapOutputServlet. Extracts 
partition number from attempt ID.   (was: Adds Reduce Attempt ID to ClientTrace 
log messages, and adds Reduce Attempt ID to HTTP query string sent to 
mapOutputServlet.)
          Status: Patch Available  (was: Open)
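
For illustration, a minimal sketch of how a partition number could be read 
out of a reduce attempt ID. It assumes the usual attempt ID layout 
attempt_<jobtracker id>_<job>_<m|r>_<task>_<attempt>; the class and method 
names are hypothetical and are not taken from the attached patches, which may 
use Hadoop's own ID classes instead of string splitting.

```java
// Hypothetical helper, not part of the patch: extracts the partition
// (task) number from a reduce attempt ID string.
final class ReduceAttemptIdSketch {

    /** Returns the task number of a reduce attempt ID, used here as the partition. */
    static int partitionOf(String attemptId) {
        // Assumed layout: attempt_<jtId>_<jobNum>_<m|r>_<taskNum>_<attemptNum>
        String[] parts = attemptId.split("_");
        if (parts.length != 6
                || !parts[0].equals("attempt")
                || !parts[3].equals("r")) {
            throw new IllegalArgumentException(
                "Not a reduce attempt ID: " + attemptId);
        }
        // Leading zeros are accepted by parseInt, so "000005" yields 5.
        return Integer.parseInt(parts[4]);
    }

    public static void main(String[] args) {
        // Example ID made up for this sketch.
        System.out.println(partitionOf("attempt_200707121733_0003_r_000005_0"));
    }
}
```

Rejecting map attempt IDs up front keeps a malformed or unexpected query 
string from silently yielding a wrong partition.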

Ran a microbenchmark of shuffle durations with and without the added reduce 
attempt ID transmission and reduce partition number extraction. Shuffle times 
before and after this patch are statistically comparable (chi-squared test for 
distribution similarity of shuffle times, p-value 0.23 => null hypothesis of 
identical distributions not rejected), so the patch shows no measurable 
performance impact.
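
A comparison of this kind can be sketched as a chi-squared homogeneity test 
over binned shuffle times. The sketch below only computes the test statistic 
and degrees of freedom; the class name, the binning, and the counts in main 
are hypothetical illustration values, not the measurements behind the p-value 
quoted above.

```java
// Hypothetical sketch: chi-squared statistic for two histograms of
// shuffle durations that share the same bins.
final class ShuffleChiSquareSketch {

    /** Chi-squared homogeneity statistic for two equally binned count arrays. */
    static double statistic(long[] before, long[] after) {
        if (before.length != after.length) {
            throw new IllegalArgumentException("Histograms must share bins");
        }
        long totalBefore = 0;
        long totalAfter = 0;
        for (int i = 0; i < before.length; i++) {
            totalBefore += before[i];
            totalAfter += after[i];
        }
        if (totalBefore == 0 || totalAfter == 0) {
            throw new IllegalArgumentException("Each sample needs observations");
        }
        double n = (double) totalBefore + totalAfter;
        double chi2 = 0.0;
        for (int i = 0; i < before.length; i++) {
            long binTotal = before[i] + after[i];
            if (binTotal == 0) {
                continue; // an empty bin contributes nothing
            }
            // Expected counts if both samples come from one distribution.
            double expBefore = (double) totalBefore * binTotal / n;
            double expAfter = (double) totalAfter * binTotal / n;
            double dBefore = before[i] - expBefore;
            double dAfter = after[i] - expAfter;
            chi2 += dBefore * dBefore / expBefore + dAfter * dAfter / expAfter;
        }
        return chi2;
    }

    /** Degrees of freedom for two samples over the given number of bins. */
    static int degreesOfFreedom(int bins) {
        return bins - 1;
    }

    public static void main(String[] args) {
        // Made-up bin counts, for illustration only.
        long[] before = {12, 30, 41, 17};
        long[] after = {10, 33, 38, 19};
        System.out.println("chi2 = " + statistic(before, after)
            + ", df = " + degreesOfFreedom(before.length));
    }
}
```

The p-value follows from the statistic and the degrees of freedom via a 
chi-squared table or a statistics library. A large p-value means the null 
hypothesis that both samples share one distribution is not rejected.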

> Add reduce ID to shuffle clienttrace
> ------------------------------------
>
>                 Key: MAPREDUCE-479
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-479
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>    Affects Versions: 0.21.0
>            Reporter: Jiaqi Tan
>            Assignee: Jiaqi Tan
>            Priority: Minor
>             Fix For: 0.21.0
>
>         Attachments: HADOOP-6013.patch, MAPREDUCE-479-1.patch, 
> MAPREDUCE-479-2.patch, MAPREDUCE-479-3.patch, MAPREDUCE-479-4.patch, 
> MAPREDUCE-479.patch
>
>
> Current clienttrace messages from shuffles note only the destination map ID 
> but not the source reduce ID. Having both source and destination ID of each 
> shuffle enables full tracing of execution. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
