Gera Shegalov created MAPREDUCE-5932:
----------------------------------------

             Summary: Provide an option to use a dedicated reduce-side shuffle 
log
                 Key: MAPREDUCE-5932
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5932
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
          Components: mrv2
    Affects Versions: 2.4.0
            Reporter: Gera Shegalov
            Assignee: Gera Shegalov


For reducers in large jobs our users cannot easily spot portions of the log 
associated with problems with their code. An example reducer with INFO-level 
logging generates ~3500 lines / ~700KiB  lines per second. 95% of the log is 
the client-side of the shuffle {{org.apache.hadoop.mapreduce.task.reduce.*}}

{code}
$ wc syslog 
    3642   48192  691013 syslog
$ grep task.reduce syslog | wc 
    3424   46534  659038
$ grep task.reduce.ShuffleScheduler syslog | wc 
    1521   17745  251458
$ grep task.reduce.Fetcher syslog | wc 
    1045   15340  223683
$ grep task.reduce.InMemoryMapOutput syslog | wc 
     400    4800   72060
$ grep task.reduce.MergeManagerImpl syslog | wc 
     432    8200  106555
{code}

Byte percentage breakdown:
{code}
Shuffle total:           95%

ShuffleScheduler:        36%
Fetcher:                 32%
InMemoryMapOutput:       10%
MergeManagerImpl:        15%
{code}

While this is information is actually often useful for devops debugging shuffle 
performance issues, the job users are often lost. 

We propose to have a dedicated syslog.shuffle file.




--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to