[ 
https://issues.apache.org/jira/browse/MAPREDUCE-6251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vinod Kumar Vavilapalli updated MAPREDUCE-6251:
-----------------------------------------------
    Target Version/s: 2.8.0
              Status: Open  (was: Patch Available)

Okay, reviewing again after your responses
 - Configuration usually goes into MRJobConfig. Unless you explicitly don't 
want them publicly visible to end-users naturally.
 - Once you move them to MRJobConfig, the naming convention also changes. Names 
there follow MR_CLIENT_*, DEFAULT_* patterns
 - Document them in mapred-default.xml? Stating when they are needed, and how 
they should be used in contrast to the lower level retries.

> JobClient needs additional retries at a higher level to address 
> not-immediately-consistent dfs corner cases
> -----------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-6251
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6251
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: jobhistoryserver, mrv2
>    Affects Versions: 2.6.0
>            Reporter: Craig Welch
>            Assignee: Craig Welch
>         Attachments: MAPREDUCE-6251.0.patch, MAPREDUCE-6251.1.patch, 
> MAPREDUCE-6251.2.patch, MAPREDUCE-6251.3.patch
>
>
> The JobClient is used to get job status information for running and completed 
> jobs.  Final state and history for a job is communicated from the application 
> master to the job history server via a distributed file system - where the 
> history is uploaded by the application master to the dfs and then 
> scanned/loaded by the jobhistory server.  While HDFS has strong consistency 
> guarantees not all Hadoop DFS's do.  When used in conjunction with a 
> distributed file system which does not have this guarantee there will be 
> cases where the history server may not see an uploaded file, resulting in the 
> dreaded "no such job" and a null value for the RunningJob in the client.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to