[ 
https://issues.apache.org/jira/browse/MAPREDUCE-5547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13914835#comment-13914835
 ] 

Zhijie Shen commented on MAPREDUCE-5547:
----------------------------------------

This issue has been left for a while. According to the previous discussion, as 
to the problem that the history file is available on JSH while AM fails 
unregistration, the straightforward solution is to move the history file to the 
intermediate done dir after unregistration. If we don't do that, the job has a 
second retry, and the second retry fails, we will still see a successful job on 
JHS, because according to my investigation, JHS will neglect the duplicate 
history file.

Jason has the concern that if we do that, the client will be redirected to JHS 
before the history file is copied. I've done more investigation. The client 
service seems to be the last one to be stopped by MRAppMaster. On the other 
side, realProxy will cached until the connect get failed. Therefore, we just 
need to make sure copying the history file to the intermediate done dir after 
unregistration and before stopping the client service, and we should still be 
safe. Please correct me if I'm wrong.

Any thoughts?

> Job history should not be flushed to JHS until AM gets unregistered
> -------------------------------------------------------------------
>
>                 Key: MAPREDUCE-5547
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-5547
>             Project: Hadoop Map/Reduce
>          Issue Type: Sub-task
>            Reporter: Zhijie Shen
>            Assignee: Zhijie Shen
>




--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to