[ 
https://issues.apache.org/jira/browse/MAPREDUCE-7126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16584113#comment-16584113
 ] 

Jason Lowe commented on MAPREDUCE-7126:
---------------------------------------

This report needs more information before anyone can tell whether there is a 
real issue here.  You will need to check the RM logs to see why it told the AM 
that its attempt was unrecognized.  One theory is that the AM was running on a 
node where the nodemanager crashed.  If that occurs, the RM will eventually 
expire the nodemanager due to lack of heartbeats and consider all of the 
containers on that node lost.  When the AM next heartbeats to the RM, I would 
expect the RM to reply that the AM is no longer recognized, since the RM 
considers that app attempt dead (being on a lost node).  Having the AM shut 
down (without unregistering!) is appropriate in that case.
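
The sequence in that theory can be sketched as a toy model.  This is not Hadoop 
code: the class, the method names, and the 600-second expiry value are all 
hypothetical, chosen only to illustrate the order of events.  In particular, 
the sketch runs the expiry check inside the AM heartbeat for simplicity, which 
is an assumption of the model and not a claim about how the RM schedules it.

```python
from dataclasses import dataclass, field


@dataclass
class ToyResourceManager:
    """Toy model of the RM bookkeeping described above (hypothetical, not Hadoop)."""
    nm_expiry_secs: float  # assumed expiry interval for nodemanager heartbeats
    last_nm_heartbeat: dict = field(default_factory=dict)  # node -> last heartbeat time
    attempt_node: dict = field(default_factory=dict)       # live attempt -> node

    def register_attempt(self, attempt_id, node, now):
        self.attempt_node[attempt_id] = node
        self.last_nm_heartbeat.setdefault(node, now)

    def nm_heartbeat(self, node, now):
        self.last_nm_heartbeat[node] = now

    def expire_nodes(self, now):
        """Drop nodes that stopped heartbeating, plus every attempt on them."""
        lost = {node for node, seen in self.last_nm_heartbeat.items()
                if now - seen > self.nm_expiry_secs}
        for node in lost:
            del self.last_nm_heartbeat[node]
        self.attempt_node = {a: n for a, n in self.attempt_node.items()
                             if n not in lost}
        return lost

    def am_heartbeat(self, attempt_id, now):
        """Return True while the attempt is still recognized."""
        self.expire_nodes(now)  # simplification: expiry is checked here
        return attempt_id in self.attempt_node


def am_loop(rm, attempt_id, times):
    """Heartbeat at each time; stop without unregistering once unrecognized."""
    for now in times:
        if not rm.am_heartbeat(attempt_id, now):
            return f"shutdown at t={now} without unregistering"
    return "still running"


rm = ToyResourceManager(nm_expiry_secs=600)
rm.register_attempt("attempt_1", "node_a", now=0)
rm.nm_heartbeat("node_a", now=10)  # last heartbeat before the nodemanager crashes
print(am_loop(rm, "attempt_1", times=[100, 500, 700]))
```

In this model the AM's heartbeats at t=100 and t=500 still succeed because the 
node has not yet expired; only the heartbeat at t=700 comes back unrecognized, 
at which point the AM stops without unregistering.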



> Error communicating with RM: Resource Manager doesn't recognize AttemptId: 
> appattempt_idxx
> ------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-7126
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7126
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mr-am
>    Affects Versions: 2.6.0
>            Reporter: Avdhesh kumar
>            Priority: Major
>





