[ 
https://issues.apache.org/jira/browse/SPARK-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15031843#comment-15031843
 ] 

Thomas Graves commented on SPARK-4117:
--------------------------------------

[~devaraj.k]  thanks for explaining.  Sounds good on 
ApplicationMasterNotRegisteredException since the AMRMClientImpl is handling 
it.  

For ApplicationAttemptNotFoundException, you hit one of the places that 
allocate is called and that is on the first registration. There is another one 
in the launchReporterThread that regularly gets called after starting.  This is 
the one that catches exceptions and will wait for a number of failures before 
finally exits.  So if ApplicationAttemptNotFoundException is sent anytime after 
the application is running it will hit that logic.  I don't think its that big 
of an issue since it will eventually exit, it could just take a little longer.  
It looks like the only cases that should be thrown is if we have already 
unregistered or something weird happened on RM Where it lost the application.

> Spark on Yarn handle AM being told command from RM
> --------------------------------------------------
>
>                 Key: SPARK-4117
>                 URL: https://issues.apache.org/jira/browse/SPARK-4117
>             Project: Spark
>          Issue Type: Improvement
>          Components: YARN
>    Affects Versions: 1.2.0
>            Reporter: Thomas Graves
>
> In the allocateResponse from the RM it can send commands that the AM should 
> follow. for instance AM_RESYNC and AM_SHUTDOWN.  We should add support for 
> those.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to