GitHub user uncleGen opened a pull request:

    https://github.com/apache/spark/pull/1429

    Bug Fix: In yarn-cluster mode, ApplicationMaster does not clean up corre...

    ...ctly at the end of the job if users call sc.stop manually. There are two 
minor bugs:
    
    1. At the end of a job, "ApplicationMaster" will ask for resources and 
launch containers repeatedly. These newly launched containers keep trying to 
contact with the already-stopped driver and fianally fail. When the container 
failure number reaches the max value, this job's FinalStatus wil be marked as 
FAILED.
    2.  After the driver asks to stop executors, "BlockManager" of executors 
will try to contact  the already-stoppped "BlockManagerMasterActor" of the 
driver. Unfortunately, all the connecting attempts will fail, and then the 
executor process will crash with a non-zero Exit Code.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/uncleGen/spark master_bugfix

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/1429.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #1429
    
----
commit b2d7971ada929092cdb1e0eab443ef79266bd27d
Author: uncleGen <husty...@gmail.com>
Date:   2014-07-16T04:05:17Z

    Bug Fix: In yarn-cluster mode, ApplicationMaster does not clean up 
correctly at the end of the job if users call sc.stop manually

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

Reply via email to