[ 
https://issues.apache.org/jira/browse/FLINK-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14327632#comment-14327632
 ] 

ASF GitHub Bot commented on FLINK-1556:
---------------------------------------

GitHub user tillrohrmann opened a pull request:

    https://github.com/apache/flink/pull/422

    [FLINK-1556] Corrects faulty JobClient behaviour in case of a submission 
failure

    Corrects the behaviour of the ```JobClient``` in case of a submission 
failure. The PR also contains test cases for the job submission.
    
    Additionally, reworked how exceptions are transmitted from the 
```JobManager``` to the ```JobClient```. They are directly wrapped into a 
```akka.actor.Status.Failure``` and send to the ```JobClient```.
    
    This PR is based on #419.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/tillrohrmann/flink fixSubmissionExceptions

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/flink/pull/422.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #422
    
----
commit 8cc604d61d75370972146333c5a016b5fcdddc77
Author: Till Rohrmann <trohrm...@apache.org>
Date:   2015-02-19T10:04:56Z

    [FLINK-1584] [runtime][tests] Fixes TaskManagerFailsITCase by replacing the 
TestingCluster with a ForkableFlinkMiniCluster

commit 8ecca959d2bf96fa8be1961b413f4a2c45cf50e1
Author: Till Rohrmann <trohrm...@apache.org>
Date:   2015-02-19T11:44:32Z

    [FLINK-1556] [runtime] Fails jobs properly in case of a job submission 
exception
    
    Conflicts:
        
flink-runtime/src/test/scala/org/apache/flink/runtime/testingUtils/TestingUtils.scala
        
flink-tests/src/test/scala/org/apache/flink/api/scala/runtime/taskmanager/TaskManagerFailsITCase.scala

----


> JobClient does not wait until a job failed completely if submission exception
> -----------------------------------------------------------------------------
>
>                 Key: FLINK-1556
>                 URL: https://issues.apache.org/jira/browse/FLINK-1556
>             Project: Flink
>          Issue Type: Bug
>            Reporter: Till Rohrmann
>            Assignee: Till Rohrmann
>
> If an exception occurs during job submission the {{JobClient}} received a 
> {{SubmissionFailure}}. Upon receiving this message, the {{JobClient}} 
> terminates itself and returns the error to the {{Client}}. This indicates to 
> the user that the job has been completely failed which is not necessarily 
> true. 
> If the user directly after such a failure submits another job, then it might 
> be the case that not all slots of the formerly failed job are returned. This 
> can lead to a {{NoRessourceAvailableException}}.
> We can solve this problem by waiting for the completion of the job failure in 
> the {{JobClient}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to