[ 
https://issues.apache.org/jira/browse/FLINK-1484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14308540#comment-14308540
 ] 

ASF GitHub Bot commented on FLINK-1484:
---------------------------------------

Github user hsaputra commented on a diff in the pull request:

    https://github.com/apache/flink/pull/368#discussion_r24220454
  
    --- Diff: 
flink-runtime/src/main/scala/org/apache/flink/runtime/jobmanager/JobManager.scala
 ---
    @@ -125,6 +126,10 @@ Actor with ActorLogMessages with ActorLogging {
       override def postStop(): Unit = {
         log.info(s"Stopping job manager ${self.path}.")
     
    +    // disconnect the registered task managers
    +    instanceManager.getAllRegisteredInstances.asScala.foreach{
    +      _.getTaskManager ! Disconnected("JobManager is stopping")}
    +
         for((e,_) <- currentJobs.values){
           e.fail(new Exception("The JobManager is shutting down."))
    --- End diff --
    
    Since we are cleaning up messages, maybe remove "The" so it is consistent 
with other messages.


> JobManager restart does not notify the TaskManager
> --------------------------------------------------
>
>                 Key: FLINK-1484
>                 URL: https://issues.apache.org/jira/browse/FLINK-1484
>             Project: Flink
>          Issue Type: Bug
>            Reporter: Till Rohrmann
>
> In case of a JobManager restart, which can happen due to an uncaught 
> exception, the JobManager is restarted. However, connected TaskManager are 
> not informed about the disconnection and continue sending messages to a 
> JobManager with a reseted state. 
> TaskManager should be informed about a possible restart and cleanup their own 
> state in such a case. Afterwards, they can try to reconnect to a restarted 
> JobManager.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to