Joey Echeverria created FLINK-10078:
---------------------------------------

             Summary: Dispatcher should only block job submission during recovery
                 Key: FLINK-10078
                 URL: https://issues.apache.org/jira/browse/FLINK-10078
             Project: Flink
          Issue Type: Improvement
    Affects Versions: 1.5.0
            Reporter: Joey Echeverria


The Dispatcher currently does not confirm leadership until all jobs are 
recovered. As a result, no operation that requires an active Dispatcher can 
work until job recovery has finished. The delay exists primarily to prevent 
race conditions between client retries and recovering jobs. An alternative 
approach would be to block only job submission while recovery is in progress.

Note: we also need to check that no other RPCs change the internal state in 
such a way that it interferes with the job recovery.
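
As a rough illustration of the alternative described above, here is a minimal 
sketch in which read-only RPCs are served right away while job submission 
waits on a recovery future. The class RecoveryGatedDispatcher and all of its 
methods are hypothetical names invented for this example; this is not Flink's 
actual Dispatcher API, and it ignores leader election entirely.

```java
import java.util.Map;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical sketch only; not the real Flink Dispatcher. */
class RecoveryGatedDispatcher {
    // Completed once every persisted job has been recovered.
    private final CompletableFuture<Void> recoveryDone = new CompletableFuture<>();

    // Job id -> origin of the job ("RECOVERED" or "SUBMITTED").
    private final Map<String, String> jobs = new ConcurrentHashMap<>();

    /** Called by the (assumed) recovery routine for each persisted job. */
    void recoverJob(String jobId) {
        jobs.put(jobId, "RECOVERED");
    }

    /** Called by the (assumed) recovery routine when it is finished. */
    void finishRecovery() {
        recoveryDone.complete(null);
    }

    /** Read-only RPC: answered immediately, even while recovery is running. */
    CompletableFuture<Integer> requestJobCount() {
        return CompletableFuture.completedFuture(jobs.size());
    }

    /**
     * Mutating RPC: deferred until recovery has finished, so a client retry
     * cannot race with the recovery of the same job. Yields false if the job
     * is already known, for example because it was just recovered.
     */
    CompletableFuture<Boolean> submitJob(String jobId) {
        return recoveryDone.thenApply(
                ignored -> jobs.putIfAbsent(jobId, "SUBMITTED") == null);
    }
}
```

In this sketch a client retry for a job that is still being recovered stays 
pending until finishRecovery() is called and is then rejected as a duplicate, 
while requestJobCount() responds throughout. Any other state-changing RPC 
would presumably need the same gate, in line with the note above.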



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
