Steven Zhen Wu created FLINK-8042:
-------------------------------------

             Summary: retry individual failover-strategy for some time first 
before reverting to full job restart
                 Key: FLINK-8042
                 URL: https://issues.apache.org/jira/browse/FLINK-8042
             Project: Flink
          Issue Type: Bug
          Components: ResourceManager
    Affects Versions: 1.3.2
            Reporter: Steven Zhen Wu


Let's say we kill a taskmanager node. When Flink attempts fine-grained 
recovery and fails because the replacement taskmanager node didn't come back 
in time, it reverts to a full job restart. 

Stephan and Till were suggesting that Flink can/should retry fine-grained 
recovery for some time before giving up and reverting to a full job restart.
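
As a rough illustration of the suggested behavior (not Flink code), the retry 
window could look like the sketch below. All names here 
(recover_with_fallback, try_fine_grained, full_restart, retry_window_s, 
retry_delay_s) are hypothetical and chosen only for this example; the real 
failover-strategy interfaces in Flink are different.

```python
import time
from typing import Callable


def recover_with_fallback(
    try_fine_grained: Callable[[], bool],   # hypothetical: one fine-grained recovery attempt
    full_restart: Callable[[], None],       # hypothetical: triggers a full job restart
    retry_window_s: float = 60.0,           # assumed: how long to keep retrying
    retry_delay_s: float = 5.0,             # assumed: pause between attempts
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Retry fine-grained recovery until the window expires, then fall back."""
    deadline = clock() + retry_window_s
    while True:
        # Each attempt may fail simply because the replacement node is not back yet.
        if try_fine_grained():
            return "fine-grained"
        remaining = deadline - clock()
        if remaining <= 0:
            break
        # Never sleep past the end of the retry window.
        sleep(min(retry_delay_s, remaining))
    # The window is exhausted: give up and restart the whole job.
    full_restart()
    return "full-restart"
```

The clock and sleep parameters are injectable only so the sketch can be 
exercised without real waiting; whether a fixed time window or a fixed number 
of attempts is the better bound is left open here.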



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
