Aitozi created FLINK-29308:
------------------------------

             Summary: NoResourceAvailableException fails the batch job
                 Key: FLINK-29308
                 URL: https://issues.apache.org/jira/browse/FLINK-29308
             Project: Flink
          Issue Type: Improvement
          Components: Runtime / Coordination
            Reporter: Aitozi


When running batch job configured with the following restart strategy
{code:java}
restart-strategy: fixed-delay
restart-strategy.fixed-delay.delay: 15 s
restart-strategy.fixed-delay.attempts: 10 {code}
If the cluster resource is not enough to run the single stage, it can run 
partial of the stage, but it still will fail after the 10 times 
\{{NoResourceAvailableException}}. IMO, for batch job the 
\{{NoResourceAvailableException}} do not necessary to trigger the job to fail. 
Or at least this failure reason are not share the same restart strategy with 
other failure reasons



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to