Yangze Guo created FLINK-24161:
----------------------------------

             Summary: Can not stop the job with savepoint while a task is 
finishing
                 Key: FLINK-24161
                 URL: https://issues.apache.org/jira/browse/FLINK-24161
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Checkpointing
    Affects Versions: 1.14.0
            Reporter: Yangze Guo
             Fix For: 1.14.0


When stop the job with savepoint, if there is a task is finishing, the action 
will be timeout.

Testing job: 
https://github.com/KarmaGYZ/flink/blob/test-147/flink-examples/flink-examples-streaming/src/main/java/org/apache/flink/streaming/examples/wordcount/WordCount.java

Flink conf:

{code:bash}
state.savepoints.dir: /tmp/flink-savepoints
state.backend: rocksdb
state.backend.incremental: true
state.checkpoints.dir: file:///tmp/flink-ckp/
execution.checkpointing.aligned-checkpoint-timeout: 30 s
execution.checkpointing.interval: 5 s
taskmanager.numberOfTaskSlots: 2
{code}

How to reproduce:

{code:bash}
bin/flink run -d -p 4 examples/streaming/WordCount.jar
# while one task is finishing
bin/flink stop $JOB_ID
{code}

Client log:

{code:bash}
------------------------------------------------------------
 The program finished with the following exception:

org.apache.flink.util.FlinkException: Could not stop with a savepoint job 
"e139a2eba7f8dc0b07fab65e84421ee4".
  at org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:581)
  at 
org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:1002)
  at org.apache.flink.client.cli.CliFrontend.stop(CliFrontend.java:569)
  at org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:1069)
  at 
org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1132)
  at 
org.apache.flink.runtime.security.contexts.NoOpSecurityContext.runSecured(NoOpSecurityContext.java:28)
  at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1132)
Caused by: java.util.concurrent.TimeoutException
  at 
java.util.concurrent.CompletableFuture.timedGet(CompletableFuture.java:1771)
  at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1915)
  at org.apache.flink.client.cli.CliFrontend.lambda$stop$5(CliFrontend.java:579)
  ... 6 more
{code}




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to