[ https://issues.apache.org/jira/browse/FLINK-22932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Yun Tang updated FLINK-22932: ----------------------------- Fix Version/s: (was: 1.13.2) 1.13.3 > RocksDBStateBackendWindowITCase fails with savepoint timeout > ------------------------------------------------------------ > > Key: FLINK-22932 > URL: https://issues.apache.org/jira/browse/FLINK-22932 > Project: Flink > Issue Type: Bug > Components: Runtime / Coordination > Affects Versions: 1.13.1, 1.12.4 > Reporter: Roman Khachatryan > Priority: Major > Labels: auto-deprioritized-critical, test-stability > Fix For: 1.13.3 > > > Initially > [reported|https://issues.apache.org/jira/browse/FLINK-22067?focusedCommentId=17358306&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17358306] > in FLINK-22067 > https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=18709&view=logs&j=a8bc9173-2af6-5ba8-775c-12063b4f1d54&t=46a16c18-c679-5905-432b-9be5d8e27bc6&l=10183 > Savepoint is triggered but is not completed in time. > {noformat} > 2021-06-06T22:27:46.4845045Z Jun 06 22:27:46 java.lang.RuntimeException: > Failed to take savepoint > 2021-06-06T22:27:46.4846088Z Jun 06 22:27:46 at > org.apache.flink.state.api.utils.SavepointTestBase.takeSavepoint(SavepointTestBase.java:71) > 2021-06-06T22:27:46.4847049Z Jun 06 22:27:46 at > org.apache.flink.state.api.utils.SavepointTestBase.takeSavepoint(SavepointTestBase.java:46) > 2021-06-06T22:27:46.4848262Z Jun 06 22:27:46 at > org.apache.flink.state.api.SavepointWindowReaderITCase.testApplyEvictorWindowStateReader(SavepointWindowReaderITCase.java:350) > 2021-06-06T22:27:46.4854133Z Jun 06 22:27:46 at > sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > 2021-06-06T22:27:46.4855430Z Jun 06 22:27:46 at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) > 2021-06-06T22:27:46.4856528Z Jun 06 22:27:46 at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > 2021-06-06T22:27:46.4857487Z Jun 06 22:27:46 at > java.lang.reflect.Method.invoke(Method.java:498) > 2021-06-06T22:27:46.4858685Z Jun 06 22:27:46 at > org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50) > 2021-06-06T22:27:46.4859773Z Jun 06 22:27:46 at > org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12) > 2021-06-06T22:27:46.4860964Z Jun 06 22:27:46 at > org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47) > 2021-06-06T22:27:46.4862306Z Jun 06 22:27:46 at > org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17) > 2021-06-06T22:27:46.4863756Z Jun 06 22:27:46 at > org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27) > 2021-06-06T22:27:46.4864993Z Jun 06 22:27:46 at > org.apache.flink.util.TestNameProvider$1.evaluate(TestNameProvider.java:45) > 2021-06-06T22:27:46.4866179Z Jun 06 22:27:46 at > org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:55) > 2021-06-06T22:27:46.4867272Z Jun 06 22:27:46 at > org.junit.rules.RunRules.evaluate(RunRules.java:20) > 2021-06-06T22:27:46.4868255Z Jun 06 22:27:46 at > org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325) > 2021-06-06T22:27:46.4869045Z Jun 06 22:27:46 at > org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78) > 2021-06-06T22:27:46.4869902Z Jun 06 22:27:46 at > org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57) > 2021-06-06T22:27:46.4871038Z Jun 06 22:27:46 at > org.junit.runners.ParentRunner$3.run(ParentRunner.java:290) > 2021-06-06T22:27:46.4871756Z Jun 06 22:27:46 at > org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71) > 2021-06-06T22:27:46.4872502Z Jun 06 22:27:46 at > org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288) > 2021-06-06T22:27:46.4873389Z Jun 06 22:27:46 at > org.junit.runners.ParentRunner.access$000(ParentRunner.java:58) > 2021-06-06T22:27:46.4874150Z Jun 06 22:27:46 at > org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268) > 2021-06-06T22:27:46.4874914Z Jun 06 22:27:46 at > org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48) > 2021-06-06T22:27:46.4875661Z Jun 06 22:27:46 at > org.junit.rules.ExternalResource$1.evaluate(ExternalResource.java:48) > 2021-06-06T22:27:46.4876382Z Jun 06 22:27:46 at > org.junit.rules.RunRules.evaluate(RunRules.java:20) > 2021-06-06T22:27:46.4877018Z Jun 06 22:27:46 at > org.junit.runners.ParentRunner.run(ParentRunner.java:363) > 2021-06-06T22:27:46.4877661Z Jun 06 22:27:46 at > org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365) > 2021-06-06T22:27:46.4878522Z Jun 06 22:27:46 at > org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:273) > 2021-06-06T22:27:46.4879506Z Jun 06 22:27:46 at > org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:238) > 2021-06-06T22:27:46.4880246Z Jun 06 22:27:46 at > org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159) > 2021-06-06T22:27:46.4881025Z Jun 06 22:27:46 at > org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:384) > 2021-06-06T22:27:46.4881839Z Jun 06 22:27:46 at > org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:345) > 2021-06-06T22:27:46.4882650Z Jun 06 22:27:46 at > org.apache.maven.surefire.booter.ForkedBooter.execute(ForkedBooter.java:126) > 2021-06-06T22:27:46.4883596Z Jun 06 22:27:46 at > org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:418) > 2021-06-06T22:27:46.4884971Z Jun 06 22:27:46 Caused by: > java.util.concurrent.ExecutionException: > java.util.concurrent.TimeoutException: Invocation of public default > java.util.concurrent.CompletableFuture > org.apache.flink.runtime.webmonitor.RestfulGateway.triggerSavepoint(org.apache.flink.api.common.JobID,java.lang.String,boolean,org.apache.flink.api.common.time.Time) > timed out. > 2021-06-06T22:27:46.4886218Z Jun 06 22:27:46 at > java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357) > 2021-06-06T22:27:46.4887018Z Jun 06 22:27:46 at > java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1928) > 2021-06-06T22:27:46.4887787Z Jun 06 22:27:46 at > org.apache.flink.state.api.utils.SavepointTestBase.takeSavepoint(SavepointTestBase.java:69) > 2021-06-06T22:27:46.4888521Z Jun 06 22:27:46 ... 34 more > 2021-06-06T22:27:46.4889560Z Jun 06 22:27:46 Caused by: > java.util.concurrent.TimeoutException: Invocation of public default > java.util.concurrent.CompletableFuture > org.apache.flink.runtime.webmonitor.RestfulGateway.triggerSavepoint(org.apache.flink.api.common.JobID,java.lang.String,boolean,org.apache.flink.api.common.time.Time) > timed out. > 2021-06-06T22:27:46.4890708Z Jun 06 22:27:46 at > com.sun.proxy.$Proxy32.triggerSavepoint(Unknown Source) > 2021-06-06T22:27:46.4891470Z Jun 06 22:27:46 at > org.apache.flink.runtime.minicluster.MiniCluster.lambda$triggerSavepoint$8(MiniCluster.java:716) > 2021-06-06T22:27:46.4892292Z Jun 06 22:27:46 at > java.util.concurrent.CompletableFuture.uniApply(CompletableFuture.java:616) > 2021-06-06T22:27:46.4893139Z Jun 06 22:27:46 at > java.util.concurrent.CompletableFuture.uniApplyStage(CompletableFuture.java:628) > 2021-06-06T22:27:46.4894022Z Jun 06 22:27:46 at > java.util.concurrent.CompletableFuture.thenApply(CompletableFuture.java:1996) > 2021-06-06T22:27:46.4894810Z Jun 06 22:27:46 at > org.apache.flink.runtime.minicluster.MiniCluster.runDispatcherCommand(MiniCluster.java:751) > 2021-06-06T22:27:46.4895876Z Jun 06 22:27:46 at > org.apache.flink.runtime.minicluster.MiniCluster.triggerSavepoint(MiniCluster.java:714) > 2021-06-06T22:27:46.4896736Z Jun 06 22:27:46 at > org.apache.flink.client.program.MiniClusterClient.triggerSavepoint(MiniClusterClient.java:101) > 2021-06-06T22:27:46.4897610Z Jun 06 22:27:46 at > org.apache.flink.state.api.utils.SavepointTestBase.triggerSavepoint(SavepointTestBase.java:93) > 2021-06-06T22:27:46.4898651Z Jun 06 22:27:46 at > org.apache.flink.state.api.utils.SavepointTestBase.lambda$takeSavepoint$0(SavepointTestBase.java:68) > 2021-06-06T22:27:46.4899492Z Jun 06 22:27:46 at > java.util.concurrent.CompletableFuture.uniCompose(CompletableFuture.java:966) > 2021-06-06T22:27:46.4900311Z Jun 06 22:27:46 at > java.util.concurrent.CompletableFuture$UniCompose.tryFire(CompletableFuture.java:940) > 2021-06-06T22:27:46.4901105Z Jun 06 22:27:46 at > java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:488) > 2021-06-06T22:27:46.4901882Z Jun 06 22:27:46 at > java.util.concurrent.CompletableFuture$AsyncRun.run(CompletableFuture.java:1646) > 2021-06-06T22:27:46.4902703Z Jun 06 22:27:46 at > java.util.concurrent.CompletableFuture$AsyncRun.exec(CompletableFuture.java:1632) > 2021-06-06T22:27:46.4903544Z Jun 06 22:27:46 at > java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289) > 2021-06-06T22:27:46.4904457Z Jun 06 22:27:46 at > java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056) > 2021-06-06T22:27:46.4905221Z Jun 06 22:27:46 at > java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692) > 2021-06-06T22:27:46.4905948Z Jun 06 22:27:46 at > java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:175) > 2021-06-06T22:27:46.4908488Z Jun 06 22:27:46 Caused by: > akka.pattern.AskTimeoutException: Ask timed out on > [Actor[akka://flink/user/rpc/dispatcher_2#1085446192]] after [10000 ms]. > Message of type [org.apache.flink.runtime.rpc.messages.LocalFencedMessage]. A > typical reason for `AskTimeoutException` is that the recipient actor didn't > send a reply. > 2021-06-06T22:27:46.4909806Z Jun 06 22:27:46 at > akka.pattern.PromiseActorRef$.$anonfun$defaultOnTimeout$1(AskSupport.scala:635) > 2021-06-06T22:27:46.4910572Z Jun 06 22:27:46 at > akka.pattern.PromiseActorRef$.$anonfun$apply$1(AskSupport.scala:650) > 2021-06-06T22:27:46.4911233Z Jun 06 22:27:46 at > akka.actor.Scheduler$$anon$4.run(Scheduler.scala:205) > 2021-06-06T22:27:46.4911980Z Jun 06 22:27:46 at > scala.concurrent.Future$InternalCallbackExecutor$.unbatchedExecute(Future.scala:870) > 2021-06-06T22:27:46.4912770Z Jun 06 22:27:46 at > scala.concurrent.BatchingExecutor.execute(BatchingExecutor.scala:109) > 2021-06-06T22:27:46.4913636Z Jun 06 22:27:46 at > scala.concurrent.BatchingExecutor.execute$(BatchingExecutor.scala:103) > 2021-06-06T22:27:46.4914406Z Jun 06 22:27:46 at > scala.concurrent.Future$InternalCallbackExecutor$.execute(Future.scala:868) > 2021-06-06T22:27:46.4915259Z Jun 06 22:27:46 at > akka.actor.LightArrayRevolverScheduler$TaskHolder.executeTask(LightArrayRevolverScheduler.scala:328) > 2021-06-06T22:27:46.4916164Z Jun 06 22:27:46 at > akka.actor.LightArrayRevolverScheduler$$anon$3.executeBucket$1(LightArrayRevolverScheduler.scala:279) > 2021-06-06T22:27:46.4917078Z Jun 06 22:27:46 at > akka.actor.LightArrayRevolverScheduler$$anon$3.nextTick(LightArrayRevolverScheduler.scala:283) > 2021-06-06T22:27:46.4917924Z Jun 06 22:27:46 at > akka.actor.LightArrayRevolverScheduler$$anon$3.run(LightArrayRevolverScheduler.scala:235) > 2021-06-06T22:27:46.4918737Z Jun 06 22:27:46 at > java.lang.Thread.run(Thread.java:748) > {noformat} -- This message was sent by Atlassian Jira (v8.3.4#803005)