检查一下作业是否有反压的情况?目前我们也有遇到这种情况就是作业在反压的情况下,对作业做一次 savepoint 其实是很难完成的,经常超时,社区目前的版本还不支持单独设置 savepoint 的超时时间。
刘海 <liuha...@163.com> 于2021年1月21日周四 上午10:24写道: > Hi > 我目前在进行保存点相关的测试,目前执行命令报如下错误,从错误内容上看是超时,但是没有更多的信息了,有知道大致原因希望指点一下,拜谢 > > > flink1.12 yarn-per-job 模式 > jobID:fea3d87f138ef4c260ffe9324acc0e51 > yarnID : application_1610788069646_0021 > 执行的命令如下: > ./bin/flink savepoint -t yarn-per-job -D > yarn.application.id=application_1610788069646_0021 > fea3d87f138ef4c260ffe9324acc0e51 > > > 报错如下: > > > org.apache.flink.util.FlinkException: Triggering a savepoint for the job > fea3d87f138ef4c260ffe9324acc0e51 failed. > at > org.apache.flink.client.cli.CliFrontend.triggerSavepoint(CliFrontend.java:712) > at > org.apache.flink.client.cli.CliFrontend.lambda$savepoint$9(CliFrontend.java:690) > at > org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:919) > at > org.apache.flink.client.cli.CliFrontend.savepoint(CliFrontend.java:687) > at > org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:989) > at > org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1047) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1875) > at > org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41) > at > org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1047) > Caused by: java.util.concurrent.TimeoutException > at > org.apache.flink.runtime.concurrent.FutureUtils$Timeout.run(FutureUtils.java:1168) > at > org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:211) > at > org.apache.flink.runtime.concurrent.FutureUtils.lambda$orTimeout$15(FutureUtils.java:549) > at > java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180) > at > java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) > at java.lang.Thread.run(Thread.java:748) > > > 祝好! > | | > 刘海 > | > | > liuha...@163.com > | > 签名由网易邮箱大师定制