检查一下作业是否有反压的情况?目前我们也有遇到这种情况就是作业在反压的情况下,对作业做一次 savepoint
其实是很难完成的,经常超时,社区目前的版本还不支持单独设置 savepoint 的超时时间。


刘海 <liuha...@163.com> 于2021年1月21日周四 上午10:24写道:

> Hi
>  我目前在进行保存点相关的测试,目前执行命令报如下错误,从错误内容上看是超时,但是没有更多的信息了,有知道大致原因希望指点一下,拜谢
>
>
> flink1.12 yarn-per-job 模式
> jobID:fea3d87f138ef4c260ffe9324acc0e51
> yarnID : application_1610788069646_0021
> 执行的命令如下:
> ./bin/flink savepoint -t yarn-per-job -D 
> yarn.application.id=application_1610788069646_0021
> fea3d87f138ef4c260ffe9324acc0e51
>
>
> 报错如下:
>
>
> org.apache.flink.util.FlinkException: Triggering a savepoint for the job
> fea3d87f138ef4c260ffe9324acc0e51 failed.
>         at
> org.apache.flink.client.cli.CliFrontend.triggerSavepoint(CliFrontend.java:712)
>         at
> org.apache.flink.client.cli.CliFrontend.lambda$savepoint$9(CliFrontend.java:690)
>         at
> org.apache.flink.client.cli.CliFrontend.runClusterAction(CliFrontend.java:919)
>         at
> org.apache.flink.client.cli.CliFrontend.savepoint(CliFrontend.java:687)
>         at
> org.apache.flink.client.cli.CliFrontend.parseAndRun(CliFrontend.java:989)
>         at
> org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:1047)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at javax.security.auth.Subject.doAs(Subject.java:422)
>         at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1875)
>         at
> org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
>         at
> org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:1047)
> Caused by: java.util.concurrent.TimeoutException
>         at
> org.apache.flink.runtime.concurrent.FutureUtils$Timeout.run(FutureUtils.java:1168)
>         at
> org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:211)
>         at
> org.apache.flink.runtime.concurrent.FutureUtils.lambda$orTimeout$15(FutureUtils.java:549)
>         at
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>         at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>         at
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
>         at
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
>         at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>         at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>         at java.lang.Thread.run(Thread.java:748)
>
>
> 祝好!
> | |
> 刘海
> |
> |
> liuha...@163.com
> |
> 签名由网易邮箱大师定制

回复