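Hi, did the YARN application for this job actually start? If it did, take a look at the JM (JobManager) log. If it did not, check on the submitting client for more detailed submission logs. By default the log directory is `/opt/flink-1.10.0/log`.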
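In case it helps, here is a rough sketch of the two checks. The function names are made up for illustration, `<application-id>` style arguments are placeholders you need to fill in, and the `flink-*-client-*.log` file name pattern is my assumption about how the client log is named in your setup, so adjust it if your files look different. `yarn logs` usually needs log aggregation to be enabled and may only return output once the application has finished; otherwise the YARN web UI is the place to look.

```shell
#!/usr/bin/env bash
# Sketch only. The default directory is the one mentioned above; override
# FLINK_LOG_DIR if your installation differs.
FLINK_LOG_DIR="${FLINK_LOG_DIR:-/opt/flink-1.10.0/log}"

# Print the most recently modified client log in a directory (empty if none).
# Assumption: client logs match flink-*-client-*.log.
latest_client_log() {
  ls -t "${1:-$FLINK_LOG_DIR}"/flink-*-client-*.log 2>/dev/null | head -n 1
}

# Case 1: the application started. Fetch its aggregated container logs,
# which include the JobManager log. $1 is the YARN application ID.
show_jm_log() {
  yarn logs -applicationId "$1"
}

# Case 2: the application never started. Show the tail of the newest
# client-side submission log. $1 is an optional log directory.
show_client_log() {
  local log
  log="$(latest_client_log "$1")"
  if [ -n "$log" ]; then
    tail -n 200 "$log"
  else
    echo "no client log found" >&2
    return 1
  fi
}
```

To find the application ID first, `yarn application -list -appStates ALL` lists applications in every state, including killed ones. Then call `show_jm_log` with that ID, or `show_client_log` with no arguments if nothing was ever started.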

Best,
Congxian


Zhou Zach <wander...@163.com> wrote on Fri, Jun 19, 2020 at 8:15 PM:

> I am using per-job mode, not yarn session mode.
>
> At 2020-06-19 20:06:47, "Rui Li" <lirui.fu...@gmail.com> wrote:
> >Then you need to restart the yarn session and submit the job again.
> >
> >On Fri, Jun 19, 2020 at 6:22 PM Zhou Zach <wander...@163.com> wrote:
> >
> >> I killed the YARN application with yarn application kill. After it was killed, YARN did not restart the Flink job.
> >>
> >> At 2020-06-19 17:54:45, "Rui Li" <lirui.fu...@gmail.com> wrote:
> >> >By "yarn application kill flink job", do you mean you killed the YARN application? Did it restart after being killed?
> >> >
> >> >On Fri, Jun 19, 2020 at 4:09 PM Zhou Zach <wander...@163.com> wrote:
> >> >
> >> >> I added the following two timeout parameters to flink-1.10.0/conf/flink-conf.yaml, but they have no effect:
> >> >> akka.client.timeout: 600000000
> >> >> akka.ask.timeout: 6000000
> >> >>
> >> >> Does anyone know what the cause is?
> >> >>
> >> >> At 2020-06-19 14:57:05, "Zhou Zach" <wander...@163.com> wrote:
> >> >> >After killing the Flink job with yarn application kill, I ran:
> >> >> >/opt/flink-1.10.0/bin/flink run -s /user/flink10/checkpoints/69e450574d8520ac5961e20a6fc4798a/chk-18/_metadata -d -c dataflow.sql.FromKafkaSinkJdbcForCountPerSecond /data/warehouse/streaming/data-flow-1.0.jar
> >> >> >
> >> >> >2020-06-19 14:39:54,563 INFO  org.apache.flink.shaded.curator.org.apache.curator.framework.state.ConnectionStateManager - State change: CONNECTED
> >> >> >2020-06-19 14:39:54,664 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService - Starting ZooKeeperLeaderRetrievalService /leader/rest_server_lock.
> >> >> >2020-06-19 14:40:24,728 INFO  org.apache.flink.runtime.leaderretrieval.ZooKeeperLeaderRetrievalService - Stopping ZooKeeperLeaderRetrievalService /leader/rest_server_lock.
> >> >> >2020-06-19 14:40:24,729 INFO  org.apache.flink.shaded.curator.org.apache.curator.framework.imps.CuratorFrameworkImpl - backgroundOperationsLoop exiting
> >> >> >2020-06-19 14:40:24,733 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ZooKeeper - Session: 0x272b776faca2414 closed
> >> >> >2020-06-19 14:40:24,733 INFO  org.apache.flink.shaded.zookeeper.org.apache.zookeeper.ClientCnxn - EventThread shut down for session: 0x272b776faca2414
> >> >> >2020-06-19 14:40:24,734 ERROR org.apache.flink.client.cli.CliFrontend - Error while running the command.
> >> >> >org.apache.flink.client.program.ProgramInvocationException: The main method caused an error: java.util.concurrent.ExecutionException: org.apache.flink.runtime.client.JobSubmissionException: Failed to submit JobGraph.
> >> >> >    at org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:335)
> >> >> >    at org.apache.flink.client.program.PackagedProgram.invokeInteractiveModeForExecution(PackagedProgram.java:205)
> >> >> >    at org.apache.flink.client.ClientUtils.executeProgram(ClientUtils.java:138)
> >> >> >    at org.apache.flink.client.cli.CliFrontend.executeProgram(CliFrontend.java:664)
> >> >> >    at org.apache.flink.client.cli.CliFrontend.run(CliFrontend.java:213)
> >> >> >    at org.apache.flink.client.cli.CliFrontend.parseParameters(CliFrontend.java:895)
> >> >> >    at org.apache.flink.client.cli.CliFrontend.lambda$main$10(CliFrontend.java:968)
> >> >> >    at java.security.AccessController.doPrivileged(Native Method)
> >> >> >    at javax.security.auth.Subject.doAs(Subject.java:422)
> >> >> >    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1875)
> >> >> >    at org.apache.flink.runtime.security.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
> >> >> >    at org.apache.flink.client.cli.CliFrontend.main(CliFrontend.java:968)
> >> >> >Caused by: java.lang.RuntimeException: java.util.concurrent.ExecutionException: org.apache.flink.runtime.client.JobSubmissionException: Failed to submit JobGraph.
> >> >> >    at org.apache.flink.util.ExceptionUtils.rethrow(ExceptionUtils.java:199)
> >> >> >    at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.executeAsync(StreamExecutionEnvironment.java:1741)
> >> >> >    at org.apache.flink.streaming.api.environment.StreamContextEnvironment.executeAsync(StreamContextEnvironment.java:94)
> >> >> >    at org.apache.flink.streaming.api.environment.StreamContextEnvironment.execute(StreamContextEnvironment.java:63)
> >> >> >    at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:1620)
> >> >> >    at org.apache.flink.table.planner.delegation.StreamExecutor.execute(StreamExecutor.java:42)
> >> >> >    at org.apache.flink.table.api.internal.TableEnvironmentImpl.execute(TableEnvironmentImpl.java:643)
> >> >> >    at cn.ibobei.qile.dataflow.sql.FromKafkaSinkJdbcForCountPerSecond$.main(FromKafkaSinkJdbcForCountPerSecond.scala:120)
> >> >> >    at cn.ibobei.qile.dataflow.sql.FromKafkaSinkJdbcForCountPerSecond.main(FromKafkaSinkJdbcForCountPerSecond.scala)
> >> >> >    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> >> >> >    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
> >> >> >    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> >> >> >    at java.lang.reflect.Method.invoke(Method.java:498)
> >> >> >    at org.apache.flink.client.program.PackagedProgram.callMainMethod(PackagedProgram.java:321)
> >> >> >    ... 11 more
> >> >> >Caused by: java.util.concurrent.ExecutionException: org.apache.flink.runtime.client.JobSubmissionException: Failed to submit JobGraph.
> >> >> >    at java.util.concurrent.CompletableFuture.reportGet(CompletableFuture.java:357)
> >> >> >    at java.util.concurrent.CompletableFuture.get(CompletableFuture.java:1895)
> >> >> >    at org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.executeAsync(StreamExecutionEnvironment.java:1736)
> >> >> >    ... 23 more
> >> >> >Caused by: org.apache.flink.runtime.client.JobSubmissionException: Failed to submit JobGraph.
> >> >> >    at org.apache.flink.client.program.rest.RestClusterClient.lambda$submitJob$7(RestClusterClient.java:359)
> >> >> >    at java.util.concurrent.CompletableFuture.uniExceptionally(CompletableFuture.java:870)
> >> >> >    at java.util.concurrent.CompletableFuture$UniExceptionally.tryFire(CompletableFuture.java:852)
> >> >> >    at java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:474)
> >> >> >    at java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:1977)
> >> >> >    at org.apache.flink.runtime.concurrent.FutureUtils.lambda$retryOperationWithDelay$8(FutureUtils.java:274)
> >> >> >    at java.util.concurrent.CompletableFuture.uniWhenComplete(CompletableFuture.java:760)
> >> >> >    at java.util.concurrent.CompletableFuture$UniWhenComplete.tryFire(CompletableFuture.java:736)
> >> >> >    at java.util.concurrent.CompletableFuture.postComplete(CompletableFuture.java:474)
> >> >> >    at java.util.concurrent.CompletableFuture.completeExceptionally(CompletableFuture.java:1977)
> >> >> >    at org.apache.flink.runtime.concurrent.FutureUtils$Timeout.run(FutureUtils.java:999)
> >> >> >    at org.apache.flink.runtime.concurrent.DirectExecutorService.execute(DirectExecutorService.java:211)
> >> >> >    at org.apache.flink.runtime.concurrent.FutureUtils.lambda$orTimeout$14(FutureUtils.java:427)
> >> >> >    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> >> >> >    at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> >> >> >    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
> >> >> >    at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
> >> >> >    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> >> >> >    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> >> >> >    at java.lang.Thread.run(Thread.java:748)
> >> >> >Caused by: java.util.concurrent.TimeoutException
> >> >
> >> >--
> >> >Best regards!
> >> >Rui Li
> >
> >--
> >Best regards!
> >Rui Li