Juha created FLINK-24496:
----------------------------
Summary: Querying job details in the REST API fails
Key: FLINK-24496
URL: https://issues.apache.org/jira/browse/FLINK-24496
Project: Flink
Issue Type: Bug
Components: Runtime / Web Frontend
Affects Versions: 1.14.0
Reporter: Juha
Hello,
In my setup there are three servers in a standalone Flink cluster running
Flink 1.14.0 with ZooKeeper HA services. Each server runs one TaskManager, one
JobManager and ZooKeeper. All the servers have just been started and one
simple job has been deployed.
One of the JobManagers is the leading JobManager.
If I query (curl) the leading JobManager with /v1/jobs/<job_id>, the response
is correct.
But if I query a JobManager that is not the leading JobManager, the HTTP
request fails and the following can be seen in the logs.
{code}
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: [2021-10-08 09:28:09,998]
ERROR Unhandled exception.
(org.apache.flink.runtime.rest.handler.job.metrics.JobVertexWatermarksHandler:260)
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]:
java.util.concurrent.CancellationException: null
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
java.util.concurrent.CompletableFuture.cancel(CompletableFuture.java:2396)
~[?:?]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.runtime.rest.handler.legacy.DefaultExecutionGraphCache.getExecutionGraphInternal(DefaultExecutionGraphCache.java:98)
~[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.runtime.rest.handler.legacy.DefaultExecutionGraphCache.getExecutionGraphInfo(DefaultExecutionGraphCache.java:67)
~[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.runtime.rest.handler.job.AbstractExecutionGraphHandler.handleRequest(AbstractExecutionGraphHandler.java:81)
~[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.runtime.rest.handler.AbstractRestHandler.respondToRequest(AbstractRestHandler.java:83)
~[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.runtime.rest.handler.AbstractHandler.respondAsLeader(AbstractHandler.java:195)
~[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.runtime.rest.handler.LeaderRetrievalHandler.lambda$channelRead0$0(LeaderRetrievalHandler.java:83)
~[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
java.util.Optional.ifPresent(Optional.java:183) [?:?]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.util.OptionalConsumer.ifPresent(OptionalConsumer.java:45)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.runtime.rest.handler.LeaderRetrievalHandler.channelRead0(LeaderRetrievalHandler.java:80)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.runtime.rest.handler.LeaderRetrievalHandler.channelRead0(LeaderRetrievalHandler.java:49)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.SimpleChannelInboundHandler.channelRead(SimpleChannelInboundHandler.java:99)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:379)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:365)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:357)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.runtime.rest.handler.router.RouterHandler.routed(RouterHandler.java:115)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.runtime.rest.handler.router.RouterHandler.channelRead0(RouterHandler.java:94)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.runtime.rest.handler.router.RouterHandler.channelRead0(RouterHandler.java:55)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.SimpleChannelInboundHandler.channelRead(SimpleChannelInboundHandler.java:99)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:379)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:365)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:357)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.handler.codec.MessageToMessageDecoder.channelRead(MessageToMessageDecoder.java:103)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:379)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:365)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:357)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.runtime.rest.FileUploadHandler.channelRead0(FileUploadHandler.java:238)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.runtime.rest.FileUploadHandler.channelRead0(FileUploadHandler.java:71)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.SimpleChannelInboundHandler.channelRead(SimpleChannelInboundHandler.java:99)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:379)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:365)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:357)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.CombinedChannelDuplexHandler$DelegatingChannelHandlerContext.fireChannelRead(CombinedChannelDuplexHandler.java:436)
[flink-dist_2>
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.handler.codec.ByteToMessageDecoder.fireChannelRead(ByteToMessageDecoder.java:324)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.handler.codec.ByteToMessageDecoder.channelRead(ByteToMessageDecoder.java:296)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.CombinedChannelDuplexHandler.channelRead(CombinedChannelDuplexHandler.java:251)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:09 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:379)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:10 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:365)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:10 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:357)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:10 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.DefaultChannelPipeline$HeadContext.channelRead(DefaultChannelPipeline.java:1410)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:10 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:379)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:10 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:365)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:10 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.DefaultChannelPipeline.fireChannelRead(DefaultChannelPipeline.java:919)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:10 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.nio.AbstractNioByteChannel$NioByteUnsafe.read(AbstractNioByteChannel.java:166)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:10 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.processSelectedKey(NioEventLoop.java:719)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:10 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.processSelectedKeysOptimized(NioEventLoop.java:655)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:10 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.processSelectedKeys(NioEventLoop.java:581)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:10 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.channel.nio.NioEventLoop.run(NioEventLoop.java:493)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:10 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.util.concurrent.SingleThreadEventExecutor$4.run(SingleThreadEventExecutor.java:989)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:10 flink-perf-vy5z2cda-2 java[108]: at
org.apache.flink.shaded.netty4.io.netty.util.internal.ThreadExecutorMap$2.run(ThreadExecutorMap.java:74)
[flink-dist_2.12-1.14.0.jar:1.14.0]
Oct 08 09:28:10 flink-perf-vy5z2cda-2 java[108]: at
java.lang.Thread.run(Thread.java:829) [?:?]
{code}
I'm not sure whether this always happens, but I set up two separate Flink
clusters and reproduced the issue on both.
As far as I know the job is running fine.
The issue doesn't exist on 1.13.0 or 1.13.2.
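The check described above (requesting /v1/jobs/<job_id> from every JobManager and comparing the results) can be sketched as a small script. This is only an illustration: the host names and job id in the usage comment are hypothetical placeholders, port 8081 assumes the default REST port, and the helper names are made up for this sketch.

```python
import urllib.error
import urllib.request


def job_details_url(host, job_id, port=8081):
    """Build the job details URL for one JobManager (port 8081 is assumed)."""
    return f"http://{host}:{port}/v1/jobs/{job_id}"


def http_status(url, timeout=10):
    """Return the HTTP status code for url, or an error string if the request fails."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as exc:
        # The server answered, but with a 4xx/5xx status.
        return exc.code
    except OSError as exc:
        # Connection refused, timeout, DNS failure, etc.
        return f"error: {exc}"


def probe(hosts, job_id, fetch=http_status, port=8081):
    """Query the job details endpoint on every host and map host -> result."""
    return {host: fetch(job_details_url(host, job_id, port)) for host in hosts}


# Usage (hypothetical host names and job id):
#   for host, result in probe(["jm-1", "jm-2", "jm-3"], "<job_id>").items():
#       print(host, result)
```

With the behaviour reported here, the leading JobManager would be expected to return 200 and the others an error status, but the exact status code returned in the failing case is not stated above.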
Regards
Juha