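Hello,
One pod in our production Flink cluster reported an error while updating a ConfigMap. We run two pods for Kubernetes native high availability.
pod1 log (this pod was also the leader pod recorded in the ConfigMap at the time, IP: 10.20.0.39):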
2021-04-15 20:42:26,058 INFO
org.apache.flink.kubernetes.kubeclient.resources.KubernetesLeaderElector []
- New leader elected 7d4a9b5c-39aa-4103-963b-eaf24ea6435a for
tuiwen-flink-restserver-leader.
2021-04-15 20:42:26,069 INFO
org.apache.flink.runtime.rpc.akka.AkkaRpcService [] - Starting
RPC endpoint for
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager at
akka://flink/user/rpc/resourcemanager_0 .
2021-04-15 20:42:26,069 INFO
org.apache.flink.runtime.dispatcher.DispatcherRestEndpoint [] -
http://10.20.0.39:8081 was granted leadership with
leaderSessionID=a314d756-aa7c-4be4-a2a0-14267465d648
2021-04-15 20:42:26,261 INFO
org.apache.flink.kubernetes.kubeclient.resources.KubernetesLeaderElector []
- Create KubernetesLeaderElector tuiwen-flink-dispatcher-leader with lock
identity 7d4a9b5c-39aa-4103-963b-eaf24ea6435a.
2021-04-15 20:42:26,660 INFO
org.apache.flink.kubernetes.kubeclient.resources.KubernetesLeaderElector []
- New leader elected 6b1aac24-cf40-4aac-bb50-6812290a1f34 for
tuiwen-flink-dispatcher-leader.
2021-04-15 20:42:26,765 INFO
org.apache.flink.runtime.leaderelection.DefaultLeaderElectionService [] -
Starting DefaultLeaderElectionService with
KubernetesLeaderElectionDriver{configMapName='tuiwen-flink-dispatcher-leader'}.
2021-04-15 20:42:26,960 INFO
org.apache.flink.runtime.leaderretrieval.DefaultLeaderRetrievalService [] -
Starting DefaultLeaderRetrievalService with
KubernetesLeaderRetrievalDriver{configMapName='tuiwen-flink-resourcemanager-leader'}.
2021-04-15 20:42:27,258 INFO
org.apache.flink.runtime.leaderretrieval.DefaultLeaderRetrievalService [] -
Starting DefaultLeaderRetrievalService with
KubernetesLeaderRetrievalDriver{configMapName='tuiwen-flink-dispatcher-leader'}.
2021-04-15 20:42:30,457 INFO
org.apache.flink.kubernetes.KubernetesResourceManagerDriver [] - Recovered
2 pods from previous attempts, current attempt id is 2.
2021-04-15 20:42:30,458 INFO
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager [] -
Recovered 2 workers from previous attempt.
2021-04-15 20:42:30,458 INFO
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager [] -
Worker tuiwen-flink-taskmanager-1-12 recovered from previous attempt.
2021-04-15 20:42:30,458 INFO
org.apache.flink.runtime.resourcemanager.active.ActiveResourceManager [] -
Worker tuiwen-flink-taskmanager-1-2 recovered from previous attempt.
2021-04-15 20:42:30,458 INFO
org.apache.flink.kubernetes.kubeclient.resources.KubernetesLeaderElector []
- Create KubernetesLeaderElector tuiwen-flink-resourcemanager-leader with
lock identity 7d4a9b5c-39aa-4103-963b-eaf24ea6435a.
2021-04-15 20:42:30,959 INFO
org.apache.flink.runtime.leaderelection.DefaultLeaderElectionService [] -
Starting DefaultLeaderElectionService with
KubernetesLeaderElectionDriver{configMapName='tuiwen-flink-resourcemanager-leader'}.
2021-04-15 20:42:30,978 INFO
org.apache.flink.kubernetes.kubeclient.resources.KubernetesLeaderElector []
- New leader elected 6b1aac24-cf40-4aac-bb50-6812290a1f34 for
tuiwen-flink-resourcemanager-leader.
2021-04-15 23:11:15,866 WARN
org.apache.flink.runtime.webmonitor.retriever.impl.RpcGatewayRetriever [] -
Error while retrieving the leader gateway. Retrying to connect to
akka.tcp://flink@10.20.0.39:6123/user/rpc/dispatcher_1.
2021-04-15 23:11:30,626 WARN
org.apache.flink.runtime.webmonitor.retriever.impl.RpcGatewayRetriever [] -
Error while retrieving the leader gateway. Retrying to connect to
akka.tcp://flink@10.20.0.39:6123/user/rpc/dispatcher_1.
2021-04-15 23:11:32,438 WARN
org.apache.flink.runtime.webmonitor.retriever.impl.RpcGatewayRetriever [] -
Error while retrieving the leader gateway. Retrying to connect to
akka.tcp://flink@10.20.0.39:6123/user/rpc/dispatcher_1.
2021-04-15 23:11:33,325 WARN
org.apache.flink.runtime.webmonitor.retriever.impl.RpcGatewayRetriever [] -
Error while retrieving the leader gateway. Retrying to connect to
akka.tcp://flink@10.20.0.39:6123/user/rpc/dispatcher_1.
2021-04-15 23:11:35,948 WARN
org.apache.flink.runtime.webmonitor.retriever.impl.RpcGatewayRetriever [] -
Error while retrieving the leader gateway. Retrying to connect to
akka.tcp://flink@10.20.0.39:6123/user/rpc/dispatcher_1.
2021-04-15 23:11:39,387 WARN
org.apache.flink.runtime.webmonitor.retriever.impl.RpcGatewayRetriever [] -
Error while retrieving the leader gateway. Retrying to connect to
akka.tcp://flink@10.20.0.39:6123/user/rpc/dispatcher_1.
2021-04-15 23:11:40,336 WARN
org.apache.flink.runtime.webmonitor.retriever.impl.RpcGatewayRetriever [] -
Error while retrieving the leader gateway. Retrying to connect to
akka.tcp://flink@10.20.0.39:6123/user/rpc/dispatcher_1.
2021-04-15 23:11:41,485 WARN
org.apache.flink.runtime.webmonitor.retriever.impl.RpcGatewayRetriever [] -
Error while retrieving the leader gateway. Retrying to connect to