Hi Till, This is the taskManager log As you see, the logs print ‘line 92 -- Could not connect to flink-jobmanager:6123’ then print ‘line 128 --Could not resolve ResourceManager address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*, retrying in 10000 ms: Could not connect to rpc endpoint under address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*.’ And repeat print this
A few minutes later, the taskmanger shut down and restart This is my yaml files, could u help me to confirm did I omitted something? Thanks a lot! --------------------------------------------------- flink-configuration-configmap.yaml apiVersion: v1 kind: ConfigMap metadata: name: flink-config labels: app: flink data: flink-conf.yaml: |+ jobmanager.rpc.address: flink-jobmanager taskmanager.numberOfTaskSlots: 1 blob.server.port: 6124 jobmanager.rpc.port: 6123 taskmanager.rpc.port: 6122 queryable-state.proxy.ports: 6125 jobmanager.memory.process.size: 1024m taskmanager.memory.process.size: 1024m parallelism.default: 1 log4j-console.properties: |+ rootLogger.level = INFO rootLogger.appenderRef.console.ref = ConsoleAppender rootLogger.appenderRef.rolling.ref = RollingFileAppender logger.akka.name = akka logger.akka.level = INFO logger.kafka.name= org.apache.kafka logger.kafka.level = INFO logger.hadoop.name = org.apache.hadoop logger.hadoop.level = INFO logger.zookeeper.name = org.apache.zookeeper logger.zookeeper.level = INFO appender.console.name = ConsoleAppender appender.console.type = CONSOLE appender.console.layout.type = PatternLayout appender.console.layout.pattern = %d{yyyy-MM-dd HH:mm:ss,SSS} %-5p %-60c %x - %m%n appender.rolling.name = RollingFileAppender appender.rolling.type = RollingFile appender.rolling.append = false appender.rolling.fileName = ${sys:log.file} appender.rolling.filePattern = ${sys:log.file}.%i appender.rolling.layout.type = PatternLayout appender.rolling.layout.pattern = %d{yyyy-MM-dd HH:mm:ss,SSS} %-5p %-60c %x - %m%n appender.rolling.policies.type = Policies appender.rolling.policies.size.type = SizeBasedTriggeringPolicy appender.rolling.policies.size.size=100MB appender.rolling.strategy.type = DefaultRolloverStrategy appender.rolling.strategy.max = 10 logger.netty.name = org.apache.flink.shaded.akka.org.jboss.netty.channel.DefaultChannelPipeline logger.netty.level = OFF --------------------------------------------------- jobmanager-service.yaml apiVersion: v1 kind: Service metadata: name: flink-jobmanager spec: type: ClusterIP ports: - name: rpc port: 6123 - name: blob-server port: 6124 - name: webui port: 8081 selector: app: flink component: jobmanager -------------------------------------------------- jobmanager-session-deployment.yaml apiVersion: apps/v1 kind: Deployment metadata: name: flink-jobmanager spec: replicas: 1 selector: matchLabels: app: flink component: jobmanager template: metadata: labels: app: flink component: jobmanager spec: containers: - name: jobmanager image: registry.cn-hangzhou.aliyuncs.com/superainbower/flink:1.11.1 args: ["jobmanager"] ports: - containerPort: 6123 name: rpc - containerPort: 6124 name: blob-server - containerPort: 8081 name: webui livenessProbe: tcpSocket: port: 6123 initialDelaySeconds: 30 periodSeconds: 60 volumeMounts: - name: flink-config-volume mountPath: /opt/flink/conf securityContext: runAsUser: 9999 # refers to user _flink_ from official flink image, change if necessary volumes: - name: flink-config-volume configMap: name: flink-config items: - key: flink-conf.yaml path: flink-conf.yaml - key: log4j-console.properties path: log4j-console.properties imagePullSecrets: - name: regcred --------------------------------------------------- taskmanager-session-deployment.yaml apiVersion: apps/v1 kind: Deployment metadata: name: flink-taskmanager spec: replicas: 1 selector: matchLabels: app: flink component: taskmanager template: metadata: labels: app: flink component: taskmanager spec: containers: - name: taskmanager image: registry.cn-hangzhou.aliyuncs.com/superainbower/flink:1.11.1 args: ["taskmanager"] ports: - containerPort: 6122 name: rpc - containerPort: 6125 name: query-state livenessProbe: tcpSocket: port: 6122 initialDelaySeconds: 30 periodSeconds: 60 volumeMounts: - name: flink-config-volume mountPath: /opt/flink/conf/ securityContext: runAsUser: 9999 # refers to user _flink_ from official flink image, change if necessary volumes: - name: flink-config-volume configMap: name: flink-config items: - key: flink-conf.yaml path: flink-conf.yaml - key: log4j-console.properties path: log4j-console.properties imagePullSecrets: - name: regcred | | superainbower | | superainbo...@163.com | 签名由网易邮箱大师定制 On 09/2/2020 20:38,Till Rohrmann<trohrm...@apache.org> wrote: Hmm, this is indeed strange. Could you share the logs of the TaskManager with us? Ideally you set the log level to debug. Thanks a lot. Cheers, Till On Wed, Sep 2, 2020 at 12:45 PM art <superainbo...@163.com> wrote: Hi Till, The full information when I run command ' kubectl get all’ like this: NAME READY STATUS RESTARTS AGE pod/flink-jobmanager-85bdbd98d8-ppjmf 1/1 Running 0 2m34s pod/flink-taskmanager-74c68c6f48-6jb5v 1/1 Running 0 2m34s NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE service/flink-jobmanager ClusterIP 10.103.207.75 <none> 6123/TCP,6124/TCP,8081/TCP 2m34s service/kubernetes ClusterIP 10.96.0.1 <none> 443/TCP 5d2h NAME READY UP-TO-DATE AVAILABLE AGE deployment.apps/flink-jobmanager 1/1 1 1 2m34s deployment.apps/flink-taskmanager 1/1 1 1 2m34s NAME DESIRED CURRENT READY AGE replicaset.apps/flink-jobmanager-85bdbd98d8 1 1 1 2m34s replicaset.apps/flink-taskmanager-74c68c6f48 1 1 1 2m34s And I can open flink ui but the task manger is 0 ,so the job manger is work well I think the problem is taksmanger can not register itself to jobmanger, did I miss some configure? 在 2020年9月2日,下午5:24,Till Rohrmann <trohrm...@apache.org> 写道: Hi art, could you check what `kubectl get services` returns? Usually if you run `kubectl get all` you should also see the services. But in your case there are no services listed. You have see something like service/flink-jobmanager otherwise the flink-jobmanager service (K8s service) is not running. Cheers, Till On Wed, Sep 2, 2020 at 11:15 AM art <superainbo...@163.com> wrote: Hi Till, I’m sure the job manager-service is started, I can find it in Kubernetes DashBoard When I run command ' kubectl get deployment’ I can got this: flink-jobmanager 1/1 1 1 33s flink-taskmanager 1/1 1 1 33s When I run command ' kubectl get all’ I can got this: NAME READY STATUS RESTARTS AGE pod/flink-jobmanager-85bdbd98d8-ppjmf 1/1 Running 0 2m34s pod/flink-taskmanager-74c68c6f48-6jb5v 1/1 Running 0 2m34s So, I think flink-jobmanager works well, but taskmannger is restarted every few minutes My minikube version: v1.12.3 Flink version:v1.11.1 在 2020年9月2日,下午4:27,Till Rohrmann <trohrm...@apache.org> 写道: Hi art, could you verify that the jobmanager-service has been started? It looks as if the name flink-jobmanager is not resolvable. It could also help to know the Minikube and K8s version you are using. Cheers, Till On Wed, Sep 2, 2020 at 9:50 AM art <superainbo...@163.com> wrote: Hi,I’m going to deploy flink on minikube referring to https://ci.apache.org/projects/flink/flink-docs-release-1.11/zh/ops/deployment/kubernetes.html; kubectl create -f flink-configuration-configmap.yaml kubectl create -f jobmanager-service.yaml kubectl create -f jobmanager-session-deployment.yaml kubectl create -f taskmanager-session-deployment.yaml But I got this 2020-09-02 06:45:42,664 WARN akka.remote.ReliableDeliverySupervisor [] - Association with remote system [akka.tcp://flink@flink-jobmanager:6123] has failed, address is now gated for [50] ms. Reason: [Association failed with [akka.tcp://flink@flink-jobmanager:6123]] Caused by: [java.net.UnknownHostException: flink-jobmanager: Temporary failure in name resolution] 2020-09-02 06:45:42,691 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Could not resolve ResourceManager address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*, retrying in 10000 ms: Could not connect to rpc endpoint under address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*. 2020-09-02 06:46:02,731 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Could not resolve ResourceManager address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*, retrying in 10000 ms: Could not connect to rpc endpoint under address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*. 2020-09-02 06:46:12,731 INFO akka.remote.transport.ProtocolStateActor [] - No response from remote for outbound association. Associate timed out after [20000 ms]. And when I run the command 'kubectl exec -ti flink-taskmanager-74c68c6f48-9tkvd -- /bin/bash’ && ‘ping flink-jobmanager’ , I find I cannot ping flink-jobmanager from taskmanager I am new to k8s, can anyone give me some tutorial? Thanks a lot !
2020-09-03 00:44:04,081 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -------------------------------------------------------------------------------- 2020-09-03 00:44:04,085 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - Preconfiguration: 2020-09-03 00:44:04,085 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - TM_RESOURCE_PARAMS extraction logs: jvm_params: -Xmx161061270 -Xms161061270 -XX:MaxDirectMemorySize=201326592 -XX:MaxMetaspaceSize=268435456 dynamic_configs: -D taskmanager.memory.framework.off-heap.size=134217728b -D taskmanager.memory.network.max=67108864b -D taskmanager.memory.network.min=67108864b -D taskmanager.memory.framework.heap.size=134217728b -D taskmanager.memory.managed.size=241591914b -D taskmanager.cpu.cores=1.0 -D taskmanager.memory.task.heap.size=26843542b -D taskmanager.memory.task.off-heap.size=0b logs: INFO [] - Loading configuration property: jobmanager.rpc.address, flink-jobmanager INFO [] - Loading configuration property: taskmanager.numberOfTaskSlots, 1 INFO [] - Loading configuration property: blob.server.port, 6124 INFO [] - Loading configuration property: jobmanager.rpc.port, 6123 INFO [] - Loading configuration property: taskmanager.rpc.port, 6122 INFO [] - Loading configuration property: queryable-state.proxy.ports, 6125 INFO [] - Loading configuration property: jobmanager.memory.process.size, 1024m INFO [] - Loading configuration property: taskmanager.memory.process.size, 1024m INFO [] - Loading configuration property: parallelism.default, 1 INFO [] - The derived from fraction jvm overhead memory (102.400mb (107374184 bytes)) is less than its min value 192.000mb (201326592 bytes), min value will be used instead INFO [] - The derived from fraction network memory (57.600mb (60397978 bytes)) is less than its min value 64.000mb (67108864 bytes), min value will be used instead INFO [] - Final TaskExecutor Memory configuration: INFO [] - Total Process Memory: 1024.000mb (1073741824 bytes) INFO [] - Total Flink Memory: 576.000mb (603979776 bytes) INFO [] - Total JVM Heap Memory: 153.600mb (161061270 bytes) INFO [] - Framework: 128.000mb (134217728 bytes) INFO [] - Task: 25.600mb (26843542 bytes) INFO [] - Total Off-heap Memory: 422.400mb (442918506 bytes) INFO [] - Managed: 230.400mb (241591914 bytes) INFO [] - Total JVM Direct Memory: 192.000mb (201326592 bytes) INFO [] - Framework: 128.000mb (134217728 bytes) INFO [] - Task: 0 bytes INFO [] - Network: 64.000mb (67108864 bytes) INFO [] - JVM Metaspace: 256.000mb (268435456 bytes) INFO [] - JVM Overhead: 192.000mb (201326592 bytes) 2020-09-03 00:44:04,086 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -------------------------------------------------------------------------------- 2020-09-03 00:44:04,086 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - Starting TaskManager (Version: 1.11.1, Scala: 2.12, Rev:7eb514a, Date:2020-07-15T07:02:09+02:00) 2020-09-03 00:44:04,087 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - OS current user: flink 2020-09-03 00:44:04,087 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - Current Hadoop/Kerberos user: <no hadoop dependency found> 2020-09-03 00:44:04,087 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - JVM: OpenJDK 64-Bit Server VM - Oracle Corporation - 1.8/25.265-b01 2020-09-03 00:44:04,087 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - Maximum heap size: 154 MiBytes 2020-09-03 00:44:04,087 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - JAVA_HOME: /usr/local/openjdk-8 2020-09-03 00:44:04,087 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - No Hadoop Dependency available 2020-09-03 00:44:04,087 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - JVM Options: 2020-09-03 00:44:04,087 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -XX:+UseG1GC 2020-09-03 00:44:04,088 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -Xmx161061270 2020-09-03 00:44:04,088 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -Xms161061270 2020-09-03 00:44:04,088 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -XX:MaxDirectMemorySize=201326592 2020-09-03 00:44:04,088 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -XX:MaxMetaspaceSize=268435456 2020-09-03 00:44:04,088 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -Dlog.file=/opt/flink/log/flink--taskexecutor-0-flink-taskmanager-74c68c6f48-pbnwc.log 2020-09-03 00:44:04,088 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -Dlog4j.configuration=file:/opt/flink/conf/log4j-console.properties 2020-09-03 00:44:04,088 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -Dlog4j.configurationFile=file:/opt/flink/conf/log4j-console.properties 2020-09-03 00:44:04,088 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -Dlogback.configurationFile=file:/opt/flink/conf/logback-console.xml 2020-09-03 00:44:04,088 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - Program Arguments: 2020-09-03 00:44:04,088 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - --configDir 2020-09-03 00:44:04,088 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - /opt/flink/conf 2020-09-03 00:44:04,089 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -D 2020-09-03 00:44:04,090 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - taskmanager.memory.framework.off-heap.size=134217728b 2020-09-03 00:44:04,090 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -D 2020-09-03 00:44:04,090 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - taskmanager.memory.network.max=67108864b 2020-09-03 00:44:04,090 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -D 2020-09-03 00:44:04,090 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - taskmanager.memory.network.min=67108864b 2020-09-03 00:44:04,090 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -D 2020-09-03 00:44:04,090 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - taskmanager.memory.framework.heap.size=134217728b 2020-09-03 00:44:04,090 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -D 2020-09-03 00:44:04,090 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - taskmanager.memory.managed.size=241591914b 2020-09-03 00:44:04,090 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -D 2020-09-03 00:44:04,090 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - taskmanager.cpu.cores=1.0 2020-09-03 00:44:04,091 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -D 2020-09-03 00:44:04,092 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - taskmanager.memory.task.heap.size=26843542b 2020-09-03 00:44:04,092 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -D 2020-09-03 00:44:04,092 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - taskmanager.memory.task.off-heap.size=0b 2020-09-03 00:44:04,092 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - Classpath: /opt/flink/lib/flink-connector-jdbc_2.11-1.11.1.jar:/opt/flink/lib/flink-csv-1.11.1.jar:/opt/flink/lib/flink-format-changelog-json-1.0.0.jar:/opt/flink/lib/flink-json-1.11.1.jar:/opt/flink/lib/flink-shaded-zookeeper-3.4.14.jar:/opt/flink/lib/flink-sql-connector-elasticsearch7_2.11-1.11.1.jar:/opt/flink/lib/flink-sql-connector-kafka-0.11_2.11-1.11.1.jar:/opt/flink/lib/flink-sql-connector-mysql-cdc-1.1.0.jar:/opt/flink/lib/flink-table-blink_2.12-1.11.1.jar:/opt/flink/lib/flink-table_2.12-1.11.1.jar:/opt/flink/lib/log4j-1.2-api-2.12.1.jar:/opt/flink/lib/log4j-api-2.12.1.jar:/opt/flink/lib/log4j-core-2.12.1.jar:/opt/flink/lib/log4j-slf4j-impl-2.12.1.jar:/opt/flink/lib/mysql-connector-java-5.1.26.jar:/opt/flink/lib/flink-dist_2.12-1.11.1.jar::: 2020-09-03 00:44:04,092 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - -------------------------------------------------------------------------------- 2020-09-03 00:44:04,096 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - Registered UNIX signal handlers for [TERM, HUP, INT] 2020-09-03 00:44:04,100 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - Maximum number of open file descriptors is 1048576. 2020-09-03 00:44:04,111 INFO org.apache.flink.configuration.GlobalConfiguration [] - Loading configuration property: jobmanager.rpc.address, flink-jobmanager 2020-09-03 00:44:04,111 INFO org.apache.flink.configuration.GlobalConfiguration [] - Loading configuration property: taskmanager.numberOfTaskSlots, 1 2020-09-03 00:44:04,111 INFO org.apache.flink.configuration.GlobalConfiguration [] - Loading configuration property: blob.server.port, 6124 2020-09-03 00:44:04,111 INFO org.apache.flink.configuration.GlobalConfiguration [] - Loading configuration property: jobmanager.rpc.port, 6123 2020-09-03 00:44:04,112 INFO org.apache.flink.configuration.GlobalConfiguration [] - Loading configuration property: taskmanager.rpc.port, 6122 2020-09-03 00:44:04,112 INFO org.apache.flink.configuration.GlobalConfiguration [] - Loading configuration property: queryable-state.proxy.ports, 6125 2020-09-03 00:44:04,112 INFO org.apache.flink.configuration.GlobalConfiguration [] - Loading configuration property: jobmanager.memory.process.size, 1024m 2020-09-03 00:44:04,112 INFO org.apache.flink.configuration.GlobalConfiguration [] - Loading configuration property: taskmanager.memory.process.size, 1024m 2020-09-03 00:44:04,112 INFO org.apache.flink.configuration.GlobalConfiguration [] - Loading configuration property: parallelism.default, 1 2020-09-03 00:44:04,193 INFO org.apache.flink.core.fs.FileSystem [] - Hadoop is not in the classpath/dependencies. The extended set of supported File Systems via Hadoop is not available. 2020-09-03 00:44:04,240 INFO org.apache.flink.runtime.security.modules.HadoopModuleFactory [] - Cannot create Hadoop Security Module because Hadoop cannot be found in the Classpath. 2020-09-03 00:44:04,248 INFO org.apache.flink.runtime.security.modules.JaasModule [] - Jaas file will be created as /tmp/jaas-5988359564689122894.conf. 2020-09-03 00:44:04,258 INFO org.apache.flink.runtime.security.contexts.HadoopSecurityContextFactory [] - Cannot install HadoopSecurityContext because Hadoop cannot be found in the Classpath. 2020-09-03 00:44:04,377 INFO org.apache.flink.configuration.Configuration [] - Config uses fallback configuration key 'jobmanager.rpc.address' instead of key 'rest.address' 2020-09-03 00:44:04,390 INFO org.apache.flink.runtime.util.LeaderRetrievalUtils [] - Trying to select the network interface and address to use by connecting to the leading JobManager. 2020-09-03 00:44:04,390 INFO org.apache.flink.runtime.util.LeaderRetrievalUtils [] - TaskManager will try to connect for PT10S before falling back to heuristics 2020-09-03 00:44:24,583 WARN org.apache.flink.runtime.net.ConnectionUtils [] - Could not connect to flink-jobmanager:6123. Selecting a local address using heuristics. 2020-09-03 00:44:24,583 WARN org.apache.flink.runtime.net.ConnectionUtils [] - Could not find any IPv4 address that is not loopback or link-local. Using localhost address. 2020-09-03 00:44:24,583 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - TaskManager will use hostname/address 'flink-taskmanager-74c68c6f48-pbnwc' (172.18.0.6) for communication. 2020-09-03 00:44:24,591 INFO org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils [] - Trying to start actor system, external address 172.18.0.6:6122, bind address 0.0.0.0:6122. 2020-09-03 00:44:25,150 INFO akka.event.slf4j.Slf4jLogger [] - Slf4jLogger started 2020-09-03 00:44:25,179 INFO akka.remote.Remoting [] - Starting remoting 2020-09-03 00:44:25,312 INFO akka.remote.Remoting [] - Remoting started; listening on addresses :[akka.tcp://flink@172.18.0.6:6122] 2020-09-03 00:44:25,413 INFO org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils [] - Actor system started at akka.tcp://flink@172.18.0.6:6122 2020-09-03 00:44:25,438 INFO org.apache.flink.runtime.metrics.MetricRegistryImpl [] - No metrics reporter configured, no metrics will be exposed/reported. 2020-09-03 00:44:25,440 INFO org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils [] - Trying to start actor system, external address 172.18.0.6:0, bind address 0.0.0.0:0. 2020-09-03 00:44:25,466 INFO akka.event.slf4j.Slf4jLogger [] - Slf4jLogger started 2020-09-03 00:44:25,471 INFO akka.remote.Remoting [] - Starting remoting 2020-09-03 00:44:25,480 INFO akka.remote.Remoting [] - Remoting started; listening on addresses :[akka.tcp://flink-metrics@172.18.0.6:35567] 2020-09-03 00:44:25,488 INFO org.apache.flink.runtime.rpc.akka.AkkaRpcServiceUtils [] - Actor system started at akka.tcp://flink-metrics@172.18.0.6:35567 2020-09-03 00:44:25,505 INFO org.apache.flink.runtime.rpc.akka.AkkaRpcService [] - Starting RPC endpoint for org.apache.flink.runtime.metrics.dump.MetricQueryService at akka://flink-metrics/user/rpc/MetricQueryService_352f749e35037b5b13b9e7e500fd7752 . 2020-09-03 00:44:25,516 INFO org.apache.flink.runtime.blob.PermanentBlobCache [] - Created BLOB cache storage directory /tmp/blobStore-ed51a2f8-e52c-4233-a155-ef7fa8fcd60d 2020-09-03 00:44:25,519 INFO org.apache.flink.runtime.blob.TransientBlobCache [] - Created BLOB cache storage directory /tmp/blobStore-e8edeb70-a5eb-402d-8075-d497c8728f4b 2020-09-03 00:44:25,521 INFO org.apache.flink.runtime.externalresource.ExternalResourceUtils [] - Enabled external resources: [] 2020-09-03 00:44:25,521 INFO org.apache.flink.runtime.externalresource.ExternalResourceUtils [] - Enabled external resources: [] 2020-09-03 00:44:25,521 INFO org.apache.flink.runtime.taskexecutor.TaskManagerRunner [] - Starting TaskManager with ResourceID: 352f749e35037b5b13b9e7e500fd7752 2020-09-03 00:44:25,539 INFO org.apache.flink.runtime.taskexecutor.TaskManagerServices [] - Temporary file directory '/tmp': total 39 GB, usable 25 GB (64.10% usable) 2020-09-03 00:44:25,541 INFO org.apache.flink.runtime.io.disk.FileChannelManagerImpl [] - FileChannelManager uses directory /tmp/flink-io-1d809880-4df8-452f-9f5b-63f6bb2a6bed for spill files. 2020-09-03 00:44:25,548 INFO org.apache.flink.runtime.io.network.netty.NettyConfig [] - NettyConfig [server address: /0.0.0.0, server port: 0, ssl enabled: false, memory segment size (bytes): 32768, transport type: AUTO, number of server threads: 1 (manual), number of client threads: 1 (manual), server connect backlog: 0 (use Netty's default), client connect timeout (sec): 120, send/receive buffer size (bytes): 0 (use Netty's default)] 2020-09-03 00:44:25,549 INFO org.apache.flink.runtime.io.disk.FileChannelManagerImpl [] - FileChannelManager uses directory /tmp/flink-netty-shuffle-7f397d76-b448-4b90-b807-8a95bf970f32 for spill files. 2020-09-03 00:44:25,630 INFO org.apache.flink.runtime.io.network.buffer.NetworkBufferPool [] - Allocated 64 MB for network buffer pool (number of memory segments: 2048, bytes per segment: 32768). 2020-09-03 00:44:25,635 INFO org.apache.flink.runtime.io.network.NettyShuffleEnvironment [] - Starting the network environment and its components. 2020-09-03 00:44:25,699 INFO org.apache.flink.runtime.io.network.netty.NettyClient [] - Transport type 'auto': using EPOLL. 2020-09-03 00:44:25,701 INFO org.apache.flink.runtime.io.network.netty.NettyClient [] - Successful initialization (took 65 ms). 2020-09-03 00:44:25,705 INFO org.apache.flink.runtime.io.network.netty.NettyServer [] - Transport type 'auto': using EPOLL. 2020-09-03 00:44:25,739 INFO org.apache.flink.runtime.io.network.netty.NettyServer [] - Successful initialization (took 36 ms). Listening on SocketAddress /0.0.0.0:35573. 2020-09-03 00:44:25,740 INFO org.apache.flink.runtime.taskexecutor.KvStateService [] - Starting the kvState service and its components. 2020-09-03 00:44:25,758 INFO org.apache.flink.runtime.rpc.akka.AkkaRpcService [] - Starting RPC endpoint for org.apache.flink.runtime.taskexecutor.TaskExecutor at akka://flink/user/rpc/taskmanager_0 . 2020-09-03 00:44:25,770 INFO org.apache.flink.runtime.taskexecutor.DefaultJobLeaderService [] - Start job leader service. 2020-09-03 00:44:25,771 INFO org.apache.flink.runtime.filecache.FileCache [] - User file cache uses directory /tmp/flink-dist-cache-0116ac9a-98a8-4599-a00c-385fc537134b 2020-09-03 00:44:25,772 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Connecting to ResourceManager akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*(00000000000000000000000000000000). 2020-09-03 00:44:25,857 WARN akka.remote.ReliableDeliverySupervisor [] - Association with remote system [akka.tcp://flink@flink-jobmanager:6123] has failed, address is now gated for [50] ms. Reason: [Association failed with [akka.tcp://flink@flink-jobmanager:6123]] Caused by: [java.net.UnknownHostException: flink-jobmanager] 2020-09-03 00:44:25,874 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Could not resolve ResourceManager address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*, retrying in 10000 ms: Could not connect to rpc endpoint under address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*. 2020-09-03 00:44:45,916 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Could not resolve ResourceManager address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*, retrying in 10000 ms: Could not connect to rpc endpoint under address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*. 2020-09-03 00:44:55,918 INFO akka.remote.transport.ProtocolStateActor [] - No response from remote for outbound association. Associate timed out after [20000 ms]. 2020-09-03 00:44:55,919 WARN akka.remote.ReliableDeliverySupervisor [] - Association with remote system [akka.tcp://flink@flink-jobmanager:6123] has failed, address is now gated for [50] ms. Reason: [Association failed with [akka.tcp://flink@flink-jobmanager:6123]] Caused by: [No response from remote for outbound association. Associate timed out after [20000 ms].] 2020-09-03 00:44:55,937 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Could not resolve ResourceManager address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*, retrying in 10000 ms: Could not connect to rpc endpoint under address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*. 2020-09-03 00:45:15,977 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Could not resolve ResourceManager address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*, retrying in 10000 ms: Could not connect to rpc endpoint under address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*. 2020-09-03 00:45:25,977 INFO akka.remote.transport.ProtocolStateActor [] - No response from remote for outbound association. Associate timed out after [20000 ms]. 2020-09-03 00:45:25,978 WARN akka.remote.ReliableDeliverySupervisor [] - Association with remote system [akka.tcp://flink@flink-jobmanager:6123] has failed, address is now gated for [50] ms. Reason: [Association failed with [akka.tcp://flink@flink-jobmanager:6123]] Caused by: [No response from remote for outbound association. Associate timed out after [20000 ms].] 2020-09-03 00:45:25,998 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Could not resolve ResourceManager address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*, retrying in 10000 ms: Could not connect to rpc endpoint under address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*. 2020-09-03 00:45:46,037 INFO org.apache.flink.runtime.taskexecutor.TaskExecutor [] - Could not resolve ResourceManager address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*, retrying in 10000 ms: Could not connect to rpc endpoint under address akka.tcp://flink@flink-jobmanager:6123/user/rpc/resourcemanager_*.