Piotr Nowojski created FLINK-8826: ------------------------------------- Summary: In Flip6 mode, when starting yarn cluster, configured taskmanager.heap.mb is ignored Key: FLINK-8826 URL: https://issues.apache.org/jira/browse/FLINK-8826 Project: Flink Issue Type: Bug Components: ResourceManager, YARN Affects Versions: 1.5.0 Reporter: Piotr Nowojski
When I tried running some job on the cluster, despite setting taskmanager.heap.mb = 3072 taskmanager.network.memory.fraction: 0.4 and reported in the console {{ Cluster specification: ClusterSpecification\{masterMemoryMB=768, taskManagerMemoryMB=3072, numberTaskManagers=92, slotsPerTaskManager=1} }} The actual settings were: {{ 2018-03-01 14:53:18,918 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - -------------------------------------------------------------------------------- 2018-03-01 14:53:18,921 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - Starting YARN TaskExecutor runner (Version: 1.5-SNAPSHOT, Rev:e92eb39, Date:28.02.2018 @ 17:43:39 UTC) 2018-03-01 14:53:18,921 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - OS current user: yarn 2018-03-01 14:53:19,780 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - Current Hadoop/Kerberos user: hadoop 2018-03-01 14:53:19,781 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - JVM: OpenJDK 64-Bit Server VM - Oracle Corporation - 1.8/25.161-b14 2018-03-01 14:53:19,781 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - Maximum heap size: 245 MiBytes 2018-03-01 14:53:19,781 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - JAVA_HOME: /usr/lib/jvm/java-openjdk 2018-03-01 14:53:19,783 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - Hadoop version: 2.4.1 2018-03-01 14:53:19,783 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - JVM Options: 2018-03-01 14:53:19,783 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - -Xms255m 2018-03-01 14:53:19,784 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - -Xmx255m 2018-03-01 14:53:19,784 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - -XX:MaxDirectMemorySize=769m 2018-03-01 14:53:19,784 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - -Dlog.file=/var/log/hadoop-yarn/containers/application_1516373731080_1150/container_1516373731080_1150_01_000105/taskmanager.log 2018-03-01 14:53:19,784 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - -Dlogback.configurationFile=file:./logback.xml 2018-03-01 14:53:19,784 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - -Dlog4j.configuration=file:./log4j.properties 2018-03-01 14:53:19,784 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - Program Arguments: 2018-03-01 14:53:19,784 INFO org.apache.flink.yarn.YarnTaskExecutorRunner - --configDir }} Heap was set to 255, while with default cuts of it should be 1383. 255MB seems like coming from default taskmanager.heap.mb value of 1024. When starting in non flip6 everything works as expected: {{ 2018-03-01 14:04:49,650 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - -------------------------------------------------------------------------------- 2018-03-01 14:04:49,700 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - Starting YARN TaskManager (Version: 1.5-SNAPSHOT, Rev:e92eb39, Date:28.02.2018 @ 17:43:39 UTC) 2018-03-01 14:04:49,700 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - OS current user: yarn 2018-03-01 14:04:53,277 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - Current Hadoop/Kerberos user: hadoop 2018-03-01 14:04:53,278 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - JVM: OpenJDK 64-Bit Server VM - Oracle Corporation - 1.8/25.161-b14 2018-03-01 14:04:53,279 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - Maximum heap size: 1326 MiBytes 2018-03-01 14:04:53,279 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - JAVA_HOME: /usr/lib/jvm/java-openjdk 2018-03-01 14:04:53,282 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - Hadoop version: 2.4.1 2018-03-01 14:04:53,284 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - JVM Options: 2018-03-01 14:04:53,284 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - -Xms1383m 2018-03-01 14:04:53,284 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - -Xmx1383m 2018-03-01 14:04:53,284 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - -XX:MaxDirectMemorySize=1689m 2018-03-01 14:04:53,284 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - -Dlog.file=/var/log/hadoop-yarn/containers/application_1516373731080_1138/container_1516373731080_1138_01_000063/taskmanager.log 2018-03-01 14:04:53,285 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - -Dlogback.configurationFile=file:./logback.xml 2018-03-01 14:04:53,286 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - -Dlog4j.configuration=file:./log4j.properties 2018-03-01 14:04:53,287 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - Program Arguments: 2018-03-01 14:04:53,287 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - --configDir 2018-03-01 14:04:53,287 INFO org.apache.flink.yarn.YarnTaskManagerRunnerFactory - . }} CC [~till.rohrmann] -- This message was sent by Atlassian JIRA (v7.6.3#76005)