[ https://issues.apache.org/jira/browse/FLINK-16636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17090318#comment-17090318 ]
Caizhi Weng commented on FLINK-16636: ------------------------------------- Sorry for all the non-sense guessing above... I've tested on {{ProcessBuilder}} today and it doesn't seem to double the memory usage. However, I discover a [bug report|https://bugs.openjdk.java.net/browse/JDK-8054841] saying that there are native memory leak issues in {{ProcessBuilder}}. From {{container.log}} the testing JVM is indeed using a lot of virtual memory, a lot more than the configured 2GB heap size. {code} [ 1641.681910] [ pid ] uid tgid total_vm rss pgtables_bytes swapents oom_score_adj name [...] [ 1641.681977] [14686] 2000 14686 919178 226907 2179072 0 0 java [ 1641.681979] [18492] 2000 18492 1127 155 53248 0 0 sh [ 1641.681980] [18494] 2000 18494 1290898 928196 7905280 0 0 java [ 1641.681982] [18506] 2000 18506 1127 154 53248 0 0 sh [ 1641.681983] [18510] 2000 18510 1128815 776451 6701056 0 0 java [ 1641.681985] [21228] 2000 21228 4292 79 69632 0 0 sleep [ 1641.681986] Out of memory: Kill process 18494 (java) score 455 or sacrifice child [ 1641.689731] Killed process 18494 (java) total-vm:5163592kB, anon-rss:3712784kB, file-rss:0kB, shmem-rss:0kB [ 1641.849849] oom_reaper: reaped process 18494 (java), now anon-rss:0kB, file-rss:0kB, shmem-rss:0kB {code} Here {{total_vm}} and {{rss}} are counted by 4kB blocks and it seems that one single JVM process is using up to 5GB virtual memory and 3.7GB of anonymous rss. Is this memory usage, especially non-heap memory usage reasonable? Another strange thing I noticed is that the sum of total_vm or rss seems to be larger than the container memory limit. Is this an expected or reasonable behavior? > TableEnvironmentITCase is crashing on Travis > -------------------------------------------- > > Key: FLINK-16636 > URL: https://issues.apache.org/jira/browse/FLINK-16636 > Project: Flink > Issue Type: Bug > Components: Table SQL / Planner > Affects Versions: 1.11.0 > Reporter: Jark Wu > Assignee: Caizhi Weng > Priority: Blocker > Labels: pull-request-available, test-stability > Fix For: 1.11.0 > > Time Spent: 20m > Remaining Estimate: 0h > > Here is the instance and exception stack: > https://api.travis-ci.org/v3/job/663408376/log.txt > But there is not too much helpful information there, maybe a accidental maven > problem. > {code} > 09:55:07.703 [ERROR] Failed to execute goal > org.apache.maven.plugins:maven-surefire-plugin:2.22.1:test > (integration-tests) on project flink-table-planner-blink_2.11: There are test > failures. > 09:55:07.703 [ERROR] > 09:55:07.703 [ERROR] Please refer to > /home/travis/build/apache/flink/flink-table/flink-table-planner-blink/target/surefire-reports > for the individual test results. > 09:55:07.703 [ERROR] Please refer to dump files (if any exist) [date].dump, > [date]-jvmRun[N].dump and [date].dumpstream. > 09:55:07.703 [ERROR] ExecutionException The forked VM terminated without > properly saying goodbye. VM crash or System.exit called? > 09:55:07.703 [ERROR] Command was /bin/sh -c cd > /home/travis/build/apache/flink/flink-table/flink-table-planner-blink/target > && /usr/lib/jvm/java-8-openjdk-amd64/jre/bin/java -Xms256m -Xmx2048m > -Dmvn.forkNumber=1 -XX:+UseG1GC -jar > /home/travis/build/apache/flink/flink-table/flink-table-planner-blink/target/surefire/surefirebooter714252487017838305.jar > > /home/travis/build/apache/flink/flink-table/flink-table-planner-blink/target/surefire > 2020-03-17T09-34-41_826-jvmRun1 surefire4625103637332937565tmp > surefire_43192129054983363633tmp > 09:55:07.703 [ERROR] Error occurred in starting fork, check output in log > 09:55:07.703 [ERROR] Process Exit Code: 137 > 09:55:07.703 [ERROR] Crashed tests: > 09:55:07.703 [ERROR] org.apache.flink.table.api.TableEnvironmentITCase > 09:55:07.703 [ERROR] > org.apache.maven.surefire.booter.SurefireBooterForkException: > ExecutionException The forked VM terminated without properly saying goodbye. > VM crash or System.exit called? > 09:55:07.703 [ERROR] Command was /bin/sh -c cd > /home/travis/build/apache/flink/flink-table/flink-table-planner-blink/target > && /usr/lib/jvm/java-8-openjdk-amd64/jre/bin/java -Xms256m -Xmx2048m > -Dmvn.forkNumber=1 -XX:+UseG1GC -jar > /home/travis/build/apache/flink/flink-table/flink-table-planner-blink/target/surefire/surefirebooter714252487017838305.jar > > /home/travis/build/apache/flink/flink-table/flink-table-planner-blink/target/surefire > 2020-03-17T09-34-41_826-jvmRun1 surefire4625103637332937565tmp > surefire_43192129054983363633tmp > 09:55:07.703 [ERROR] Error occurred in starting fork, check output in log > 09:55:07.703 [ERROR] Process Exit Code: 137 > 09:55:07.703 [ERROR] Crashed tests: > 09:55:07.703 [ERROR] org.apache.flink.table.api.TableEnvironmentITCase > 09:55:07.703 [ERROR] at > org.apache.maven.plugin.surefire.booterclient.ForkStarter.awaitResultsDone(ForkStarter.java:510) > 09:55:07.704 [ERROR] at > org.apache.maven.plugin.surefire.booterclient.ForkStarter.runSuitesForkOnceMultiple(ForkStarter.java:382) > 09:55:07.704 [ERROR] at > org.apache.maven.plugin.surefire.booterclient.ForkStarter.run(ForkStarter.java:297) > 09:55:07.704 [ERROR] at > org.apache.maven.plugin.surefire.booterclient.ForkStarter.run(ForkStarter.java:246) > 09:55:07.704 [ERROR] at > org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeProvider(AbstractSurefireMojo.java:1183) > 09:55:07.704 [ERROR] at > org.apache.maven.plugin.surefire.AbstractSurefireMojo.executeAfterPreconditionsChecked(AbstractSurefireMojo.java:1011) > 09:55:07.704 [ERROR] at > org.apache.maven.plugin.surefire.AbstractSurefireMojo.execute(AbstractSurefireMojo.java:857) > 09:55:07.704 [ERROR] at > org.apache.maven.plugin.DefaultBuildPluginManager.executeMojo(DefaultBuildPluginManager.java:132) > 09:55:07.704 [ERROR] at > org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:208) > 09:55:07.704 [ERROR] at > org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:153) > 09:55:07.704 [ERROR] at > org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:145) > 09:55:07.704 [ERROR] at > org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:116) > 09:55:07.704 [ERROR] at > org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:80) > 09:55:07.704 [ERROR] at > org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build(SingleThreadedBuilder.java:51) > 09:55:07.704 [ERROR] at > org.apache.maven.lifecycle.internal.LifecycleStarter.execute(LifecycleStarter.java:120) > 09:55:07.704 [ERROR] at > org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:355) > 09:55:07.704 [ERROR] at > org.apache.maven.DefaultMaven.execute(DefaultMaven.java:155) > 09:55:07.704 [ERROR] at > org.apache.maven.cli.MavenCli.execute(MavenCli.java:584) > 09:55:07.704 [ERROR] at > org.apache.maven.cli.MavenCli.doMain(MavenCli.java:216) > 09:55:07.704 [ERROR] at org.apache.maven.cli.MavenCli.main(MavenCli.java:160) > 09:55:07.704 [ERROR] at sun.reflect.NativeMethodAccessorImpl.invoke0(Native > Method) > 09:55:07.704 [ERROR] at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) > 09:55:07.704 [ERROR] at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) > 09:55:07.704 [ERROR] at java.lang.reflect.Method.invoke(Method.java:498) > 09:55:07.704 [ERROR] at > org.codehaus.plexus.classworlds.launcher.Launcher.launchEnhanced(Launcher.java:289) > 09:55:07.704 [ERROR] at > org.codehaus.plexus.classworlds.launcher.Launcher.launch(Launcher.java:229) > 09:55:07.704 [ERROR] at > org.codehaus.plexus.classworlds.launcher.Launcher.mainWithExitCode(Launcher.java:415) > 09:55:07.704 [ERROR] at > org.codehaus.plexus.classworlds.launcher.Launcher.main(Launcher.java:356) > 09:55:07.704 [ERROR] Caused by: > org.apache.maven.surefire.booter.SurefireBooterForkException: The forked VM > terminated without properly saying goodbye. VM crash or System.exit called? > 09:55:07.704 [ERROR] Command was /bin/sh -c cd > /home/travis/build/apache/flink/flink-table/flink-table-planner-blink/target > && /usr/lib/jvm/java-8-openjdk-amd64/jre/bin/java -Xms256m -Xmx2048m > -Dmvn.forkNumber=1 -XX:+UseG1GC -jar > /home/travis/build/apache/flink/flink-table/flink-table-planner-blink/target/surefire/surefirebooter714252487017838305.jar > > /home/travis/build/apache/flink/flink-table/flink-table-planner-blink/target/surefire > 2020-03-17T09-34-41_826-jvmRun1 surefire4625103637332937565tmp > surefire_43192129054983363633tmp > 09:55:07.704 [ERROR] Error occurred in starting fork, check output in log > 09:55:07.704 [ERROR] Process Exit Code: 137 > 09:55:07.704 [ERROR] Crashed tests: > 09:55:07.704 [ERROR] org.apache.flink.table.api.TableEnvironmentITCase > 09:55:07.704 [ERROR] at > org.apache.maven.plugin.surefire.booterclient.ForkStarter.fork(ForkStarter.java:669) > 09:55:07.704 [ERROR] at > org.apache.maven.plugin.surefire.booterclient.ForkStarter.access$600(ForkStarter.java:115) > 09:55:07.704 [ERROR] at > org.apache.maven.plugin.surefire.booterclient.ForkStarter$1.call(ForkStarter.java:371) > 09:55:07.704 [ERROR] at > org.apache.maven.plugin.surefire.booterclient.ForkStarter$1.call(ForkStarter.java:347) > 09:55:07.704 [ERROR] at > java.util.concurrent.FutureTask.run(FutureTask.java:266) > 09:55:07.704 [ERROR] at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) > 09:55:07.704 [ERROR] at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) > 09:55:07.704 [ERROR] at java.lang.Thread.run(Thread.java:748) > 09:55:07.704 [ERROR] -> [Help 1] > 09:55:07.704 [ERROR] > 09:55:07.704 [ERROR] To see the full stack trace of the errors, re-run Maven > with the -e switch. > 09:55:07.704 [ERROR] Re-run Maven using the -X switch to enable full debug > logging. > 09:55:07.704 [ERROR] > 09:55:07.704 [ERROR] For more information about the errors and possible > solutions, please read the following articles: > 09:55:07.704 [ERROR] [Help 1] > http://cwiki.apache.org/confluence/display/MAVEN/MojoExecutionException > 09:55:07.704 [ERROR] > 09:55:07.704 [ERROR] After correcting the problems, you can resume the build > with the command > 09:55:07.704 [ERROR] mvn <goals> -rf :flink-table-planner-blink_2.11 > {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)