[jira] [Created] (YARN-6082) Webservice connection gets cutoff when it has to send back a large response (webservice)
Sunil G created YARN-6082: - Summary: Webservice connection gets cutoff when it has to send back a large response (webservice) Key: YARN-6082 URL: https://issues.apache.org/jira/browse/YARN-6082 Project: Hadoop YARN Issue Type: Bug Reporter: Sunil G Priority: Critical {noformat} 2017-01-11 07:17:11,475 WARN ipc.Server (Server.java:run(2202)) - Large response size 4476919 for call org.apache.hadoop.yarn.api.ApplicationClientProtocolPB.getApplications from 172.27.0.101:39950 Call#951474 Retry#0 {noformat} In such cases, json output will get cutoff and client will not get clean response. For eg: {noformat} Unexpected token I in JSON at position 851 {noformat} -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org
[jira] [Created] (YARN-6081) LeafQueue#getTotalPendingResourcesConsideringUserLimit should deduct reserved from pending to avoid unnecessary preemption of reserved container
Wangda Tan created YARN-6081: Summary: LeafQueue#getTotalPendingResourcesConsideringUserLimit should deduct reserved from pending to avoid unnecessary preemption of reserved container Key: YARN-6081 URL: https://issues.apache.org/jira/browse/YARN-6081 Project: Hadoop YARN Issue Type: Bug Reporter: Wangda Tan Assignee: Wangda Tan Priority: Critical While doing YARN-5864 tests, found an issue when a queue's reserved > pending. PreemptionResourceCalculator will preempt reserved container even if there's only one active queue in the cluster. To fix the problem, we need to deduct reserved from pending when getting total-pending resource for LeafQueue. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org
[jira] [Resolved] (YARN-6079) simple spelling errors in yarn test code
[ https://issues.apache.org/jira/browse/YARN-6079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Junping Du resolved YARN-6079. -- Resolution: Fixed Hadoop Flags: Reviewed Fix Version/s: 3.0.0-alpha2 2.9.0 > simple spelling errors in yarn test code > > > Key: YARN-6079 > URL: https://issues.apache.org/jira/browse/YARN-6079 > Project: Hadoop YARN > Issue Type: Bug > Components: test >Reporter: Grant Sohn >Assignee: vijay >Priority: Trivial > Fix For: 2.9.0, 3.0.0-alpha2 > > Attachments: YARN-6079.001.patch > > > charactor -> character > hadoop-yarn/hadoop-yarn-common/src/test/java/org/apache/hadoop/yarn/nodelabels/TestCommonNodeLabelsManager.java: > Assert.assertTrue("invalid label charactor should not add to repo", > caught); > expteced -> expected > hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/TestRMWebApp.java: > Assert.fail("Exception is not expteced."); > Exepected -> Expected > hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/TestLeafQueue.java: > "Exepected AbsoluteUsedCapacity > 0.95, got: " > expteced -> expected > hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/TestRMWebApp.java: > Assert.fail("Exception is not expteced."); > macthing -> matching > hadoop-yarn/hadoop-yarn-client/src/test/java/org/apache/hadoop/yarn/client/api/impl/TestAMRMClient.java: > assertEquals("Expected no macthing requests.", matches.size(), 0); > propogated -> propagated > hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/TestNodeHealthService.java: > Assert.assertTrue("Node script time out message not propogated", > protential -> potential > hadoop-yarn/hadoop-yarn-common/src/test/java/org/apache/hadoop/yarn/api/BasePBImplRecordsTest.java: > LOG.info(String.format("Exclude protential property: %s\n", > gsp.propertyName)); > recevied -> received > hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/localizer/TestResourceLocalizationService.java: > throw new Exception("Unexpected resource recevied."); > shouldnt -> shouldn't > hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/TestRMWebServiceAppsNodelabel.java: > fail("resourceInfo object shouldnt be available for finished apps"); > Transistion -> Transition > hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestRMHA.java: > Assert.fail("Transistion to Active should have failed for > refreshAll()"); > Unhelathy -> Unhealthy > hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestRMNodeTransitions.java: > Assert.assertEquals("Unhelathy Nodes", initialUnHealthy, -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org
[jira] [Created] (YARN-6080) Create WebServiceUtils to have common functions used in RMWebService, NMWebService and AHSWebService
Xuan Gong created YARN-6080: --- Summary: Create WebServiceUtils to have common functions used in RMWebService, NMWebService and AHSWebService Key: YARN-6080 URL: https://issues.apache.org/jira/browse/YARN-6080 Project: Hadoop YARN Issue Type: Bug Reporter: Xuan Gong Create WebServiceUtils to remove the duplicate code. Also, provide the pattern to create webService call which could be used by client. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org
[jira] [Created] (YARN-6079) simple spelling errors in yarn test code
Grant Sohn created YARN-6079: Summary: simple spelling errors in yarn test code Key: YARN-6079 URL: https://issues.apache.org/jira/browse/YARN-6079 Project: Hadoop YARN Issue Type: Bug Components: test Reporter: Grant Sohn Assignee: Grant Sohn Priority: Trivial charactor -> character hadoop-yarn/hadoop-yarn-common/src/test/java/org/apache/hadoop/yarn/nodelabels/TestCommonNodeLabelsManager.java: Assert.assertTrue("invalid label charactor should not add to repo", caught); expteced -> expected hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/TestRMWebApp.java: Assert.fail("Exception is not expteced."); Exepected -> Expected hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/TestLeafQueue.java: "Exepected AbsoluteUsedCapacity > 0.95, got: " expteced -> expected hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/TestRMWebApp.java: Assert.fail("Exception is not expteced."); macthing -> matching hadoop-yarn/hadoop-yarn-client/src/test/java/org/apache/hadoop/yarn/client/api/impl/TestAMRMClient.java: assertEquals("Expected no macthing requests.", matches.size(), 0); propogated -> propagated hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/TestNodeHealthService.java: Assert.assertTrue("Node script time out message not propogated", protential -> potential hadoop-yarn/hadoop-yarn-common/src/test/java/org/apache/hadoop/yarn/api/BasePBImplRecordsTest.java: LOG.info(String.format("Exclude protential property: %s\n", gsp.propertyName)); recevied -> received hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/localizer/TestResourceLocalizationService.java: throw new Exception("Unexpected resource recevied."); shouldnt -> shouldn't hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/TestRMWebServiceAppsNodelabel.java: fail("resourceInfo object shouldnt be available for finished apps"); Transistion -> Transition hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestRMHA.java: Assert.fail("Transistion to Active should have failed for refreshAll()"); Unhelathy -> Unhealthy hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestRMNodeTransitions.java: Assert.assertEquals("Unhelathy Nodes", initialUnHealthy, -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org
Apache Hadoop qbt Report: trunk+JDK8 on Linux/ppc64le
For more details, see https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-ppc/213/ [Jan 9, 2017 4:40:39 PM] (wangda) YARN-3955. Support for application priority ACLs in queues of [Jan 9, 2017 6:32:18 PM] (arp) HDFS-11301. Double wrapping over RandomAccessFile in [Jan 9, 2017 11:18:26 PM] (weichiu) HADOOP-13953. Make FTPFileSystem's data connection mode and transfer [Jan 9, 2017 11:44:42 PM] (yzhang) HDFS-11292. log lastWrittenTxId etc info in logSyncAll. Contributed by [Jan 10, 2017 2:01:37 AM] (wang) HADOOP-13885. Implement getLinkTarget for ViewFileSystem. Contributed by [Jan 10, 2017 2:05:33 AM] (jing9) HDFS-11273. Move TransferFsImage#doGetUrl function to a Util class. [Jan 10, 2017 2:14:46 AM] (junping_du) YARN-4148. When killing app, RM releases app's resource before they are [Jan 10, 2017 6:12:58 AM] (templedf) YARN-6073. Misuse of format specifier in Preconditions.checkArgument [Jan 10, 2017 8:38:01 AM] (sunilg) YARN-5899. Debug log in AbstractCSQueue#canAssignToThisQueue needs [Jan 10, 2017 10:05:01 AM] (naganarasimha_gr) YARN-5937. stop-yarn.sh is not able to gracefully stop node managers. [Jan 10, 2017 10:24:16 AM] (naganarasimha_gr) YARN-6054. TimelineServer fails to start when some LevelDb state files [Jan 10, 2017 11:37:58 AM] (lei) HDFS-11259. Update fsck to display maintenance state info. (Manoj -1 overall The following subsystems voted -1: compile unit The following subsystems voted -1 but were configured to be filtered/ignored: cc javac The following subsystems are considered long running: (runtime bigger than 1h 0m 0s) unit Specific tests: Failed junit tests : hadoop.hdfs.TestBlockStoragePolicy hadoop.hdfs.tools.offlineImageViewer.TestOfflineImageViewer hadoop.hdfs.web.TestWebHdfsTimeouts hadoop.yarn.server.timeline.TestRollingLevelDB hadoop.yarn.server.timeline.TestTimelineDataManager hadoop.yarn.server.timeline.TestLeveldbTimelineStore hadoop.yarn.server.timeline.webapp.TestTimelineWebServices hadoop.yarn.server.timeline.recovery.TestLeveldbTimelineStateStore hadoop.yarn.server.timeline.TestRollingLevelDBTimelineStore hadoop.yarn.server.applicationhistoryservice.TestApplicationHistoryServer hadoop.yarn.server.resourcemanager.recovery.TestLeveldbRMStateStore hadoop.yarn.server.resourcemanager.scheduler.capacity.TestContainerResizing hadoop.yarn.server.TestMiniYarnClusterNodeUtilization hadoop.yarn.server.TestContainerManagerSecurity hadoop.yarn.server.timeline.TestLevelDBCacheTimelineStore hadoop.yarn.server.timeline.TestOverrideTimelineStoreYarnClient hadoop.yarn.server.timeline.TestEntityGroupFSTimelineStore hadoop.yarn.applications.distributedshell.TestDistributedShell hadoop.mapred.TestShuffleHandler hadoop.mapreduce.v2.hs.TestHistoryServerLeveldbStateStoreService Timed out junit tests : org.apache.hadoop.hdfs.server.datanode.TestDataNodeVolumeFailureReporting org.apache.hadoop.hdfs.server.blockmanagement.TestBlockStatsMXBean org.apache.hadoop.hdfs.server.datanode.TestFsDatasetCache org.apache.hadoop.hdfs.server.datanode.TestDataNodeVolumeFailure compile: https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-ppc/213/artifact/out/patch-compile-root.txt [124K] cc: https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-ppc/213/artifact/out/patch-compile-root.txt [124K] javac: https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-ppc/213/artifact/out/patch-compile-root.txt [124K] unit: https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-ppc/213/artifact/out/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt [200K] https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-ppc/213/artifact/out/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-nodemanager.txt [16K] https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-ppc/213/artifact/out/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-applicationhistoryservice.txt [52K] https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-ppc/213/artifact/out/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-resourcemanager.txt [68K] https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-ppc/213/artifact/out/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-tests.txt [324K] https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-ppc/213/artifact/out/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-timeline-pluginstorage.txt [28K] https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-ppc/213/artifact/out/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-applications_hadoop-yarn-applic
Apache Hadoop qbt Report: trunk+JDK8 on Linux/x86
For more details, see https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-x86/282/ [Jan 9, 2017 1:24:22 PM] (varunsaxena) YARN-6074. FlowRunEntity does not deserialize long values correctly [Jan 9, 2017 4:40:39 PM] (wangda) YARN-3955. Support for application priority ACLs in queues of [Jan 9, 2017 6:32:18 PM] (arp) HDFS-11301. Double wrapping over RandomAccessFile in [Jan 9, 2017 11:18:26 PM] (weichiu) HADOOP-13953. Make FTPFileSystem's data connection mode and transfer [Jan 9, 2017 11:44:42 PM] (yzhang) HDFS-11292. log lastWrittenTxId etc info in logSyncAll. Contributed by [Jan 10, 2017 2:01:37 AM] (wang) HADOOP-13885. Implement getLinkTarget for ViewFileSystem. Contributed by [Jan 10, 2017 2:05:33 AM] (jing9) HDFS-11273. Move TransferFsImage#doGetUrl function to a Util class. [Jan 10, 2017 2:14:46 AM] (junping_du) YARN-4148. When killing app, RM releases app's resource before they are [Jan 10, 2017 6:12:58 AM] (templedf) YARN-6073. Misuse of format specifier in Preconditions.checkArgument -1 overall The following subsystems voted -1: asflicense unit The following subsystems voted -1 but were configured to be filtered/ignored: cc checkstyle javac javadoc pylint shellcheck shelldocs whitespace The following subsystems are considered long running: (runtime bigger than 1h 0m 0s) unit Specific tests: Failed junit tests : hadoop.hdfs.server.datanode.checker.TestThrottledAsyncChecker hadoop.hdfs.server.datanode.TestDirectoryScanner hadoop.yarn.server.timeline.webapp.TestTimelineWebServices hadoop.yarn.server.TestDiskFailures hadoop.yarn.server.TestContainerManagerSecurity hadoop.yarn.server.TestMiniYarnClusterNodeUtilization cc: https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-x86/282/artifact/out/diff-compile-cc-root.txt [4.0K] javac: https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-x86/282/artifact/out/diff-compile-javac-root.txt [168K] checkstyle: https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-x86/282/artifact/out/diff-checkstyle-root.txt [16M] pylint: https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-x86/282/artifact/out/diff-patch-pylint.txt [20K] shellcheck: https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-x86/282/artifact/out/diff-patch-shellcheck.txt [24K] shelldocs: https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-x86/282/artifact/out/diff-patch-shelldocs.txt [16K] whitespace: https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-x86/282/artifact/out/whitespace-eol.txt [11M] https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-x86/282/artifact/out/whitespace-tabs.txt [1.3M] javadoc: https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-x86/282/artifact/out/diff-javadoc-javadoc-root.txt [2.2M] unit: https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-x86/282/artifact/out/patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt [148K] https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-x86/282/artifact/out/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-applicationhistoryservice.txt [12K] https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-x86/282/artifact/out/patch-unit-hadoop-yarn-project_hadoop-yarn_hadoop-yarn-server_hadoop-yarn-server-tests.txt [324K] asflicense: https://builds.apache.org/job/hadoop-qbt-trunk-java8-linux-x86/282/artifact/out/patch-asflicense-problems.txt [4.0K] Powered by Apache Yetus 0.5.0-SNAPSHOT http://yetus.apache.org - To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org
[jira] [Created] (YARN-6078) Containers stuck in Localizing state
Jagadish created YARN-6078: -- Summary: Containers stuck in Localizing state Key: YARN-6078 URL: https://issues.apache.org/jira/browse/YARN-6078 Project: Hadoop YARN Issue Type: Bug Reporter: Jagadish I encountered an interesting issue in one of our Yarn clusters (where the containers are stuck in localizing phase). Our AM requests a container, and starts a process using the NMClient. According to the NM the container is in LOCALIZING state: {code} 1. 2017-01-09 22:06:18,362 [INFO] [AsyncDispatcher event handler] container.ContainerImpl.handle(ContainerImpl.java:1135) - Container container_e03_1481261762048_0541_02_60 transitioned from NEW to LOCALIZING 2017-01-09 22:06:18,363 [INFO] [AsyncDispatcher event handler] localizer.ResourceLocalizationService$LocalizerTracker.handle(ResourceLocalizationService.java:711) - Created localizer for container_e03_1481261762048_0541_02_60 2017-01-09 22:06:18,364 [INFO] [LocalizerRunner for container_e03_1481261762048_0541_02_60] localizer.ResourceLocalizationService$LocalizerRunner.writeCredentials(ResourceLocalizationService.java:1191) - Writing credentials to the nmPrivate file /../..//.nmPrivate/container_e03_1481261762048_0541_02_60.tokens. Credentials list: {code} According to the RM the container is in RUNNING state: {code} 2017-01-09 22:06:17,110 [INFO] [IPC Server handler 19 on 8030] rmcontainer.RMContainerImpl.handle(RMContainerImpl.java:410) - container_e03_1481261762048_0541_02_60 Container Transitioned from ALLOCATED to ACQUIRED 2017-01-09 22:06:19,084 [INFO] [ResourceManager Event Processor] rmcontainer.RMContainerImpl.handle(RMContainerImpl.java:410) - container_e03_1481261762048_0541_02_60 Container Transitioned from ACQUIRED to RUNNING {code} When I click the Yarn RM UI to view the logs for the container, I get an error that {code} No logs were found. state is LOCALIZING {code} The Node manager 's stack trace seems to indicate that the NM's LocalizerRunner is stuck waiting to read from the sub-process's outputstream. {code} "LocalizerRunner for container_e03_1481261762048_0541_02_60" #27007081 prio=5 os_prio=0 tid=0x7fa518849800 nid=0x15f7 runnable [0x7fa5076c3000] java.lang.Thread.State: RUNNABLE at java.io.FileInputStream.readBytes(Native Method) at java.io.FileInputStream.read(FileInputStream.java:255) at java.io.BufferedInputStream.read1(BufferedInputStream.java:284) at java.io.BufferedInputStream.read(BufferedInputStream.java:345) - locked <0xc6dc9c50> (a java.lang.UNIXProcess$ProcessPipeInputStream) at sun.nio.cs.StreamDecoder.readBytes(StreamDecoder.java:284) at sun.nio.cs.StreamDecoder.implRead(StreamDecoder.java:326) at sun.nio.cs.StreamDecoder.read(StreamDecoder.java:178) - locked <0xc6dc9c78> (a java.io.InputStreamReader) at java.io.InputStreamReader.read(InputStreamReader.java:184) at java.io.BufferedReader.fill(BufferedReader.java:161) at java.io.BufferedReader.read1(BufferedReader.java:212) at java.io.BufferedReader.read(BufferedReader.java:286) - locked <0xc6dc9c78> (a java.io.InputStreamReader) at org.apache.hadoop.util.Shell$ShellCommandExecutor.parseExecResult(Shell.java:786) at org.apache.hadoop.util.Shell.runCommand(Shell.java:568) at org.apache.hadoop.util.Shell.run(Shell.java:479) at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:773) at org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.startLocalizer(LinuxContainerExecutor.java:237) at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$LocalizerRunner.run(ResourceLocalizationService.java:1113) {code} I did a {code}ps aux{code} and confirmed that there was no container-executor process running with INITIALIZE_CONTAINER that the localizer starts. It seems that the output stream pipe of the process is still not closed (even though the localizer process is no longer present). -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org