[ https://issues.apache.org/jira/browse/YARN-5210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15319189#comment-15319189 ]
Varun Saxena commented on YARN-5210: ------------------------------------ QA report comes out clean. Kindly review. Haven't added tests but have verified the fix in my setup. > NPE in Distributed Shell while publishing DS_CONTAINER_START event and other > miscellaneous issues > ------------------------------------------------------------------------------------------------- > > Key: YARN-5210 > URL: https://issues.apache.org/jira/browse/YARN-5210 > Project: Hadoop YARN > Issue Type: Sub-task > Components: timelineserver > Affects Versions: YARN-2928 > Reporter: Varun Saxena > Assignee: Varun Saxena > Labels: yarn-2928-1st-milestone > Attachments: YARN-5210-YARN-2928.01.patch > > > Found a couple of issues while testing ATSv2. > * There is a NPE while publishing DS_CONTAINER_START_EVENT which in turn > means that this event is not published. > {noformat} > 2016-06-07 23:19:00,020 > [org.apache.hadoop.yarn.client.api.async.impl.NMClientAsyncImpl #0] INFO > org.apache.hadoop.yarn.client.api.async.impl.NMClientAsyncImpl: Unchecked > exception is thrown from onContainerStarted for Container > container_e77_1465311876353_0007_01_000002 > java.lang.NullPointerException > at > org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl.putEntities(TimelineClientImpl.java:389) > at > org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.putContainerEntity(ApplicationMaster.java:1284) > at > org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.publishContainerStartEvent(ApplicationMaster.java:1235) > at > org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster.access$1200(ApplicationMaster.java:175) > at > org.apache.hadoop.yarn.applications.distributedshell.ApplicationMaster$NMCallbackHandler.onContainerStarted(ApplicationMaster.java:986) > at > org.apache.hadoop.yarn.client.api.async.impl.NMClientAsyncImpl$StatefulContainer$StartContainerTransition.transition(NMClientAsyncImpl.java:454) > at > org.apache.hadoop.yarn.client.api.async.impl.NMClientAsyncImpl$StatefulContainer$StartContainerTransition.transition(NMClientAsyncImpl.java:436) > at > org.apache.hadoop.yarn.state.StateMachineFactory$MultipleInternalArc.doTransition(StateMachineFactory.java:385) > at > org.apache.hadoop.yarn.state.StateMachineFactory.doTransition(StateMachineFactory.java:302) > at > org.apache.hadoop.yarn.state.StateMachineFactory.access$300(StateMachineFactory.java:46) > at > org.apache.hadoop.yarn.state.StateMachineFactory$InternalStateMachine.doTransition(StateMachineFactory.java:448) > at > org.apache.hadoop.yarn.client.api.async.impl.NMClientAsyncImpl$StatefulContainer.handle(NMClientAsyncImpl.java:617) > at > org.apache.hadoop.yarn.client.api.async.impl.NMClientAsyncImpl$ContainerEventProcessor.run(NMClientAsyncImpl.java:676) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > {noformat} > * Created time is not reported from distributed shell for both DS_CONTAINER > and DS_APP_ATTEMPT entities. > As can be seen below, when we query DS_APP_ATTEMPT entities, we do not get > createdtime in response. > {code} > [ > { > "metrics": [ ], > "events": [ ], > "type": "DS_APP_ATTEMPT", > "id": "appattempt_1465246237936_0003_000001", > "isrelatedto": { }, > "relatesto": { }, > "info": { > "UID": > "yarn-cluster!application_1465246237936_0003!DS_APP_ATTEMPT!appattempt_1465246237936_0003_000001" > }, > "configs": { } > } > ] > {code} > As can be seen from response received upon querying a DS_CONTAINER entity we > can see that createdtime is not present and DS_CONTAINER_START is not present > either(due to NPE pointed above). > {code} > { > "metrics": [ ], > "events": [ > { > "id": "DS_CONTAINER_END", > "timestamp": 1465314587480, > "info": { > "Exit Status": 0, > "State": "COMPLETE" > } > } > ], > "type": "DS_CONTAINER", > "id": "container_e77_1465311876353_0003_01_000002", > "isrelatedto": { }, > "relatesto": { }, > "info": { > "UID": > "yarn-cluster!application_1465311876353_0003!DS_CONTAINER!container_e77_1465311876353_0003_01_000002" > }, > "configs": { } > } > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org