[ 
https://issues.apache.org/jira/browse/YARN-7890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Lowe reassigned YARN-7890:
--------------------------------

             Assignee: Jason Lowe
    Affects Version/s: 2.9.0
                       3.0.0-alpha1
     Target Version/s: 3.1.0, 2.9.1, 3.0.1

Attached a patch that initializes the filecache and user local directories in 
the relaunch container start context along with a unit test.

> NPE during container relaunch
> -----------------------------
>
>                 Key: YARN-7890
>                 URL: https://issues.apache.org/jira/browse/YARN-7890
>             Project: Hadoop YARN
>          Issue Type: Bug
>    Affects Versions: 3.0.0-alpha1, 2.9.0
>            Reporter: Billie Rinaldi
>            Assignee: Jason Lowe
>            Priority: Major
>         Attachments: YARN-7890.001.patch
>
>
> While running a recent build of trunk, I saw the following:
> {noformat}
> 2018-02-02 21:02:40,026 INFO 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.container.ContainerImpl:
>  Container container_e02_1517604848419_0002_01_000004 transitioned from 
> RELAUNCHING to RUNNING
> 2018-02-02 21:02:40,026 WARN 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerRelaunch:
>  Failed to relaunch container.
> java.lang.NullPointerException
>         at 
> java.util.Collections$UnmodifiableCollection.<init>(Collections.java:1026)
>         at 
> java.util.Collections$UnmodifiableList.<init>(Collections.java:1302)
>         at java.util.Collections.unmodifiableList(Collections.java:1287)
>         at 
> org.apache.hadoop.yarn.server.nodemanager.executor.ContainerStartContext.getFilecacheDirs(ContainerStartContext.java:200)
>         at 
> org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.buildContainerRuntimeContext(LinuxContainerExecutor.java:651)
>         at 
> org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.launchContainer(LinuxContainerExecutor.java:546)
>         at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.launchContainer(ContainerLaunch.java:465)
>         at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerRelaunch.call(ContainerRelaunch.java:107)
>         at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerRelaunch.call(ContainerRelaunch.java:49)
>         at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>         at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>         at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>         at java.lang.Thread.run(Thread.java:748)
> {noformat}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to