Wei Chen created YARN-5914: ------------------------------ Summary: NodeManager will report "Error: No such image, container or task" when DockerContainerLauncher launches a container Key: YARN-5914 URL: https://issues.apache.org/jira/browse/YARN-5914 Project: Hadoop YARN Issue Type: Bug Components: nodemanager Affects Versions: 2.6.0 Environment: Ubuntu 16.04, Docker1.12.1, Spark-2.0.1,Spark-1.6.2,Hadoop2.6.0,Hadoop-2.7.1 Reporter: Wei Chen
Hi, all I have tested MapReduce and Spark(1.6.2,2.0.1) with Docker execution enabled. I found DockerContainerExecutor will report "Error: No such image, container or task" each time it launch a task container. Then I checked the docker_container_executor_session.sh and find this: echo `/usr/bin/docker inspect --format {{.State.Pid}} container_1479428705108_0002_01_000001` > /home/cwei/project/hadoop-2.7.3/yarn-temp/nm-local-dir/nmPrivate/application_1479428705108_0002/container_1479428705108_0002_01_000001/container_1479428705108_0002_01_000001.pid.tmp /bin/mv -f /home/cwei/project/hadoop-2.7.3/yarn-temp/nm-local-dir/nmPrivate/application_1479428705108_0002/container_1479428705108_0002_01_000001/container_1479428705108_0002_01_000001.pid.tmp /home/cwei/project/hadoop-2.7.3/yarn-temp/nm-local-dir/nmPrivate/application_1479428705108_0002/container_1479428705108_0002_01_000001/container_1479428705108_0002_01_000001.pid /usr/bin/docker run --memory=1024m --memory-swap -1 -it --net=host --name container_1479428705108_0002_01_000001 -v /home/cwei/project/hadoop-2.7.3/yarn-temp/nm-local-dir:/home/cwei/project/hadoop-2.7.3/yarn-temp/nm-local-dir -v /home/cwei/project/hadoop-2.7.3/logs/userlogs:/home/cwei/project/hadoop-2.7.3/logs/userlogs -v /home/cwei/project/hadoop-2.7.3/yarn-temp/nm-local-dir/usercache/cwei/appcache/application_1479428705108_0002/container_1479428705108_0002_01_000001:/home/cwei/project/hadoop-2.7.3/yarn-temp/nm-local-dir/usercache/cwei/appcache/application_1479428705108_0002/container_1479428705108_0002_01_000001 sequenceiq/hadoop-docker:2.7.1 bash "/home/cwei/project/hadoop-2.7.3/yarn-temp/nm-local-dir/usercache/cwei/appcache/application_1479428705108_0002/container_1479428705108_0002_01_000001/launch_container.sh" Since `/usr/bin/docker inspect --format {{.State.Pid}} container_1479428705108_0002_01_000001` is called before the container is launched by calling `docker run...` , so it always cause this error log message. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org