hi all,

I have a bash script testing the docker container executor, try to
configure the distributedshell such like below. But keep having error as
like below.

Any helps please... Thanks!


#!/bin/bash
export HADOOP_HOME="/usr/hdp/3.1.0.0-78/hadoop"
export LD_LIBRARY_PATH="$LD_LIBRARY_PATH:$HADOOP_HOME/lib/native"
export HADOOP_OPTS="-Djava.library.path=$HADOOP_HOME/lib/native"
export HADOOP_COMMON_LIB_NATIVE_DIR="$HADOOP_HOME/lib/native"
export JAVA_LIBRARY_PATH="$HADOOP_HOME/lib/native:$JAVA_LIBRARY_PATH"
export
DSHELL_JAR="/usr/hdp/3.1.0.0-78/hadoop-yarn/hadoop-yarn-applications-distributedshell-3.2.0.jar"
#export DOCKER_IMAGE="local/centos"
export DOCKER_IMAGE="local/openjdk:8.1"
export DSHELL_CMD="ls"
export NUM_OF_CONTAINERS=1

yarn --loglevel DEBUG jar $DSHELL_JAR \
-shell_command $DSHELL_CMD \
-jar $DSHELL_JAR \
-shell_env YARN_CONTAINER_RUNTIME_TYPE="$RUNTIME" \
-shell_env YARN_CONTAINER_RUNTIME_DOCKER_IMAGE="$DOCKER_IMAGE" \
-num_containers $NUM_OF_CONTAINERS


19/08/30 15:22:12 INFO distributedshell.ApplicationMaster: placementSpecs null
19/08/30 15:22:12 INFO distributedshell.ApplicationMaster: Requested
container ask: Capability[<memory:10,
vCores:1>]Priority[0]AllocationRequestId[0]ExecutionTypeRequest[{Execution
Type: GUARANTEED, Enforce Execution Type: false}]Resource Profile[]
19/08/30 15:22:14 INFO distributedshell.ApplicationMaster: Got
response from RM for container ask, allocatedCnt=1
19/08/30 15:22:14 INFO distributedshell.ApplicationMaster: Launching
shell command on a new container.,
containerId=container_e101_1567140885858_0043_01_000002,
yarnShellId=1, containerNode=hk-hdpoc-2001.agprod1.agoda.local:45454,
containerNodeURI=hk-hdpoc-2001.agprod1.agoda.local:8042,
containerResourceMemory1024, containerResourceVirtualCores1
19/08/30 15:22:14 INFO distributedshell.ApplicationMaster: Setting up
container launch container for
containerid=container_e101_1567140885858_0043_01_000002 with shellid=1
19/08/30 15:22:14 INFO impl.NMClientAsyncImpl: Processing Event
EventType: START_CONTAINER for Container
container_e101_1567140885858_0043_01_000002
19/08/30 15:22:14 INFO impl.NMClientAsyncImpl: Processing Event
EventType: QUERY_CONTAINER for Container
container_e101_1567140885858_0043_01_000002
19/08/30 15:22:15 INFO distributedshell.ApplicationMaster: Got
response from RM for container ask, completedCnt=1
19/08/30 15:22:15 ERROR distributedshell.ApplicationMaster:
appattempt_1567140885858_0043_000001 got container status for
containerID=container_e101_1567140885858_0043_01_000002,
state=COMPLETE, exitStatus=127, diagnostics=[2019-08-30
15:22:15.671]Exception from container-launch.
Container id: container_e101_1567140885858_0043_01_000002
Exit code: 127
Exception message: Launch container failed
Shell output: main : command provided 4
main : run as user is ambari-qa
main : requested yarn user is ambari-qa
802b0a68c8332e819912e51eafc9527f382f48dbc91365bf5beb6ed54e14389c
Creating script paths...
Creating local dirs...
Getting exit code file...
Changing effective user to root...
Inspecting docker container...
Docker inspect command: /usr/bin/docker inspect --format
{{.State.Pid}} container_e101_1567140885858_0043_01_000002
pid from docker inspect: 0
Obtaining the exit code...
Docker inspect command: /usr/bin/docker inspect --format
{{.State.ExitCode}} container_e101_1567140885858_0043_01_000002
Exit code from docker inspect: 127
Wrote the exit code 127 to
/hadoop/yarn/local/nmPrivate/application_1567140885858_0043/container_e101_1567140885858_0043_01_000002/container_e101_1567140885858_0043_01_000002.pid.exitcode


[2019-08-30 15:22:15.672]Container exited with a non-zero exit code
127. Last 4096 bytes of stderr.txt :


[2019-08-30 15:22:15.673]Container exited with a non-zero exit code
127. Last 4096 bytes of stderr.txt :



19/08/30 15:22:16 INFO distributedshell.ApplicationMaster: Application
completed. Stopping running containers
19/08/30 15:22:16 INFO distributedshell.ApplicationMaster: Application
completed. Signalling finished to RM
19/08/30 15:22:16 INFO impl.AMRMClientImpl: Waiting for application to
be successfully unregistered.
19/08/30 15:22:16 ERROR distributedshell.ApplicationMaster:
Application Master failed. exiting


-- 
Hiu Yen Onn

Reply via email to