[ 
https://issues.apache.org/jira/browse/YARN-10495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

angerszhu updated YARN-10495:
-----------------------------
    Description: 
In  https://issues.apache.org/jira/browse/YARN-9561 we add dependency on crypto 
to container-executor, we meet a case that in our jenkins machine, we have 
libcrypto.so.1.0.0  in shared lib env. but in our nodemanager machine we don't 
have  libcrypto.so.1.0.0  but *libcrypto.so.1.1*

We use a  internal custom dynamic link library environment 
/usr/lib/x86_64-linux-gnu

and we build hadoop with parameter as blow
{code:java}
 -Drequire.openssl -Dbundle.openssl -Dopenssl.lib=/usr/lib/x86_64-linux-gnu
{code}
 

Under jenkins machine shared lib library path /usr/lib/x86_64-linux-gun(where 
is libcrypto)
{code:java}
-rw-r--r-- 1 root root   240136 Nov 28  2014 libcroco-0.6.so.3.0.1
-rw-r--r-- 1 root root    54550 Jun 18  2017 libcrypt.a
-rw-r--r-- 1 root root  4306444 Sep 26  2019 libcrypto.a
lrwxrwxrwx 1 root root       18 Sep 26  2019 libcrypto.so -> libcrypto.so.1.0.0
-rw-r--r-- 1 root root  2070976 Sep 26  2019 libcrypto.so.1.0.0
lrwxrwxrwx 1 root root       35 Jun 18  2017 libcrypt.so -> 
/lib/x86_64-linux-gnu/libcrypt.so.1
-rw-r--r-- 1 root root      298 Jun 18  2017 libc.so
{code}
 

Under nodemanager shared lib library path /usr/lib/x86_64-linux-gun(where is 
libcrypto)
{code:java}
-rw-r--r--  1 root root    55852 2��   7  2019 libcrypt.a
-rw-r--r--  1 root root  4864244 9��  28  2019 libcrypto.a
lrwxrwxrwx  1 root root       16 9��  28  2019 libcrypto.so -> libcrypto.so.1.1
-rw-r--r--  1 root root  2504576 12�� 24  2019 libcrypto.so.1.0.2
-rw-r--r--  1 root root  2715840 9��  28  2019 libcrypto.so.1.1
lrwxrwxrwx  1 root root       35 2��   7  2019 libcrypt.so -> 
/lib/x86_64-linux-gnu/libcrypt.so.1
-rw-r--r--  1 root root      298 2��   7  2019 libc.so
{code}
 We build container-executor with 

The  libcrypto.so 's version is not same case error when we start nodemanager

 
{code:java}
.. 3 more Caused by: 
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationException:
 ExitCodeException exitCode=127: /home/hadoop/hadoop/bin/container-executor: 
error while loading shared libraries: libcrypto.so.1.0.0: cannot open shared 
object file: No such file or directory at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:182)
 at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:208)
 at 
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(LinuxContainerExecutor.java:306)
 ... 4 more Caused by: ExitCodeException exitCode=127: 
/home/hadoop/hadoop/bin/container-executor: error while loading shared 
libraries: libcrypto.so.1.0.0: cannot open shared object file: No such file or 
directory at org.apache.hadoop.util.Shell.runCommand(Shell.java:1008) at 
org.apache.hadoop.util.Shell.run(Shell.java:901) at 
org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:1213) at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:154)
 ... 6 more 
{code}
 

We should make RPATH of container-executor configurable to solve this problem 

  was:
In  https://issues.apache.org/jira/browse/YARN-9561 we add dependency on crypto 
to container-executor, we meet a case that in our jenkins machine, we have 
libcrypto.so.1.0.0  in shared lib env. but in our nodemanager machine we don't 
have  libcrypto.so.1.0.0  but *libcrypto.so.1.1*

we build hadoop with 
{code:java}
 -Drequire.openssl -Dbundle.openssl -Dopenssl.lib=/usr/lib/x86_64-linux-gnu
{code}
 

Under jenkins machine shared lib library pats /usr/lib/x86_64-linux-gun(where 
is libcrypto)
{code:java}
-rw-r--r-- 1 root root   240136 Nov 28  2014 libcroco-0.6.so.3.0.1
-rw-r--r-- 1 root root    54550 Jun 18  2017 libcrypt.a
-rw-r--r-- 1 root root  4306444 Sep 26  2019 libcrypto.a
lrwxrwxrwx 1 root root       18 Sep 26  2019 libcrypto.so -> libcrypto.so.1.0.0
-rw-r--r-- 1 root root  2070976 Sep 26  2019 libcrypto.so.1.0.0
lrwxrwxrwx 1 root root       35 Jun 18  2017 libcrypt.so -> 
/lib/x86_64-linux-gnu/libcrypt.so.1
-rw-r--r-- 1 root root      298 Jun 18  2017 libc.so
{code}
 

Under nodemanager shared lib library pats /usr/lib/x86_64-linux-gun(where is 
libcrypto)
{code:java}
-rw-r--r--  1 root root    55852 2��   7  2019 libcrypt.a
-rw-r--r--  1 root root  4864244 9��  28  2019 libcrypto.a
lrwxrwxrwx  1 root root       16 9��  28  2019 libcrypto.so -> libcrypto.so.1.1
-rw-r--r--  1 root root  2504576 12�� 24  2019 libcrypto.so.1.0.2
-rw-r--r--  1 root root  2715840 9��  28  2019 libcrypto.so.1.1
lrwxrwxrwx  1 root root       35 2��   7  2019 libcrypt.so -> 
/lib/x86_64-linux-gnu/libcrypt.so.1
-rw-r--r--  1 root root      298 2��   7  2019 libc.so
{code}
 We build container-executor with 

The  libcrypto.so 's version is not same case error when we start nodemanager

 
{code:java}
.. 3 more Caused by: 
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationException:
 ExitCodeException exitCode=127: /home/hadoop/hadoop/bin/container-executor: 
error while loading shared libraries: libcrypto.so.1.0.0: cannot open shared 
object file: No such file or directory at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:182)
 at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:208)
 at 
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(LinuxContainerExecutor.java:306)
 ... 4 more Caused by: ExitCodeException exitCode=127: 
/home/hadoop/hadoop/bin/container-executor: error while loading shared 
libraries: libcrypto.so.1.0.0: cannot open shared object file: No such file or 
directory at org.apache.hadoop.util.Shell.runCommand(Shell.java:1008) at 
org.apache.hadoop.util.Shell.run(Shell.java:901) at 
org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:1213) at 
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:154)
 ... 6 more 
{code}
 

We should make RPATH of container-executor configurable to solve this problem 


> make the rpath of container-executor configurable
> -------------------------------------------------
>
>                 Key: YARN-10495
>                 URL: https://issues.apache.org/jira/browse/YARN-10495
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: yarn
>            Reporter: angerszhu
>            Priority: Major
>         Attachments: YARN-10495.001.patch
>
>
> In  https://issues.apache.org/jira/browse/YARN-9561 we add dependency on 
> crypto to container-executor, we meet a case that in our jenkins machine, we 
> have libcrypto.so.1.0.0  in shared lib env. but in our nodemanager machine we 
> don't have  libcrypto.so.1.0.0  but *libcrypto.so.1.1*
> We use a  internal custom dynamic link library environment 
> /usr/lib/x86_64-linux-gnu
> and we build hadoop with parameter as blow
> {code:java}
>  -Drequire.openssl -Dbundle.openssl -Dopenssl.lib=/usr/lib/x86_64-linux-gnu
> {code}
>  
> Under jenkins machine shared lib library path /usr/lib/x86_64-linux-gun(where 
> is libcrypto)
> {code:java}
> -rw-r--r-- 1 root root   240136 Nov 28  2014 libcroco-0.6.so.3.0.1
> -rw-r--r-- 1 root root    54550 Jun 18  2017 libcrypt.a
> -rw-r--r-- 1 root root  4306444 Sep 26  2019 libcrypto.a
> lrwxrwxrwx 1 root root       18 Sep 26  2019 libcrypto.so -> 
> libcrypto.so.1.0.0
> -rw-r--r-- 1 root root  2070976 Sep 26  2019 libcrypto.so.1.0.0
> lrwxrwxrwx 1 root root       35 Jun 18  2017 libcrypt.so -> 
> /lib/x86_64-linux-gnu/libcrypt.so.1
> -rw-r--r-- 1 root root      298 Jun 18  2017 libc.so
> {code}
>  
> Under nodemanager shared lib library path /usr/lib/x86_64-linux-gun(where is 
> libcrypto)
> {code:java}
> -rw-r--r--  1 root root    55852 2��   7  2019 libcrypt.a
> -rw-r--r--  1 root root  4864244 9��  28  2019 libcrypto.a
> lrwxrwxrwx  1 root root       16 9��  28  2019 libcrypto.so -> 
> libcrypto.so.1.1
> -rw-r--r--  1 root root  2504576 12�� 24  2019 libcrypto.so.1.0.2
> -rw-r--r--  1 root root  2715840 9��  28  2019 libcrypto.so.1.1
> lrwxrwxrwx  1 root root       35 2��   7  2019 libcrypt.so -> 
> /lib/x86_64-linux-gnu/libcrypt.so.1
> -rw-r--r--  1 root root      298 2��   7  2019 libc.so
> {code}
>  We build container-executor with 
> The  libcrypto.so 's version is not same case error when we start nodemanager
>  
> {code:java}
> .. 3 more Caused by: 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationException:
>  ExitCodeException exitCode=127: /home/hadoop/hadoop/bin/container-executor: 
> error while loading shared libraries: libcrypto.so.1.0.0: cannot open shared 
> object file: No such file or directory at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:182)
>  at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:208)
>  at 
> org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(LinuxContainerExecutor.java:306)
>  ... 4 more Caused by: ExitCodeException exitCode=127: 
> /home/hadoop/hadoop/bin/container-executor: error while loading shared 
> libraries: libcrypto.so.1.0.0: cannot open shared object file: No such file 
> or directory at org.apache.hadoop.util.Shell.runCommand(Shell.java:1008) at 
> org.apache.hadoop.util.Shell.run(Shell.java:901) at 
> org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:1213) at 
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor.executePrivilegedOperation(PrivilegedOperationExecutor.java:154)
>  ... 6 more 
> {code}
>  
> We should make RPATH of container-executor configurable to solve this problem 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to