Hi Jeff,
After adding the Open MPI directories to PATH and LD_LIBRARY_PATH, I no longer see the "orted: command not found" error.
[root@vcn03 pasokan]# mpirun --allow-run-as-root -np 4 -host vcn03,vcn03,vcn04,vcn04 /mnt/lustre_client/pasokan/a.out
--------------------------------------------------------------------------
WARNING: No preset parameters were found for the device that Open MPI
detected:
Local host: vcn03
Device name: mlx5_0
Device vendor ID: 0x02c9
Device vendor part ID: 4114
Default device parameters will be used, which may result in lower
performance. You can edit any of the files specified by the
btl_openib_device_param_files MCA parameter to set values for your
device.
NOTE: You can turn off this warning by setting the MCA parameter
btl_openib_warn_no_device_params_found to 0.
--------------------------------------------------------------------------
[vcn04][[33859,1],2][connect/btl_openib_connect_udcm.c:1235:udcm_rc_qp_to_rtr] error modifing QP to RTR errno says Invalid argument
[vcn03][[33859,1],0][connect/btl_openib_connect_udcm.c:1235:udcm_rc_qp_to_rtr] error modifing QP to RTR errno says Invalid argument
[vcn03][[33859,1],1][connect/btl_openib_connect_udcm.c:1235:udcm_rc_qp_to_rtr] error modifing QP to RTR errno says Invalid argument
[vcn04][[33859,1],3][connect/btl_openib_connect_udcm.c:1235:udcm_rc_qp_to_rtr] error modifing QP to RTR errno says Invalid argument
Hello world from processor vcn03, rank 0 out of 4 processors
Hello world from processor vcn03, rank 1 out of 4 processors
Hello world from processor vcn04, rank 2 out of 4 processors
Hello world from processor vcn04, rank 3 out of 4 processors
[vcn03:05070] 3 more processes have sent help message help-mpi-btl-openib.txt / no device params found
[vcn03:05070] Set MCA parameter "orte_base_help_aggregate" to 0 to see all help / error messages
[root@vcn03 pasokan]#
However, IOR still does not run when compiled with Open MPI: it fails with a
segmentation fault. This was very straightforward on bare metal, but not under
KVM + SR-IOV.
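
One way to narrow this down, offered as a sketch under assumptions rather than a confirmed fix: Open MPI reads MCA parameters from environment variables prefixed with OMPI_MCA_. The first setting below silences the device-parameter warning, using the parameter named in the warning text itself. The second excludes the openib BTL, so a rerun shows whether the segmentation fault is tied to the openib path that logs the "QP to RTR" error. The second setting assumes TCP connectivity between vcn03 and vcn04 and is for diagnosis only, since it gives up InfiniBand transport.

```shell
# Silence the "No preset parameters were found" warning.
# The parameter name comes from the NOTE in the warning output.
export OMPI_MCA_btl_openib_warn_no_device_params_found=0

# Diagnostic only (assumption: TCP works between the nodes):
# exclude the openib BTL so Open MPI falls back to its other transports.
export OMPI_MCA_btl='^openib'

# Then rerun the failing binary in the same shell, for example:
#   ./IOR
```

If IOR runs cleanly with openib excluded, the crash is likely in the openib connection setup on the SR-IOV virtual function rather than in IOR itself. Unset OMPI_MCA_btl afterwards to restore the default transport selection.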
________________________________________
From: Pharthiphan Asokan
Sent: Tuesday, March 13, 2018 8:42 PM
To: Open MPI Developers
Subject: RE: [OMPI devel] How to Build OpenMPI to support FDR over SR-IOV
Thanks Jeff,
Open MPI is installed here:
[root@vcn03 C]# cd /mnt/lustre_client/pasokan/openmpi-3.0.0/openmpi-3.0.0/
bin/ etc/ include/ lib/ share/
[root@vcn03 C]#
Why does exporting these variables not take effect?
export PATH=/mnt/lustre_client/pasokan/openmpi-3.0.0/openmpi-3.0.0/bin:$PATH
export LD_LIBRARY_PATH=/mnt/lustre_client/pasokan/openmpi-3.0.0/openmpi-3.0.0/lib:$LD_LIBRARY_PATH
export INCLUDE=/mnt/lustre_client/pasokan/openmpi-3.0.0/openmpi-3.0.0/include:$INCLUDE
But, as you said, providing --prefix
/mnt/lustre_client/pasokan/openmpi-3.0.0/openmpi-3.0.0/ works:
[root@vcn03 C]# mpirun --prefix /mnt/lustre_client/pasokan/openmpi-3.0.0/openmpi-3.0.0/ --allow-run-as-root -np 2 -host vcn03,vcn04 hostname
vcn04
vcn03
[root@vcn03 C]#
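
A likely explanation for the exports not taking effect, hedged because the nodes' shell setup is not shown here: an export typed at the prompt changes only that interactive shell on vcn03, while orted on vcn04 is started through a non-interactive remote shell that sees PATH and LD_LIBRARY_PATH only if a startup file on that node sets them. The sketch below uses a hypothetical helper, add_ompi_env (not part of Open MPI), to add the two exports to a startup file without duplicating them. It assumes bash and that the chosen file (for example ~/.bashrc) is read by non-interactive ssh sessions on these nodes.

```shell
# add_line FILE LINE: append LINE to FILE unless an identical line is already there.
add_line() {
    touch "$1"
    grep -qxF -- "$2" "$1" || printf '%s\n' "$2" >> "$1"
}

# add_ompi_env FILE PREFIX: hypothetical helper that records the Open MPI
# bin/ and lib/ directories under PREFIX in the startup file FILE.
add_ompi_env() {
    add_line "$1" "export PATH=$2/bin:\$PATH"
    # Append the old LD_LIBRARY_PATH only when it is non-empty, so no
    # trailing ":" (which would mean the current directory) is left behind.
    add_line "$1" "export LD_LIBRARY_PATH=$2/lib\${LD_LIBRARY_PATH:+:\$LD_LIBRARY_PATH}"
}
```

On each node this would be called as add_ompi_env ~/.bashrc /mnt/lustre_client/pasokan/openmpi-3.0.0/openmpi-3.0.0; calling it a second time leaves the file unchanged.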
My actual issue, though, is that IOR does not run when compiled with Open MPI
in an SR-IOV environment:
[root@vcn03 C]# pwd
/mnt/lustre_client/pasokan/IOR-July12/src/C
[root@vcn03 C]#
[root@vcn03 C]# export PATH=/mnt/lustre_client/pasokan/openmpi-3.0.0/openmpi-3.0.0/bin:$PATH
[root@vcn03 C]# export LD_LIBRARY_PATH=/mnt/lustre_client/pasokan/openmpi-3.0.0/openmpi-3.0.0/lib:$LD_LIBRARY_PATH
[root@vcn03 C]# export INCLUDE=/mnt/lustre_client/pasokan/openmpi-3.0.0/openmpi-3.0.0/include:$INCLUDE
[root@vcn03 C]#
[root@vcn03 C]# gmake posix mpiio
mpicc -o IOR IOR.o utilities.o parse_options.o \
aiori-POSIX.o aiori-noMPIIO.o aiori-noHDF5.o aiori-noNCMPI.o \
-lm
mpicc -o IOR IOR.o utilities.o parse_options.o \
aiori-POSIX.o aiori-MPIIO.o aiori-noHDF5.o aiori-noNCMPI.o \
-lm
[root@vcn03 C]# ./IOR
--------------------------------------------------------------------------
WARNING: No preset parameters were found for the device that Open MPI
detected:
Local host: vcn03
Device name: mlx5_0
Device vendor ID: 0x02c9
Device vendor part ID: 4114
Default device parameters will be used, which may result in lower
performance. You can edit any of the files specified by the
btl_openib_device_param_files MCA parameter to set values for your
device.
NOTE: You can turn off this warning by setting the MCA parameter
btl_openib_warn_no_device_params_found to 0.
--------------------------------------------------------------------------
[vcn03][[34068,1],0][connect/btl_openib_connect_udcm.c:1235:udcm_rc_qp_to_rtr] error modifing QP to RTR errno says Invalid argument
Segmentation fault
[root@vcn03 C]#
Please help!
________________________________________
From: devel [[email protected]] on behalf of Jeff Squyres
(jsquyres) [[email protected]]
Sent: Tuesday, March 13, 2018 8:20 PM
To: Open MPI Developers List
Subject: Re: [OMPI devel] How to Build OpenMPI to support FDR over SR-IOV
On Mar 13, 2018, at 2:08 AM, Pharthiphan Asokan <[email protected]> wrote:
>
> [root@vcn03 C]# mpirun --allow-run-as-root -np 2 -host vcn03,vcn04 hostname
> bash: orted: command not found
This is the key ^^
These FAQ items may help:
* https://www.open-mpi.org/faq/?category=running#run-prereqs
* https://www.open-mpi.org/faq/?category=running#adding-ompi-to-path
* https://www.open-mpi.org/faq/?category=running#mpirun-prefix
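
As a rough illustration of the prerequisite these items describe (orted must be findable on every node), the sketch below reports whether named commands are on PATH. The function check_on_path is a hypothetical helper, not an Open MPI tool.

```shell
# check_on_path CMD...: print where each command resolves, and return
# non-zero if any of them cannot be found on PATH.
check_on_path() {
    missing=0
    for cmd in "$@"; do
        if found=$(command -v "$cmd"); then
            printf '%s: %s\n' "$cmd" "$found"
        else
            printf '%s: not found on PATH\n' "$cmd" >&2
            missing=1
        fi
    done
    return "$missing"
}
```

Running check_on_path orted mpirun in a login shell shows the interactive view only. To see what a remote, non-interactive shell sees, a one-off check such as ssh vcn04 'command -v orted' performs the same lookup in that shell.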
--
Jeff Squyres
[email protected]
_______________________________________________
devel mailing list
[email protected]
https://lists.open-mpi.org/mailman/listinfo/devel