Re: [OMPI users] [EXTERNAL] OpenMPI 3.1.6 openib failure: "mlx4_0 errno says Success"

2021-10-15 Thread Fischer, Greg A. via users
pen-mpi.org>> Cc: Fischer, Greg A. mailto:fisch...@westinghouse.com>> Subject: Re: [EXTERNAL] [OMPI users] OpenMPI 3.1.6 openib failure: "mlx4_0 errno says Success" [External Email] HI Greg, It's the aging of the openib btl. You may be able to apply the attach

Re: [OMPI users] [EXTERNAL] OpenMPI 3.1.6 openib failure: "mlx4_0 errno says Success"

2021-10-14 Thread Fischer, Greg A. via users
m is no longer supported. You may want to try using the 4.1.1 release, in which case you'll want to use UCX. Howard From: users mailto:users-boun...@lists.open-mpi.org>> on behalf of "Fischer, Greg A. via users" mailto:users@lists.open-mpi.org>> Reply-To: Open MPI Use

Re: [OMPI users] [EXTERNAL] OpenMPI 3.1.6 openib failure: "mlx4_0 errno says Success"

2021-10-14 Thread Fischer, Greg A. via users
lto:fisch...@westinghouse.com>> Subject: Re: [EXTERNAL] [OMPI users] OpenMPI 3.1.6 openib failure: "mlx4_0 errno says Success" [External Email] HI Greg, It's the aging of the openib btl. You may be able to apply the attached patch. Note the 3.1.x release stream is no long

[OMPI users] OpenMPI 3.1.6 openib failure: "mlx4_0 errno says Success"

2021-10-13 Thread Fischer, Greg A. via users
Hello, I have compiled OpenMPI 3.1.6 from source on SLES12-SP3, and I am seeing the following errors when I try to use the openib btl: WARNING: There was an error initializing an OpenFabrics device. Local host: bl1308 Local device: mlx4_0 --