ication, Nathan!
--john
-Original Message-
From: users [mailto:users-boun...@open-mpi.org] On Behalf Of Nathan Hjelm
Sent: Thursday, June 16, 2016 9:56 AM
To: Open MPI Users
Subject: EXT: Re: [OMPI users] "failed to create queue pair" problem, but
settings appear OK
XRC suppo
s, tho I don’t question
> that we were running out of QPs.
>
> --john
>
>
> From: users [mailto:users-boun...@open-mpi.org] On Behalf Of Nathan Hjelm
> Sent: Wednesday, June 15, 2016 2:43 PM
> To: Open MPI Users
> Subject: EXT: Re: [OMPI users] "failed to create queue
oun...@open-mpi.org] On Behalf Of Nathan Hjelm
Sent: Wednesday, June 15, 2016 2:43 PM
To: Open MPI Users
Subject: EXT: Re: [OMPI users] "failed to create queue pair" problem, but
settings appear OK
You ran out of queue pairs. There is no way around this for larger all-to-all
transfers when u
12288
Other suggestions welcome. Hitting a brick wall here. Thanks!
--john
-Original Message-
From: users [mailto:users-boun...@open-mpi.org] On Behalf Of Gus Correa
Sent: Wednesday, June 15, 2016 1:39 PM
To: Open MPI Users
Subject: EXT: Re: [OMPI users] "failed to create queue pa
users-boun...@open-mpi.org] On Behalf Of Sasso, John (GE
Power, Non-GE)
Sent: Wednesday, June 15, 2016 2:35 PM
To: Open MPI Users
Subject: EXT: [OMPI users] "failed to create queue pair" problem, but settings
appear OK
Chuck,
The per-process limits appear fine, including those
...@open-mpi.org] On Behalf Of Gus Correa
Sent: Wednesday, June 15, 2016 1:39 PM
To: Open MPI Users
Subject: EXT: Re: [OMPI users] "failed to create queue pair" problem, but
settings appear OK
Hi John
1) For diagnostic, you could check the actual "per process" limits on the
:35 PM
To: Open MPI Users
Subject: EXT: [OMPI users] "failed to create queue pair" problem, but settings
appear OK
Chuck,
The per-process limits appear fine, including those for the resource mgr
daemons:
Limit Soft Limit Hard Limit Units
M
d to create queue pair" problem, but
settings appear OK
Hi John
1) For diagnostic, you could check the actual "per process" limits on the nodes
while that big job is running:
cat /proc/$PID/limits
2) If you're using a resource manager to launch the job, the resource manage
Hi John
1) For diagnostic, you could check the actual "per process" limits on
the nodes while that big job is running:
cat /proc/$PID/limits
2) If you're using a resource manager to launch the job,
the resource manager daemon/deamons (local to the nodes) may have to
to set the memlock and oth
In doing testing with IMB, I find that running a 4200+ core case with the IMB
test Alltoall, and message lengths of 16..1024 bytes (as per -msglog 4:10 IMB
option), it fails with:
--
A process failed to create a queue pair.
10 matches
Mail list logo