Noam,

Another idea: check for stale files in /dev/shm/ (or a subdirectory that looks like it belongs to UCX/OpenMPI) and SysV shared memory using `ipcs -m`.

Joseph

On 6/20/19 3:31 PM, Noam Bernstein via users wrote:


On Jun 20, 2019, at 4:44 AM, Charles A Taylor <chas...@ufl.edu <mailto:chas...@ufl.edu>> wrote:

This looks a lot like a problem I had with OpenMPI 3.1.2.  I thought the fix was landed in 4.0.0 but you might want to check the code to be sure there wasn’t a regression in 4.1.x.  Most of our codes are still running 3.1.2 so I haven’t built anything beyond 4.0.0 which definitely included the fix.

Unfortunately, 4.0.0 behaves the same.

One thing that I’m wondering if anyone familiar with the internals can explain is how you get a memory leak that isn’t freed when then program ends?  Doesn’t that suggest that it’s something lower level, like maybe a kernel issue?

Noam

____________
|
|
|
*U.S. NAVAL*
|
|
_*RESEARCH*_
|
LABORATORY

Noam Bernstein, Ph.D.
Center for Materials Physics and Technology
U.S. Naval Research Laboratory
T +1 202 404 8628  F +1 202 404 7546
https://www.nrl.navy.mil


_______________________________________________
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

_______________________________________________
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Reply via email to