On Jun 15, 2012, at 8:02 AM, Jeff Squyres wrote:

> Were there any clues in /var/log/messages or dmesg?
> 

Thanks.  I found a suggestion from Nathan Hjelm to add "options mlx4_core 
log_mtts_per_seg=X" (where X is 5 in my case).  
Offline suggestions, which included the same setting, were to add "--mca 
mpi_leave_pinned 0" to the mpirun line and to double-check my locked memory 
limits.
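
For anyone wanting to check the same two system-side settings, here is a
minimal sketch. It assumes a Linux node running bash, and it assumes the
mlx4_core driver exposes log_mtts_per_seg read-only under
/sys/module/mlx4_core/parameters (that path is an assumption and may not
exist on every driver version). It only reports values and changes nothing.

```shell
#!/bin/bash
# Report the locked-memory limit for the current shell.
# The value is in KB, or the string "unlimited".
memlock=$(ulimit -l)
echo "locked memory limit: $memlock"

# Report the mlx4_core setting if the module is loaded and exposes it.
# ASSUMPTION: this sysfs path is where the parameter is readable.
param=/sys/module/mlx4_core/parameters/log_mtts_per_seg
if [ -r "$param" ]; then
    echo "log_mtts_per_seg: $(cat "$param")"
else
    echo "log_mtts_per_seg: not readable (module not loaded?)"
fi
```

The limit seen by an interactive shell can differ from the one the MPI
processes inherit on the compute nodes, so it may be worth running the same
check on the nodes through whatever launcher starts the job.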

The only thing I have found to work reliably is to use "-npernode 32" 
instead of "-npernode 48".  Unfortunately, my system has 48-processor nodes.
I've got lots of headroom on real memory.

Marcus


