I get the following error when trying to run SHMEM programs using UCX. [xiii@shini dir]$ oshrun -n 1 ./target/debug/main [1556046469.890238] [shini:19769:0] sys.c:619 UCX ERROR shmget(size=2097152 flags=0xfb0) for mm_recv_desc failed: Operation not permitted, please check shared memory limits by 'ipcs -l' [1556046469.895859] [shini:19769:0] sys.c:619 UCX ERROR shmget(size=2097152 flags=0xfb0) for mm_recv_desc failed: Operation not permitted, please check shared memory limits by 'ipcs -l' [1556046469.899577] [shini:19769:0] sys.c:619 UCX ERROR shmget(size=270532608 flags=0xfb0) for user allocation failed: Operation not permitted, please check shared memory limits by 'ipcs -l'
As far as I can tell, the programs I'm running are not actually resource constrained, and `ipcs -l` seems to indicate there's plenty of available shared memory. Other than this error, my code seems to run normally. [xiii@shini dir]$ ipcs -l ------ Messages Limits -------- max queues system wide = 32000 max size of message (bytes) = 8192 default max size of queue (bytes) = 16384 ------ Shared Memory Limits -------- max number of segments = 4096 max seg size (kbytes) = 18014398509465599 max total shared memory (kbytes) = 18014398509481980 min seg size (bytes) = 1 ------ Semaphore Limits -------- max number of arrays = 32000 max semaphores per array = 32000 max semaphores system wide = 1024000000 max ops per semop call = 500 semaphore max value = 32767 Do you know what's causing this / if I need to worry about it / how I can fix this error? Thanks, Ben
_______________________________________________ users mailing list users@lists.open-mpi.org https://lists.open-mpi.org/mailman/listinfo/users