Re: [OMPI users] Error on running large number of processes

2007-11-15 Thread Pak Lui
I am assuming all the processes are running on a single SMP? Not sure if you have tried it but you may want to set the mpool_sm_max_size to something other than the default 512MB, since you seem to be using shared memory? Jeff Squyres wrote: My guess is that this is similar to the last post: y

Re: [OMPI users] Error on running large number of processes

2007-11-15 Thread Jeff Squyres
My guess is that this is similar to the last post: you are oversubscribing the nodes so heavily that the OS is running out of some resources (perhaps regular or registered memory?) such that Open MPI is unable to setup its network transport layers properly. On Nov 15, 2007, at 6:35 AM, Cle

[OMPI users] Error on running large number of processes

2007-11-15 Thread Clement Kam Man Chu
Hi, I am using openmpi 1.2.3 under ia64 machine and uses pbs job scheduler. I can successfully run 100 processes on 16 cpus, but I got an error If run 200 processes on the same number of cpus. The error is : PML add procs failed --> Returned "Temporarily out of resource" (-3) instead o