Re: [OMPI users] jobs with more that 2, 500 processes will not even start

2010-12-14 Thread Ralph Castain
It applies to both. In the rsh/ssh launcher, there is a limit on how many concurrent ssh/rsh sessions we have open at any one time. This is required due to OS limitations. As each daemon completes its launch, it "daemonizes" and closes the ssh/rsh session, thus enabling another daemon to be

Re: [OMPI users] jobs with more that 2, 500 processes will not even start

2010-12-14 Thread Gilbert Grosdidier
Bonjour Ralph, I wonder : is this plm_rsh_num_concurrent parameter standing ONLY for rsh use, or for ssh OR rsh, depending on plm_rsh_agent, please ? Thanks, Best, G. Le 14/12/2010 18:30, Ralph Castain a écrit : That's a big cluster to be starting with rsh! :-) When you say it won't

Re: [OMPI users] jobs with more that 2, 500 processes will not even start

2010-12-14 Thread John Hearns
On 14 December 2010 17:32, Lydia Heck wrote: > > I have experimented a bit more and found that if I set > > OMPI_MCA_plm_rsh_num_concurrent=1024 > > a job with more than 2,500 processes will start and run. > > However when I searched the open-mpi web site for the the

Re: [OMPI users] jobs with more that 2, 500 processes will not even start

2010-12-14 Thread Lydia Heck
I have experimented a bit more and found that if I set OMPI_MCA_plm_rsh_num_concurrent=1024 a job with more than 2,500 processes will start and run. However when I searched the open-mpi web site for the the variable I could not find any indication. Best wishes, Lydia Heck 15. jobs