Re: [OMPI users] vfs_write returned -14

2009-06-20 Thread Josh Hursey
On Jun 20, 2009, at 1:48 PM, Kritiraj Sajadah wrote: Hi Josh, Thank you for the email. I can now checkpoint the application on the cluster using OPEN MPI. But I am now facing another problem. When i tried restarting the checkpoint, nothing happens. I copied the checkpoint

Re: [OMPI users] vfs_write returned -14

2009-06-20 Thread Kritiraj Sajadah
Hi Josh, Thank you for the email. I can now checkpoint the application on the cluster using OPEN MPI. But I am now facing another problem. When i tried restarting the checkpoint, nothing happens. I copied the checkpoint file to the $HOME directory and tried restarting it there and

Re: [OMPI users] Machinefile option in opempi-1.3.2

2009-06-20 Thread Rajesh Sudarsan
Thanks Ralph. It worked. Regards, Rajesh On Sat, Jun 20, 2009 at 10:28 AM, Ralph Castain wrote: > Ah, yes - that is definitely true. What you need to use is the "seq" (for > "sequential") mapper. Do the following on your cmd line: > --hostfile hostfile -mca rmaps seq > This

Re: [OMPI users] Machinefile option in opempi-1.3.2

2009-06-20 Thread Ralph Castain
Ah, yes - that is definitely true. What you need to use is the "seq" (for "sequential") mapper. Do the following on your cmd line: --hostfile hostfile -mca rmaps seq This will cause OMPI to map the process ranks according to the order in the hostfile. You need to specify one line for each

Re: [OMPI users] Bug in 1.3.2?: sm btl and isend is serializes

2009-06-20 Thread Mark Bolstad
Thanks, that at least explains what is going on. Because I have an unbalanced work load (at least for now) I assume that I'll need to poll. If I replace the compositor loop with the following, it appears that I prevent the serialization/starvation and service the servers equally. I can think of

Re: [OMPI users] Machinefile option in opempi-1.3.2

2009-06-20 Thread Rajesh Sudarsan
Hi Ralph, Thanks for the reply. The default mapper does round-robin assignment as long as I do not specify the machinefile in the following format: n1 n2 n2 n1where, n1 and n2 are two nodes in the cluster and I use two slots within each node. I have pasted the output and the display map