On Jun 20, 2009, at 1:48 PM, Kritiraj Sajadah wrote:
Hi Josh,
Thank you for the email. I can now checkpoint the
application on the cluster using OPEN MPI. But I am now facing
another problem.
When i tried restarting the checkpoint, nothing happens. I copied
the checkpoint
Hi Josh,
Thank you for the email. I can now checkpoint the application on the
cluster using OPEN MPI. But I am now facing another problem.
When i tried restarting the checkpoint, nothing happens. I copied the
checkpoint file to the $HOME directory and tried restarting it there and
Thanks Ralph. It worked.
Regards,
Rajesh
On Sat, Jun 20, 2009 at 10:28 AM, Ralph Castain wrote:
> Ah, yes - that is definitely true. What you need to use is the "seq" (for
> "sequential") mapper. Do the following on your cmd line:
> --hostfile hostfile -mca rmaps seq
> This
Ah, yes - that is definitely true. What you need to use is the "seq" (for
"sequential") mapper. Do the following on your cmd line:
--hostfile hostfile -mca rmaps seq
This will cause OMPI to map the process ranks according to the order in the
hostfile. You need to specify one line for each
Thanks, that at least explains what is going on. Because I have an
unbalanced work load (at least for now) I assume that I'll need to poll. If
I replace the compositor loop with the following, it appears that I prevent
the serialization/starvation and service the servers equally. I can think of
Hi Ralph,
Thanks for the reply. The default mapper does round-robin assignment
as long as I do not specify the machinefile in the following format:
n1
n2
n2
n1where, n1 and n2 are two nodes in the cluster and I use two
slots within each node.
I have pasted the output and the display map