So I can point you to some of the work that I did while at Indiana University
to support process migration in Open MPI in a coordinated manner. This should
introduce you to some of the internal pieces that fit together to provide this
support.
The transparent C/R in Open MPI webpage from IU is
Thanks for the reply and don't worry about the delay.
Yeah, I suppose it wouldn't be easy :(.
But my final goal is what you are mentioning: to stop one particular
process (previously checkpointed) and then migrate it to another place (node,
core, slot, etc.) and restart it there, but without mak
Many thanks for this contribution!
A few points:
1. This is a lengthy contribution; it's a bit more than a "trivial" patch that
we could include in the mainline without worrying about intellectual property.
:-( Can you officially release this code under the BSD license, and/or sign
the Open
Sorry for the delay; you wrote while many of us were on vacation and we're just
now starting to catch up on past mails...
I'm not entirely sure what you're trying to do. It sounds like you're trying
to replace one process with another. That's quite complicated; there will be a
lot of changes
MPI_Comm_create_keyval and MPI_Comm_free_keyval are the functions you should
use in order to be MPI 2.2 compliant.
Based on my understanding of the MPI standard, your application is incorrect,
and therefore the MPICH behavior is incorrect. The delete function is not there
for you to delete the
I have a problem finishing the port of ROMIO to Open MPI. It is
related to the routines MPI_Comm_dup together with MPI_Keyval_create,
MPI_Keyval_free, MPI_Attr_get and MPI_Attr_put.
Here is a simple program that reproduces my problem:
===
#include
There are several modules providing collective communications. The default one
in OMPI is tuned, and can be found in ompi/mca/coll/tuned. Each collective has
several implementations, and there is a decision function to switch between
them.
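To give a rough idea of what such a decision function looks like, here is a
minimal sketch. It is not the code in ompi/mca/coll/tuned: the algorithm
names, the function names and both thresholds are hypothetical and chosen
only for illustration. The real component picks its cut-over points
differently.

```c
#include <stddef.h>

/* Hypothetical algorithm identifiers (assumption: not Open MPI's names). */
typedef enum {
    BCAST_LINEAR,        /* root sends to every rank in turn */
    BCAST_BINOMIAL_TREE, /* logarithmic fan-out, suits small messages */
    BCAST_PIPELINE       /* segmented chain, suits large messages */
} bcast_algorithm_t;

/* Illustrative thresholds only (assumption: not Open MPI's values). */
#define SMALL_COMM_SIZE     4
#define LARGE_MESSAGE_BYTES ((size_t)128 * 1024)

/* Pick a broadcast algorithm from communicator size and message size. */
bcast_algorithm_t bcast_decide(int comm_size, size_t message_bytes)
{
    if (comm_size <= SMALL_COMM_SIZE) {
        return BCAST_LINEAR;
    }
    if (message_bytes >= LARGE_MESSAGE_BYTES) {
        return BCAST_PIPELINE;
    }
    return BCAST_BINOMIAL_TREE;
}

/* Human-readable name, handy when logging which branch was taken. */
const char *bcast_algorithm_name(bcast_algorithm_t algorithm)
{
    switch (algorithm) {
    case BCAST_LINEAR:        return "linear";
    case BCAST_BINOMIAL_TREE: return "binomial tree";
    case BCAST_PIPELINE:      return "pipeline";
    }
    return "unknown";
}
```

With these made-up thresholds, bcast_decide(64, 1024) selects the binomial
tree and bcast_decide(64, 1 << 20) selects the pipeline. For a simulation,
the point is that the algorithm actually used depends on both inputs, so
the decision has to be modelled as well as the algorithms themselves.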
george.
On Jan 6, 2011, at 06:13 , Zaibi Eya wrote:
Hi everybody,
I am working on simulating the behaviour of collective operations and I
must understand the code (the algorithms) of bcast.c, scatter.c. But
unfortunately, I didn't find the algorithm implementations of broadcast,
scatter, gather and the architecture of the framework i