Leonardo,
The CRCP 'coord' component implements the bookmark exchange. I store
the message signatures for the bookmark exchange. Since I am
implementing this above the point-to-point stack in Open MPI (PML) I
need to keep track of this message information to implement post-
checkpoint reso
Josh,
Some time ago I was studying CRCP component, I´m not sure, but I
remember that this component is used for bookmark exchange. You store
these informations exactly for this (bookmark exchange)? After a
successfully checkpoint operation you can free this memory?
Thanks,
Leonardo
Josh Hur
Leonardo,
You are exactly correct. The CRCP module/component will grow the
application size probably for every message that you send or receive.
This is because the CRCP component tracks the signature {data_size,
tag, communicator, peer} (*not* the contents of the message) of every
messag
Hi All,
I made some tests with a dummy "ping" application. Some memory problems
occurred. On these tests I obtained the following results:
1) OpenMPI (without FT):
- delaying 1 second to send token to other node: orted and
application size stable;
- delaying 0 seconds to send token to o