Hi Nausca,
Thank you for your reply!
Let me see if I understood correctly. Using an older version (1.x) my
system will be capable of to create non-coordinated checkpoints among
processes. Then, if I run:
- <dmtcp command> mpirun -np 8 ./test - where each process executes on a
different machine
I will have each one of that process creating a checkpoint, ok?
> But in this case, you have to link your source code with dmtcp library so
files.
How could I do that? I will have to use the function dmtcp Checkpoint()
into the application code?
Thanks a lot!
Edson
2015-10-07 11:28 GMT+02:00 Nausca Hsu <[email protected]>:
> Hi,
> You have to find old version of dmtcp (1.x). At that version, no
> coordinator is required.
> I am working on this now.
> To make latest version run as a single process and no coordinator needed.
> And no dmtcp_launch needed neither.
>
> But in this case, you have to link your source code with dmtcp library so
> files.
>
> Thanks.
> Nausca.
>
> From: Edson Tavares de Camargo <[email protected]>
> Date: 2015年10月7日 星期三 16:32
> To: "Sourceforge. Net Dmtcp-Forum@Lists." <
> [email protected]>
> Subject: [Dmtcp-forum] Uncoordinated checkpoint for MPI
>
> Hi Everyone!
>
> This is my first contact with DMTCP. I'm a phd student and I'm working on
> a message logging protocol for MPI. I'm using OpenMPI for implementing my
> proposal. I have read the DMTCP documentation and I have few questions. But
> first of all, I will tell you why I would like to use a checkpoint tool:
>
> - My message logging protocol supposes that processes create checkpoints
> on a uncoordinated approach. Each process creates a checkpoint
> independently of other. There will be no coordination among the processes.
>
> - For now, I am not worried about a process recovery. This will be part
> of a next phase of my work.
>
> Now my questions about DMTCP.
>
> - There is a coordinator. It is responsible for starting the checkpoints
> on the other processes, right? DMTCP follows a coordinated checkpoint
> approach and creates a consistent global state, ok?
>
> - Would be possible to use DMTCP, or DMTCP plugin, in order to implement a
> uncoordinated checkpoint? In this moment just take checkpoint independently
> on each process.
>
> Thank you in advance!
>
> Edson
>
------------------------------------------------------------------------------
Full-scale, agent-less Infrastructure Monitoring from a single dashboard
Integrate with 40+ ManageEngine ITSM Solutions for complete visibility
Physical-Virtual-Cloud Infrastructure monitoring from one console
Real user monitoring with APM Insights and performance trend reports
Learn More http://pubads.g.doubleclick.net/gampad/clk?id=247754911&iu=/4140
_______________________________________________
Dmtcp-forum mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum