On Sat, Nov 15, 2014 at 12:09 AM, Pushpendra Singh <[email protected]>
wrote:

> Dear Kapil,
>
> I went through the paper and here is my understanding of it: It
> essentially block all the processes till the checkpointing is complete
> which ensures that there are no orphan messages in the saved state. Am I
> correct?
>

That is correct.


>
> We are working on developing non-blocking or partial blocking checkpoint
> metho <http://link.springer.com/chapter/10.1007%2F978-3-642-45249-9_9>ds;
> I was looking for a way to implement our algorithms. I believe that for the
> same, I will have to change the code of DMTCP in order to stop blocking.
>

That sounds interesting. Please let us know if you need any pointers, etc.
and I would be glad to help out.

Kapil


>
>
> Best Regards,
> Pushpendra Singh
>
>
> http://www.iiitd.edu.in/~pushpendra
>
> On Thu, Nov 13, 2014 at 10:41 PM, Kapil Arya <[email protected]>
> wrote:
>
>> Hi Pushpendra,
>>
>> DMTCP ensures a consistent state of the distributed computations by the
>> means of a centralized coordinator and network draining.  You can find more
>> information in this paper: http://dmtcp.sourceforge.net/papers/dmtcp.pdf.
>>
>> Please let me know if you need any more information.
>>
>> Kapil
>>
>>
>> On Thu, Nov 13, 2014 at 12:09 AM, Pushpendra Singh <[email protected]>
>> wrote:
>>
>>> Dear DMTCP team,
>>>
>>>
>>>
>>> Greetings! We are trying to use DMTCP for our work in coordinated
>>> checkpointing. I have few queries in this regard with respect to
>>> distributed applications (where multiple processes running on separate
>>> nodes are communicating with each other and failure of any node means that
>>> all nodes need to be restarted from the last checkpoint):
>>>
>>>
>>>
>>> 1.       Does DMTCP checkpoints ensure consistent state, i.e. there are
>>> no orphan messages?
>>>
>>> 2.       If yes to the previous question then what is the underlying
>>> algorithm for coordinated checkpointing in DMTCP? A link to corresponding
>>> research paper will help.
>>>
>>> 3.       If not then how can I integrate my “coordinated chekcpointing”
>>> algorithm within DMTCP i.e. using DMTCP for mainly storing the checkpoint
>>> while my algorithm ensures consistency.
>>>
>>>
>>>
>>>
>>>
>>> Thanks.
>>>
>>>
>>>
>>> Best,
>>>
>>> Pushpendra
>>>
>>> [image: logo]
>>>
>>> http://www.iiitd.edu.in/~pushpendra/
>>>
>>>
>>>
>>>
>>> ------------------------------------------------------------------------------
>>> Comprehensive Server Monitoring with Site24x7.
>>> Monitor 10 servers for $9/Month.
>>> Get alerted through email, SMS, voice calls or mobile push notifications.
>>> Take corrective actions from your mobile device.
>>>
>>> http://pubads.g.doubleclick.net/gampad/clk?id=154624111&iu=/4140/ostg.clktrk
>>> _______________________________________________
>>> Dmtcp-forum mailing list
>>> [email protected]
>>> https://lists.sourceforge.net/lists/listinfo/dmtcp-forum
>>>
>>>
>>
>
------------------------------------------------------------------------------
Comprehensive Server Monitoring with Site24x7.
Monitor 10 servers for $9/Month.
Get alerted through email, SMS, voice calls or mobile push notifications.
Take corrective actions from your mobile device.
http://pubads.g.doubleclick.net/gampad/clk?id=154624111&iu=/4140/ostg.clktrk
_______________________________________________
Dmtcp-forum mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum

Reply via email to