[OMPI devel] RFC: CRS Module for MTCP Checkpointing Package (Revised)

2011-10-12 Thread Alex Brick
WHAT: Bring in the mtcp CRS component WHY: Add support for the MTCP checkpoint/restart service WHERE: opal/mca/crs/mtcp TIMEOUT: Tuesday teleconf, 2011-10-18 (about 1 week from now) --- What is MTCP? MTCP (MultiThreaded CheckPointing; http://dmtcp.sourc

Re: [OMPI devel] RFC: CRS Module for MTCP Checkpointing Package

2011-10-07 Thread Alex Brick
d a new single-process >>> checkpoint-restart mechanism (MTCP), to the ones already provided in Open >>> MPI. However, most of the text in your RFC is about DMTCP, which is another >>> layer on top of MTCP capable of checkpoint/restarting distributed >>> appl

Re: [OMPI devel] RFC: CRS Module for MTCP Checkpointing Package

2011-10-07 Thread Alex Brick
;> application. >> >> I would like to understand what this RFC is really about: MTCP or DMTCP? >> >>  george. >> >> On Oct 6, 2011, at 02:58 , Alex Brick wrote: >> >>> WHAT: Bring in the mtcp CRS component >>> >>

[OMPI devel] RFC: CRS Module for MTCP Checkpointing Package

2011-10-06 Thread Alex Brick
WHAT: Bring in the mtcp CRS component WHY: Add support for the MTCP checkpoint/restart service WHERE: opal/mca/crs/mtcp TIMEOUT: Tuesday teleconf, 2011-10-18 (about 2 weeks from now) --- What is MTCP? DMTCP (Distributed MultiThreaded CheckPointing, htt