We move our DMTCP-generated checkpoints to a new directory before
restarting them, and set DMTCP_RESTART_DIR to point at the new
location.  DMTCP 1.2.7 doesn't seem to like this: it insists on
writing the ckpt_BINNAME_*.ckpt file back to the original
directory, going so far as to try to recreate that directory if
it's missing.  If it can't write the file there, DMTCP exits
with status 99 the next time it tries to checkpoint.

This kills our ability to use DMTCP under HTCondor.  Is this a
known issue?  Is there a fix or workaround?

-- 
Alan De Smet                 Center for High Throughput Computing
[email protected]                       http://chtc.cs.wisc.edu

