We're moving our DMTCP generated checkpoints around before restarting them. We use DMTCP_RESTART_DIR to point things at the new directory. DMTCP 1.2.7 doesn't seem to like this; it really, really wants to write the ckpt_BINNAME_*.ckpt file back in the original directory, going so far as to try to recreate the containing directory if it's missing. If it fails to write that file in the original, DMTCP exits with 99 when it next tries to checkpoint.
This kills our ability to use DMTCP under HTCondor. Is this a known issue? Is there a fix or workaround? -- Alan De Smet Center for High Throughput Computing [email protected] http://chtc.cs.wisc.edu ------------------------------------------------------------------------------ This SF.net email is sponsored by Windows: Build for Windows Store. http://p.sf.net/sfu/windows-dev2dev _______________________________________________ Dmtcp-forum mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dmtcp-forum
