Hi,

I have been working with DMTCP quite a lot recently and have encountered
several issues which were resolved by your excellent support.
Having encountered another problem I hope to benefit from this support once
more.

Currently I am working on a system that should robustly checkpoint a
process that runs on a virtual machine.
This is done by checkpointing the application and through the usage of the
plugin system with which I checkpoint the VM itself. (This is via a
"system" call, which I appreciate you have implemented!)
This is currently working but not completely.
My process:

Start a simple python counter and let it checkpoint every 10 minutes.
After a checkpoint kill the VM and reboot it from the VM checkpoint.
This works perfectly.
But when the restarted process takes a new checkpoint and I want to restart
it, it fails.
The error code 99 is returned which indicates a problem within DMTCP.
After having recompiled to enable debug information I have traced the
problem to this area:

[40000] TRACE at pid.cpp:92 in openOriginalToCurrentMappingFiles;
REASON='Open dmtcpPidMapFile'
     pidMapFile.str() = /tmp/dmtcp-root@ip-10-194-33-122
/dmtcpPidMap.13a71f78f7523f34-40000-514f32c7.514f57098
[40000] ERROR at pid.cpp:80 in openSharedFile; REASON='JASSERT(false)
failed'
     name = /tmp/dmtcp-root@ip-10-194-33-122
/dmtcpPidMap.13a71f78f7523f34-40000-514f32c7.514f57098
     strerror((*__errno_location ())) = No such file or directory
Message: Cannot open file
python2.7 (40000): Terminating...

It is true that when I look at the designated folder I can't find the file.
But my first restart works and the pid.cpp:92 result is:

[40000] TRACE at pid.cpp:92 in openOriginalToCurrentMappingFiles;
REASON='Open dmtcpPidMapFile'
     pidMapFile.str() = /tmp/dmtcp-root@ip-10-194-33-122
/dmtcpPidMap.13a71f78f7523f34-40000-514f32c7.514f56bb5
[40000] TRACE at virtualidtable.h:241 in writeMapsToFile; REASON='Write
Maps to file'
     mapFile = /tmp/dmtcp-root@ip-10-194-33-122
/dmtcpPidMap.13a71f78f7523f34-40000-514f32c7.514f56bb5

Now as well I can't find the file on my filesystem.

I hope you can enlighten me.

Thanks in advance,

Robin Staes

PS: Sorry for the mail bombardment to Kapil, I did not have the nerve to
send another one :)
------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_d2d_mar
_______________________________________________
Dmtcp-forum mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum

Reply via email to