Implement Checkpointing service in Hama
---------------------------------------
Key: HAMA-557
URL: https://issues.apache.org/jira/browse/HAMA-557
Project: Hama
Issue Type: New Feature
Components: bsp
Affects Versions: 0.6.0
Reporter: Suraj Menon
Assignee: Suraj Menon
Fix For: 0.6.0
Implement checkpointing service in Apache Hama. My patches for HAMA-533 and
HAMA-534 are blocked on this.
- Checkpointing should happen either when messages are sent or when they are
received. I prefer checkpointing on receipt, since with asynchronous messages
it can overlap with communication and gain some parallelism. Please comment if
you disagree.
- BSPMaster should hold the checkpoint status for each task. The checkpoint
status includes the superstep count and information about the files for which
checkpointing is complete.
- MessageManager should notify Checkpointer of each new message at the BSPPeer.
- Implement/Reuse MessageBundle class as splitClass in BSPPeerImpl for recovery
in initInput.
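
To make the proposal above easier to discuss, here is a rough sketch of how the pieces could fit together. It is only an illustration under assumptions: StatusTable stands in for the state BSPMaster would hold, MessageListener for the notification MessageManager would send, and the checkpoint path layout is made up. None of these names or signatures are taken from the Hama code base, and the real Checkpointer would write to the DFS instead of just clearing a buffer.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch only; these types are NOT Hama's actual API.
final class CheckpointSketch {

  /** Per-task status: last superstep fully checkpointed and the file holding it. */
  static final class CheckpointStatus {
    final long superstep;
    final String file;

    CheckpointStatus(long superstep, String file) {
      this.superstep = superstep;
      this.file = file;
    }
  }

  /** Stand-in for the per-task checkpoint status that BSPMaster would hold. */
  static final class StatusTable {
    private final Map<String, CheckpointStatus> byTask = new ConcurrentHashMap<>();

    /** Records a completed checkpoint, keeping the newest superstep per task. */
    void reportComplete(String taskId, long superstep, String file) {
      byTask.merge(taskId, new CheckpointStatus(superstep, file),
          (old, neu) -> neu.superstep >= old.superstep ? neu : old);
    }

    /** Returns the latest completed checkpoint for the task, or null if none. */
    CheckpointStatus latest(String taskId) {
      return byTask.get(taskId);
    }
  }

  /** Assumed callback that the message manager invokes for each received message. */
  interface MessageListener {
    void onMessageReceived(String message);
  }

  /** Peer-side checkpointer: buffers received messages until the superstep ends. */
  static final class Checkpointer implements MessageListener {
    private final String taskId;
    private final StatusTable master;
    private final List<String> buffer = new ArrayList<>();

    Checkpointer(String taskId, StatusTable master) {
      this.taskId = taskId;
      this.master = master;
    }

    @Override
    public synchronized void onMessageReceived(String message) {
      buffer.add(message);
    }

    /** Flushes the buffer, reports completion, and returns the message count. */
    synchronized int completeSuperstep(long superstep) {
      int written = buffer.size();
      // Assumed path layout; a real implementation would persist the buffer here.
      String file = "checkpoint/" + taskId + "/" + superstep;
      buffer.clear();
      master.reportComplete(taskId, superstep, file);
      return written;
    }
  }
}
```

On recovery, a restarted task could ask the table for its latest completed superstep and read the messages back from the recorded file, which is where the MessageBundle/initInput point above would come in.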