Implement Checkpointing service in Hama
---------------------------------------

                 Key: HAMA-557
                 URL: https://issues.apache.org/jira/browse/HAMA-557
             Project: Hama
          Issue Type: New Feature
          Components: bsp
    Affects Versions: 0.6.0
            Reporter: Suraj Menon
            Assignee: Suraj Menon
             Fix For: 0.6.0


Implement checkpointing service in Apache Hama. My patches for HAMA-533 and 
HAMA-534 are blocked on this.
- Checkpointing should be done as messages are either sent or received. I 
prefer while receiving messages, as we can achieve some parallelism with 
asynchronous messages. Please comment if you differ.
- BSPMaster should hold the checkpoint status for each task. Checkpoint status 
includes superstep count and file information for which checkpointing is 
complete
- MessageManager should notify Checkpointer of a new message at BSPPeer.
- Implement/Reuse MessageBundle class as splitClass in BSPPeerImpl for recovery 
in initInput.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to