Eli Reisman created GIRAPH-574:
----------------------------------

             Summary: Move Giraph Master node functionality to AppMaster or 
launch directly from AppMaster in YARN profile
                 Key: GIRAPH-574
                 URL: https://issues.apache.org/jira/browse/GIRAPH-574
             Project: Giraph
          Issue Type: Improvement
            Reporter: Eli Reisman
            Priority: Minor


As folks read the Giraph on YARN code, it is inevitable that it will occur to
someone: "Well, if the job fails when the ApplicationMaster fails, could we move
some or all of our Master task functions there and just call it master?"

Yes. In two ways.

One, we launch a dedicated master process marked as such with setup 
responsibilities, and we assess from the app master how the launch went. We 
keep launching "masters" until one takes. Then, we launch the workers.

Another is to simply run MasterThread and associated stuff from the App Master 
directly, and only when we know it's up and running properly does the app 
master launch the workers.
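
A rough sketch of this second option, again with hypothetical names throughout:
masterSetup is a stand-in for whatever MasterThread does at startup, and the
readiness latch and timeout are assumptions of mine, not anything in the
current code.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

// Hypothetical sketch: none of these names come from Giraph or YARN.
final class InProcessMasterLaunch {

    /**
     * Runs masterSetup on its own thread inside the app master process and
     * launches workers only if setup finishes within timeoutMillis.
     */
    static boolean startMasterThenWorkers(Runnable masterSetup,
                                          Runnable launchWorkers,
                                          long timeoutMillis) {
        CountDownLatch masterReady = new CountDownLatch(1);
        Thread master = new Thread(() -> {
            masterSetup.run();       // if this throws, the latch never opens
            masterReady.countDown(); // signal "up and running"
            // a real master would keep coordinating the job from here
        }, "in-process-master");
        master.setDaemon(true);
        master.start();
        try {
            if (!masterReady.await(timeoutMillis, TimeUnit.MILLISECONDS)) {
                return false; // master never came up, so no workers
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
        launchWorkers.run();
        return true;
    }
}
```

The trade-off against the first option is that a master failure here happens
inside the app master process itself, so recovery depends on the app master
being restarted rather than on launching another master container.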

The YARN app master can be rebooted and is designed to be a place for 
fault-tolerant "master node" stuff to happen. However, I think a larger purpose 
is to act as a meta-master for launching a DAG of jobs within the run of a 
single app master lifecycle. Or the app master can act as any of these things, 
or something else I haven't thought of. The architecture is fairly malleable.

This is not a requirement for us, and maybe not a good idea at all. This is 
just a placeholder JIRA to discuss and collect ideas since, as I said above, 
someone is going to bring it up ;)

Thank you for reading.

