[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13096628#comment-13096628
 ] 

Vinod Kumar Vavilapalli commented on MAPREDUCE-2749:
----------------------------------------------------

Thanks for the patch Thomas!

One minor comment I have with this patch is the setting of keys via the event. 
While this works, my concern is that if we do it this way, by the time 
{{NodeManager.start()}} returns, the server isn't still guaranteed to be ready 
for service. I think we can do this another way. If we move 
{{ContainerTokenSecretManager}} to the top level {{NodeManager}} composite, we 
can set the keys as the last step in {{NodeManager.start()}}. Thoughts?

Also, we can add a test for this - mock {{rpc.getServer()}} to throw an 
exception and verify that ResourceManager doesn't have any registered NM.

> [MR-279] NM registers with RM even before it starts various servers
> -------------------------------------------------------------------
>
>                 Key: MAPREDUCE-2749
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2749
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: mrv2
>    Affects Versions: 0.23.0
>            Reporter: Vinod Kumar Vavilapalli
>            Assignee: Thomas Graves
>             Fix For: 0.23.0
>
>         Attachments: MAPREDUCE-2749-v2.patch, MAPREDUCE-2749.patch
>
>
> In case NM eventually fails to start the ContainerManager server because of 
> say a port clash, RM will have to wait for expiry to detect the NM crash.
> It is desirable to make NM register with RM only after it can start all of 
> its components successfully.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to