[ 
https://issues.apache.org/jira/browse/HADOOP-2410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tom White updated HADOOP-2410:
------------------------------

       Resolution: Fixed
    Fix Version/s: 0.17.0
         Assignee: Chris K Wensel
     Release Note: The command "hadoop-ec2 run" has been replaced by 
"hadoop-ec2 launch-cluster <group> <number of instances>", and "hadoop-ec2 
start-hadoop" has been removed since Hadoop is started on instance start up. 
See http://wiki.apache.org/hadoop/AmazonEC2 for details.
     Hadoop Flags: [Incompatible change, Reviewed]
           Status: Resolved  (was: Patch Available)

I've just committed this. Thanks Chris!

I tried out the new scripts and they worked fine. I set the Hadoop version in 
the env file to 0.17.0 so that the scripts pick up the new AMI once it is 
created (after 0.17.0 is released). I also changed the version of Java to 
1.6.0_05.

Chris, could you update the documentation on the wiki page with the changes 
please? It would be worth keeping the instructions for the older scripts around 
on the same page.

> Make EC2 cluster nodes more independent of each other
> -----------------------------------------------------
>
>                 Key: HADOOP-2410
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2410
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: contrib/ec2
>    Affects Versions: 0.16.1
>            Reporter: Tom White
>            Assignee: Chris K Wensel
>             Fix For: 0.17.0
>
>         Attachments: concurrent-clusters-2.patch, 
> concurrent-clusters-3.patch, concurrent-clusters.patch, ec2.tgz
>
>
> The cluster start up scripts currently wait for every node to start before 
> appointing a master (to run the namenode and jobtracker), copying private 
> keys to all the nodes, and writing the private IP address of the master to 
> the hadoop-site.xml file (which is then copied to the slaves via rsync). 
> Only once all of this is done is Hadoop started on the cluster (from the 
> master). This can fail if any node fails to come up, which can happen 
> because EC2 doesn't guarantee that you get a cluster of the size you ask 
> for (I've seen this happen).
> The process would be more robust if each node were told the address of the 
> master as user metadata and then started its own daemons. This is complicated 
> by the fact that the public DNS alias of the master resolves to a public IP 
> address so cannot be used by EC2 nodes (see 
> http://docs.amazonwebservices.com/AWSEC2/2007-08-29/DeveloperGuide/instance-addressing.html).
> Instead we need to use a trick 
> (http://developer.amazonwebservices.com/connect/message.jspa?messageID=71126#71126)
> to find the private IP. What's more, we need to attempt to resolve the 
> private IP in a loop until it is available, since the DNS will only be set up 
> after the master has started.
> This change will also mean the private key doesn't need to be copied to each 
> node, which can be slow and has dubious security. Configuration can be 
> handled using the mechanism described in HADOOP-2409.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
