[ 
https://issues.apache.org/jira/browse/HADOOP-884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12465698
 ] 

Tom White commented on HADOOP-884:
----------------------------------

I've attached a collection of scripts for this feature. It is still rough round 
the edges, and not ready for inclusion yet (indeed they should probalby be 
separate from the hadoop distribution), but the scripts work for me on Mac OS X 
and ubuntu. I've added instructions to the wiki at 
http://wiki.apache.org/lucene-hadoop/AmazonEC2.

There are lots of improvements that could be made. 

 * Create a Hadoop AMI that runs a parameterized launch to set cluster size and 
master hostname. See 
http://docs.amazonwebservices.com/AmazonEC2/dg/2006-10-01/AESDG-chapter-instancedata.html.
 Such an instance would modify the Hadoop config files on startup to reflect 
cluster size and master hostname.
 * Setting up DNS is a pain. We could either automate the DNS configuration 
using DynDNS's webservice 
(https://www.dyndns.com/developers/specs/syntax.html), or do away with having 
to set up DNS altogether.
 * Create a public Hadoop AMI (for each Hadoop version) so people don't need to 
build their own. See 
http://developer.amazonwebservices.com/connect/entry.jspa?entryID=530&ref=featured.
 * Adapt `run-hadoop-cluster` to take the jar containing the MapReduce job as a 
parameter.


> Create scripts to run Hadoop on Amazon EC2
> ------------------------------------------
>
>                 Key: HADOOP-884
>                 URL: https://issues.apache.org/jira/browse/HADOOP-884
>             Project: Hadoop
>          Issue Type: New Feature
>          Components: scripts
>    Affects Versions: 0.10.1
>            Reporter: Tom White
>         Assigned To: Tom White
>         Attachments: hadoop-ec2-v1.tar.gz
>
>
> It is already possible to run Hadoop on Amazon EC2 
> (http://wiki.apache.org/lucene-hadoop/AmazonEC2), however it is a rather 
> involved, largely manual process. By writing scripts to automate (as far as 
> is possible) image creation and cluster launch it will make it much easier to 
> use Hadoop on EC2.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to