Re: Persistent HDFS On EC2

2009-03-12 Thread Steve Loughran
Kris Jirapinyo wrote: Why would you lose the locality of storage-per-machine if one EBS volume is mounted to each machine instance? When that machine goes down, you can just restart the instance and re-mount the exact same volume. I've tried this idea before successfully on a 10 node cluster on

RE: Persistent HDFS On EC2

2009-03-11 Thread Malcolm Matalka
Haha, good to know I might be a guinea pig! -Original Message- From: Kris Jirapinyo [mailto:kris.jirapi...@biz360.com] Sent: Wednesday, March 11, 2009 15:59 To: core-user@hadoop.apache.org Subject: Re: Persistent HDFS On EC2 That was also the starting point for my experiment (Tom

Re: Persistent HDFS On EC2

2009-03-11 Thread Kris Jirapinyo
That was also the starting point for my experiment (Tom White's article). Note that the most painful part about this setup is probably writing and testing the scripts that will enable this to happen (and also customizing your EC2 images). It would be interesting to see someone else try it. On We

Re: Persistent HDFS On EC2

2009-03-11 Thread Adam Rose
Tom White wrote a great blog post about some options here: http://www.lexemetech.com/2008/08/elastic-hadoop-clusters-with-amazons.html plus an Amazon article: http://developer.amazonwebservices.com/connect/entry.jspa?externalID=873&categoryID=112 Regards, - Adam Kris Jirapinyo wrote: Why w

Re: Persistent HDFS On EC2

2009-03-11 Thread Kris Jirapinyo
Why would you lose the locality of storage-per-machine if one EBS volume is mounted to each machine instance? When that machine goes down, you can just restart the instance and re-mount the exact same volume. I've tried this idea before successfully on a 10 node cluster on EC2, and didn't see any

RE: Persistent HDFS On EC2

2009-03-11 Thread Malcolm Matalka
@hadoop.apache.org Subject: Re: Persistent HDFS On EC2 Malcolm Matalka wrote: > If this is not the correct place to ask Hadoop + EC2 questions please > let me know. > > > > I am trying to get a handle on how to use Hadoop on EC2 before > committing any money to it. My question is

Re: Persistent HDFS On EC2

2009-03-11 Thread Steve Loughran
Malcolm Matalka wrote: If this is not the correct place to ask Hadoop + EC2 questions please let me know. I am trying to get a handle on how to use Hadoop on EC2 before committing any money to it. My question is, how do I maintain a persistent HDFS between restarts of instances. Most of th

Persistent HDFS On EC2

2009-03-11 Thread Malcolm Matalka
If this is not the correct place to ask Hadoop + EC2 questions please let me know. I am trying to get a handle on how to use Hadoop on EC2 before committing any money to it. My question is, how do I maintain a persistent HDFS between restarts of instances. Most of the tutorials I have found i