On Mon, Jan 18, 2010 at 10:20 AM, Grant Ingersoll <[email protected]> wrote:
>> >> I wonder if the CDH2 ami's could be used as a starting point? Not sure >> if you're allowed to unbundle and modify public AMI's. It would >> certainly be more difficult to start from scratch. > > I'd prefer to be dependent on the official Apache distro that we use. > Do you mean the distro of Hadoop, or something else? From what I understand the convenience that CDH2 provides is largely based on the launch/management scripts, I agree that it would make sense to replace the actual hadoop distro with something that we use. It is pretty simple to create AMI's from scratch, but I was wondering about getting things set up to auto-launch the various parts of hadoop at boot time and get the configuration right so that they are bound into a single cluster etc. If those sorts of things are trivial or otherwise covered, no need to start from CDH2. Drew
