Hi,

I currently try to add support for a new service in Whirr. My goal is to
automatically deploy stratosphere ( http://stratosphere.eu ) to AWS. As it
is using HDFS as its data source and has a similar architecural concept like
hadoop Whirr seemed like a good match. My experience with using Whirr is
minimal as I just discovered it today (never actively used a cloud provider
before).

As a starting point I used the HBase / Hadoop service because they both have
the concept of specialiced nodes. I quickly made some progress and got it to
a point where it would create the correct configuration files on the nodes.
But when browsing through JIRA I discovered WHIRR-266 which goal is to
further refine the deployment (which seems like a very good idea to me - the
concept that the install script would also start e.g. hdfs was confusing in
the beginning). So my questions now are:

   1. Will the new concept as already commited for voldemort and cassandra
   substancially change or could I use those services as a reference?
   2. Is there an interest in integrating the new service for stratosphere
   into whirr and if yes what would be the process (e.g. add a jira ticket?)

If you want more information on stratosphere I am happy to answer any
questions.

Also thanks for developing such a usefull project!

--moritz

PS: All my changes will be pushed to my fork of the whirr project in my
public github repository (https://github.com/mkaufmann/whirr) - There is
nothing currently, but I will try to push a initial working state soon.

Reply via email to