[
https://issues.apache.org/jira/browse/WHIRR-506?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13239181#comment-13239181
]
Taha Rafiq commented on WHIRR-506:
----------------------------------
Hello,
I am a Masters student at the University of Waterloo in Canada and I am
currently doing research related to large scale data processing and cloud
computing. I've used Whirr for a couple of projects previously and I'm really
interested in working on this idea of bringing EMR to Whirr. What are some of
the steps I can take to convince you guys to accept me as a student for this
project? Also, are there are any potential mentors for this idea who I can
further discuss this idea with?
Thanks.
> Support elastic mapreduce in whirr
> ----------------------------------
>
> Key: WHIRR-506
> URL: https://issues.apache.org/jira/browse/WHIRR-506
> Project: Whirr
> Issue Type: New Feature
> Components: cli, core
> Reporter: David Alves
> Labels: gsoc, gsoc2012, mentor
>
> This is to be an umbrella issue in the path towards having multicloud EMR
> with whirr.
> Some of the things that must happen towards multicloud EMR (as discussed in
> IRC):
> - Hadoop deployment must be "rock solid"
> - Submitting and monitoring an hadoop mapreduce job through whirr
> - distcp from blobstore to hadoop/hbase cluster
> - cli component for job submission and monitoring.
> Some of the things that would be nice to have additionally:
> - pig service
> - hive service
> - sqoop service
> - regular+spot instances in EMR
> - multistage provisioning (different cluster sizes for different phases)
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira