Hi Krish, I completed an MSc project using Hadoop this summer from installation through to programming with the Java Api and then tuning. In all I did about 14 weeks solid with limited unix, server experience and an academic knowledge of Java skills from my Masters course. I got an A ;O)
Along the way I installed Eclipse, got Hadoop to work with it and built a genetic sequence alignment tool. It was hard work but I had a blast. I ran it on a 32 node cluster and got some good speedups. I¹m also interested in developing my skills further and this BigPetStore application seems like a good way to go. Following my course I¹m a trainee db admin for a global investment manager using Sybase. If you want to work on a collaborative project, I am sure I could share my Java skills and knowledge this far if you were happy to share your knowledge too. Why not connect on Linked In ;O) Regards, Chris MacKenzie telephone: 0131 332 6967 email: stu...@chrismackenziephotography.co.uk <http://www.chrismackenziephotography.co.uk/> <http://plus.google.com/+ChrismackenziephotographyCoUk/posts> <http://www.linkedin.com/in/chrismackenziephotography/> From: Krish Donald <gotomyp...@gmail.com> Reply-To: <user@hadoop.apache.org> Date: Friday, 2 January 2015 19:43 To: <user@hadoop.apache.org> Subject: Re: New to this group. I would like to go towards administration side not in development side as I don't know java at all... On Fri, Jan 2, 2015 at 11:37 AM, Jay Vyas <jayunit100.apa...@gmail.com> wrote: > Many demos out there are for the business community... > > For a demonstration of hadoop at a finer grained level, how it's deployed, > packaged, installed and used, for a developer who wants to learn hadoop "the > hard way", > > I'd suggest : > > 1 - Getting Apache bigtop stood up on VMs, and > 2 - running the BigPetStore application , which is meant to demonstrate end to > end building testing and deployment of a hadoop batch analytics system with > mapreduce, pig, and mahout. > > This will also expose you to puppet, gradle, vagrant, all in a big data app > which solves Real world problems like jar dependencies and multiple ecosystem > components. > > Since BPS generates its own data, you don't waste time worrying about > external data sets, Twitter credentials, etc, and can test both on your laptop > and on a 100 node cluster (similar to teragen but for the whole ecosystem). > > Since it features integration tests and tested on Bigtops hadoop distribution, > (which is 100% pure Apache based), it's imo the purest learning source, not > blurred with company specific downloads or branding. > > Disclaimer : Of course I'm biased as I work on it... :) but we've been > working hard to make bigtop easily consumable as a gateway drug to bigdata > processing, and if you have solid linux and Java background, im sure others > would agree it's great place to get immersed in the hadoop ecosystem. > > On Jan 2, 2015, at 1:05 PM, Krish Donald <gotomyp...@gmail.com> wrote: > >> I would like to work on some kind of case studies like I have seen couple on >> Horton works like twitter sentiment analysis, web log analysis etc. >> >> But if somebody can give idea about other case studies which can be worked >> upon and can be put in resume later . >> As I don't have real time project experience. >> >> On Fri, Jan 2, 2015 at 10:33 AM, Ted Yu <yuzhih...@gmail.com> wrote: >>> You can search for Open JIRAs which are related to admin. Here is an example >>> query: >>> >>> https://issues.apache.org/jira/browse/HADOOP-9642?jql=project%20%3D%20HADOOP >>> %20AND%20status%20%3D%20Open%20AND%20text%20~%20%22admin%22 >>> >>> FYI >>> >>> On Fri, Jan 2, 2015 at 10:24 AM, Krish Donald <gotomyp...@gmail.com> wrote: >>>> I have fair understanding of hadoop eco system... >>>> I have setup multinode cluster using VMs in my personal laptop for Hadoop >>>> 2.0 . >>>> But beyond that i would like to work on some project to get a good hold on >>>> the subject. >>>> >>>> I basically would like to go to into Hadoop Administartion side as my >>>> backgroud is RDBMS databases Admnistrator . >>>> >>>> On Fri, Jan 2, 2015 at 10:11 AM, Wilm Schumacher >>>> <wilm.schumac...@gmail.com> wrote: >>>>> Hi, >>>>> >>>>> the "standard" books may be a good start: >>>>> >>>>> I liked the following >>>>> >>>>> definitive guide: >>>>> http://www.amazon.de/Hadoop-Definitive-Guide-Tom-White/dp/1449311520 >>>>> >>>>> hadoop in action: >>>>> http://www.manning.com/lam2/ >>>>> >>>>> hadoop in practive: >>>>> http://www.manning.com/holmes2/ >>>>> >>>>> A list is here: >>>>> http://wiki.apache.org/hadoop/Books >>>>> >>>>> Hope this helps. >>>>> >>>>> Best wishes, >>>>> >>>>> Wilm >>>>> >>>>> Am 02.01.2015 um 19:02 schrieb Krish Donald: >>>>>> > Hi, >>>>>> > >>>>>> > I am new to this group and hadoop. >>>>>> > Please help me to learn hadoop and suggest some self study project . >>>>>> > >>>>>> > Thanks >>>>>> > Krish Donald >>>>> >>>> >>> >>