Hi, I have been using Hadoop loosely since Christmas, and since May for a Software Engineering MSc at Heriot-Watt University in Edinburgh, Scotland. I have written a genetic sequence alignment algorithm.
I have installed Hadoop in various places, including a 32-node cluster, and am using Eclipse Kepler SR2 as an IDE. My current Hadoop version is 2.4.1, which I downloaded as a tar from the Apache mirror servers.

It's been a tough learning curve, but that has made the learning all the more valuable. I believe using the straight Hadoop version has given insights that proprietary builds wouldn't have. There are so many confusing issues that crop up that it's easy to attach importance to trying to fix an error which masks another. With the proprietary versions it would be easy to attach blame to this build or that build when it's not the build's fault.

Go with your heart, but be prepared to work to solve the problems you encounter. Buy Tom White's book; it isn't perfect and it is a couple of years out of date, but it gives you enough detail and structure to build an impression you can work from. The downloadable source code is a great help when trying to get started.

Good luck.

Regards,

Chris MacKenzie
telephone: 0131 332 6967
email: stu...@chrismackenziephotography.co.uk
corporate: www.chrismackenziephotography.co.uk
<http://www.chrismackenziephotography.co.uk/>
<http://plus.google.com/+ChrismackenziephotographyCoUk/posts>
<http://www.linkedin.com/in/chrismackenziephotography/>

From: "Adaryl \"Bob\" Wakefield, MBA" <adaryl.wakefi...@hotmail.com>
Reply-To: <user@hadoop.apache.org>
Date: Thursday, 14 August 2014 01:13
To: <user@hadoop.apache.org>
Subject: Re: Started learning Hadoop. Which distribution is best for native install in pseudo distributed mode?

He didn't ask for the best and nobody framed their answer like that. He asked what people were using. Out of the 10 responses, only four actually answered his question. I've been studying Hadoop for two months straight. Quite frankly, I wish more people would ask for community input on what does what and how.
Adaryl "Bob" Wakefield, MBA
Principal
Mass Street Analytics
913.938.6685
www.linkedin.com/in/bobwakefieldmba
Twitter: @BobLovesData

From: Kilaru, Sambaiah <mailto:sambaiah_kil...@intuit.com>
Sent: Wednesday, August 13, 2014 1:10 PM
To: user@hadoop.apache.org
Subject: Re: Started learning Hadoop. Which distribution is best for native install in pseudo distributed mode?

There are enough wars going on over which is best. Choose one of them and try to learn it; it is not the case that x is better or y is better. It is up to you.

Thanks,
Sam

From: Sebastiano Di Paola <sebastiano.dipa...@gmail.com>
Reply-To: "user@hadoop.apache.org" <user@hadoop.apache.org>
Date: Wednesday, August 13, 2014 at 6:28 PM
To: "user@hadoop.apache.org" <user@hadoop.apache.org>
Subject: Re: Started learning Hadoop. Which distribution is best for native install in pseudo distributed mode?

Hi,

I'm a newbie too and I'm not using any particular distribution. I just download the components I need or want to try for my deployment and use them. It's a slow process, but it allows me to better understand what I'm doing under the hood.

Regards,
Seba

On Tue, Aug 12, 2014 at 10:12 PM, mani kandan <mankand...@gmail.com> wrote:

Which distribution are you people using? Cloudera vs Hortonworks vs BigInsights?