Re: a non-commerial distribution of hadoop ecosystem?

2015-06-01 Thread Demai Ni
Andrew, great to hear that you are also using BigTop. I will surely try it out, to replace my (a little bit) old CDH cluster. :-) cheers Demai On Mon, Jun 1, 2015 at 5:29 PM, Andrew Purtell wrote: > Bigtop, in a nutshell, is a non-commercial multi-stakeholder Apache > project that produces a

Re: a non-commerial distribution of hadoop ecosystem?

2015-06-01 Thread Andrew Purtell
Bigtop, in a nutshell, is a non-commercial multi-stakeholder Apache project that produces a build framework that takes as input source from Hadoop and related big data projects and produces as output OS native packages for installation and management - certainly, a distribution of the Hadoop eco

Re: a non-commerial distribution of hadoop ecosystem?

2015-06-01 Thread jay vyas
I'll reiterate part of what roman said :) Apache BigTop IS the upstream opensource hadoop distribution :) Its the only open distro out there that is actually contributed to and built by people who also are commiter/contributors to the sister projects (hbase, hadoop, spark, and so on). Engineer

Re: a non-commerial distribution of hadoop ecosystem?

2015-06-01 Thread Demai Ni
Chris and Roman, many thanks for the quick response. I will take a look at bigtop. Actually, I heard about it, but thought it is a installation framework, instead of a hadoop distribution. Now I am looking at the BigTop 0.7.0 hadoop instruction, which probably will work fine for my needs. Appreci

Re: a non-commerial distribution of hadoop ecosystem?

2015-06-01 Thread Roman Shaposhnik
On Mon, Jun 1, 2015 at 1:37 PM, Demai Ni wrote: > My question is besides the commercial distributions: CDH(Cloudera) , HDP > (Horton work), and others like Mapr, IBM... Is there a distribution that is > NOT owned by a company? I am looking for something simple for cluster > configuration/install

Re: a non-commerial distribution of hadoop ecosystem?

2015-06-01 Thread Chris Nauroth
Hello Demai, Apache Bigtop is a project that tests and publishes rpm and deb packages for Hadoop ecosystem components. They'll have more details on their own site. http://bigtop.apache.org/ Would this suit your needs? --Chris Nauroth From: Demai Ni mailto:nid...@gmail.com>> Reply-To: "user@h

a non-commerial distribution of hadoop ecosystem?

2015-06-01 Thread Demai Ni
hi, Guys, I have been doing some research/POC using hadoop system. Normally, I either use homebrew on mac for single node installation, or use CDH(Cloudera) for a 3~4 nodes small linux cluster. My question is besides the commercial distributions: CDH(Cloudera) , HDP (Horton work), and others lik

reduce finished container statuses not present at AM

2015-06-01 Thread Grandl Robert
Hi guys, I was running a simple Terasort job with 4 mappers and 2 reducers using Hadoop 3.0.0-Snapshot(trunk). I was analyzing the finished containers reported to AM in RMContainerAllocator.java -> getResources(), but I realized that none of the finished containers for reducers are sent back to

Streaming K-medoids

2015-06-01 Thread Marko Dinic
Hello everyone, I have an idea and I would like to get a validation from community about it. In Mahout there is an implementation of Streaming K-means. I'm interested in your opinion would it make sense to make a similar implementation of Streaming K-medoids? K-medoids has even bigger prob

Hive Metastore Service Startup Fails

2015-06-01 Thread Pratik Gadiya
Hello All, When I try to deploy hortonworks cluster using ambari blueprint APIs, it results in failure while starting up of Hive Metastore service. The same blueprint most of the times works appropriately on the same environment. The parameter which gets changed in the entire blueprint w.r.t h