I'll reiterate part of what Roman said :) Apache Bigtop IS the upstream open source Hadoop distribution :) It's the only open distro out there that is actually contributed to and built by people
who are also committers/contributors to the sister projects (HBase, Hadoop, Spark, and so on).

Engineers at Cloudera, Rackspace, Pivotal, WANdisco, Red Hat, Hortonworks, and many other companies regularly contribute to and advise on its development, although the big vendors have a love/hate relationship with it. It really is the most integrated and robust fully open Hadoop distribution out there.

And it's super easy to test out and play with. Just join the mailing list and ask us where to get started; you can have a working custom Hadoop cluster with Spark or Hadoop or HBase running on multiple VMs in a matter of minutes.

We also curate reference implementations of full-stack applications with idiomatic unit testing / build / data lifecycling that you can use to build "real world" big data applications as well.

See you on #bigtop on IRC or on the mailing list! We're happy to help you get started.

On Mon, Jun 1, 2015 at 4:46 PM, Roman Shaposhnik <r...@apache.org> wrote:

> On Mon, Jun 1, 2015 at 1:37 PM, Demai Ni <nid...@gmail.com> wrote:
> > My question is besides the commercial distributions: CDH(Cloudera) , HDP
> > (Horton work), and others like Mapr, IBM... Is there a distribution that is
> > NOT owned by a company? I am looking for something simple for cluster
> > configuration/installation for multiple components: hdfs, yarn, zookeeper,
> > hive, hbase, maybe Spark. Surely, for a well-experience person(not me),
> > he/she can build the distribution from Apache releases. Well, I am more
> > interested on building application on top of it, and hopefully to find one
> > packed them together.
>
> Apache Bigtop (CCed) aims at delivering a 100% open and
> community-driven distribution of big data management technologies
> around Apache Hadoop. Same as, for example, what Debian is trying
> to do for Linux.
>
> > BTW, I don't need the latest releases like other commercial distribution
> > offered. I am also looking into the ODP(the open data platform), but that
> > project is kind of quiet after the initial Feb announcement.
>
> Feel free to ping me off list if you want more details on ODP.
>
> Thanks,
> Roman.

--
jay vyas