Hey Rita, All software developed by Cloudera for CDH is Apache (v2) licensed and freely available. See these docs [1,2] for more info.
We publish source packages (which includes the packaging source) and source tarballs, you can find these at http://archive.cloudera.com/cdh/3/. See the CHANGES.txt file (or the cloudera directory in the tarballs) for the specific patches that have been applied. CDH contains a number of projects (Hadoop, Pig, Hive, HBase, Oozie, Flume, Sqoop, Whirr, Hue, ZooKeeper, etc). Most have a small handful of patches applied (often there's only a couple additional patches as we've rolled an upstream dot release that folded in the delta from the previous release). The vast majority of the patches to Hadoop come from the Apache security and append [3, 4] branches. Aside from those the rest are critical backports and bug fixes. In general, we develop upstream first. Hope this clarifies things. Thanks, Eli 1. https://wiki.cloudera.com/display/DOC/Apache+License 2. https://wiki.cloudera.com/display/DOC/CDH3+Installation+Guide 3. http://svn.apache.org/viewvc/hadoop/common/branches/branch-0.20-security 4. http://svn.apache.org/viewvc/hadoop/common/branches/branch-0.20-append On Wed, Mar 23, 2011 at 7:29 AM, Rita <rmorgan...@gmail.com> wrote: > I have been wondering if I should use CDH (http://www.cloudera.com/hadoop/) > instead of the standard Hadoop distribution. > > What do most people use? Is CDH free? do they provide the tars or does it > provide source code and I simply compile? Can I have some data nodes as CDH > and the rest as regular Hadoop? > > > I am asking this because so far I noticed a serious bug (IMO) in the > decommissioning process ( > http://mail-archives.apache.org/mod_mbox/hadoop-common-user/201103.mbox/%3cAANLkTikPKGt5zw1QGLse+LPzUDP7Mom=ty_mxfcuo...@mail.gmail.com%3e > ) > > > > > -- > --- Get your facts first, then you can distort them as you please.-- >