I guess the only problem that creates such demand for CDH is that the Hadoop project twisted everybody's arm by deprecating the entire MR API over what seem to be merely perceived OOA design issues rather than functional ones. Even that would have been OK had they not gone well over a year without providing a full replacement for that functionality in the new API, and 0.21, which should contain most of the replacements, is recognized as not production grade.

On Wed, Jun 8, 2011 at 12:27 AM, Sean Owen <sro...@gmail.com> wrote:
> Hadoop is Hadoop, so I don't know that any roadmap inconsistency between CDH
> and Hadoop is somehow Hadoop's fault.
>
> I don't think it's this ambiguous. Mahout runs on 0.20.2 Amazon EMR runs
> 0.20.2. The latest Hadoop version is 0.20.203.0. CDH is indeed somewhere
> inbetween but that's CDH.
>
> On Wed, Jun 8, 2011 at 2:46 AM, Lance Norskog <goks...@gmail.com> wrote:
>
>> CDH is the Cloudera distribution of Hadoop. The Hadoop people screwed
>> up their "forward motion" in api design and so now the Hadoop versions
>> are screwy. The Cloudera distribution is Hadoop 0.20.something plus
>> various bug fixes and features. Mahout runs on normal 0.20.something
>> and Cloudera. Amazon Elastic Map-Reduce is also 0.20.whatsit.
>>
>> Lance