Ben -- yes CDH distributions backport 0.21 patches. Basically the maven version of CDH3b3 says 0.20.2+737 where 737 is the number of mostly 0.21 patches that they backported to 0.20.2 platform. They still have pretty useful patches missing but their approach is to do thorough testing of those changes so perhaps some of the features are omited deliberately. Most ppl that i know (including us) use that stuff in production because of the combination of fixes that are hard to ignore (such as append api so desperately needed for latest 0.89 HBase releases) with scrutiny for stability that goes with it. AFAIK they also use these with their clients.
-d On Mon, Dec 20, 2010 at 3:58 PM, Ben Clay <[email protected]> wrote: > Dmitry- > > Thanks for looking into this. > > HADOOP-2141 is indeed what we need. I am a bit confused - the JIRA page > (https://issues.apache.org/jira/browse/HADOOP-2141) shows it was included > in > Hadoop 0.21, but CDH3b3 appears to be based on 0.20 (shown as > hadoop-0.20.noarch in their yum repo). Does Cloudera typically backport > features like this? > > Regardless, I'll install CDH3b3 and use it. Thanks for the tip! > > -Ben > > > -----Original Message----- > From: Dmitriy Lyubimov [mailto:[email protected]] > Sent: Monday, December 20, 2010 6:25 PM > To: [email protected] > Subject: Re: Hadoop 0.21 support > > Also , just in case if you are talking about HADOOP-2141, it _is_ included > in CDH3b3, and i did verify before that CDH3b3 compiles with Mahout. > > Thanks. > -Dmitriy > > On Mon, Dec 20, 2010 at 3:03 PM, Dmitriy Lyubimov <[email protected]> > wrote: > > > Ben, > > > > Just curious, are you planning to use 0.21 or new api from Cloudera > > distribution? Cloudera seems to compile fine although i am not sure if > > they backported the feature you are talking about. > > > > -d > > > > > > On Mon, Dec 20, 2010 at 2:25 PM, Ben Clay <[email protected]> wrote: > > > >> Hi- > >> > >> > >> > >> Is there a timeline for updating Mahout to work with Hadoop 0.21? > >> > >> > >> > >> Hadoop 0.21 introduced some features we need (LATE scheduler) but > >> when running the Mahout Quickstart jobs on both Mahout 0.4 and the > >> latest 0.5 trunk, I get a number of errors relating to the new Hadoop > >> API. I had some trouble finding details using Google, but I see the > >> 0.5 trunk's pom.xml indicates <hadoop.version>0.20.2</hadoop.version> > >> > >> > >> > >> Thanks for any info! > >> > >> > >> > >> -Ben > >> > >> > >> > >> > > > >
