I’m no expert here since I’m stuck on hadoop 1.2.1 but the latest master on github is meant to be used with hadoop 2.x as the default. Mahout 0.9 had some experimental support but is AFAIK not recommended for h 2. No clue about hadoop 2.5 specifically but if it truly is a minor backwards compatible release you may be ok. Mahout now uses Spark, which claims support through hadoop 2.4.x.
On Aug 20, 2014, at 3:03 PM, Wei Zhang <[email protected]> wrote: Hello, After a system upgrade, we have a Hadoop 2.5.0 cloud instance. We are trying to run Mahout on top of it. We are using Mahout 0.9 (downloaded from http://mahout.apache.org/general/downloads.html ) https://issues.apache.org/jira/browse/MAHOUT-1329 indicates Mahout supports Hadoop 2.2.0 But even with mvn clean install -Dhadoop2.version=2.2.0 -DskipTests=true, it doesn't appear Hadoop 2.x jar is downloaded anywhere (i.e., there is only Hadoop 1.2.1 jar downloaded, which is what the pom.xml dictates). Further, even a simple HDFS I/O call would fail (i.e., creating an HDFS file). It seems Hadoop 2.2.0 is not quite compatible with Mahout 0.9 My question is: (1) Does Mahout support Hadoop 2.5.0 or should I check out the code from git (as suggested at http://mahout.apache.org/developers/buildingmahout.html) in order to get a Hadoop 2.x friendly Mahout ? Thanks! Wei
