Finally had a chance to take a look. Although I could probably spend some more time running the different algorithms, I think it looks good.
+1 for release. Here's what I've tried: Source release: - Untarred mahout-0.3-src.tar.bz2 - Built using 'mvn clean install' from an empty repo and default settings, - Executed bin/mahout vectordump -s file and was able to successfully dump a vector file generated from an older version of mahout. - Executed bin/mahout ./bin/mahout org.apache.mahout.utils.nlp.collocations.llr.CollocDriver (locally) - Set HADOOP_HOME, HADOOP_CONF_DIR and was able to use bin/mahout to run kmeans on the cluster Binary release:: - Moved existing mahout 0.3 directory from source release out of the way. - Untarred mahout-0.3.tar.bz2 (binary release), - Ran a vector dump as a above - Used bin/mahout to run kmeans clustering (locally) - Set HADOOP_HOME, HADOOP_CONF_DIR and was able to use bin/mahout to run kmeans on the cluster. Environment: - Ubuntu 9.04 i386 - JDK 1.6.0_16 - Maven 2.2.1 - Ant 1.7.1 - hadoop 0.20.1 On Thu, Mar 11, 2010 at 9:01 PM, Benson Margulies <bimargul...@gmail.com> wrote: > Proposed Mahout release 0.3 artifacts can by found at: > > https://repository.apache.org/content/repositories/orgapachemahout-005/ > > A report of issues addressed in this release can be found at > > https://issues.apache.org/jira/secure/IssueNavigator.jspa?reset=true&&pid=12310751&status=5&status=6&fixfor=12314281&sorter/field=issuekey&sorter/order=DESC > . > > Please vote to release or not. >