[ https://issues.apache.org/jira/browse/MAHOUT-854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jeff Eastman reopened MAHOUT-854:
---------------------------------
Reopening since this appears to be related to a current Jenkins build failure:
Exception in thread "main" org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory /tmp/mahout-work-jenkins/reuters-minhash already exists
at org.apache.hadoop.mapreduce.lib.output.FileOutputFormat.checkOutputSpecs(FileOutputFormat.java:134)
at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:846)
at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:807)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
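
The trace shows the job failing in checkOutputSpecs because the output directory is left over from an earlier run. One possible way to make the example script rerunnable is to clear that directory before the MinHash step. The sketch below is only an illustration and is not taken from MAHOUT-854.patch: the helper name clean_output_dir and the USE_HDFS switch are hypothetical, and the use of "hadoop fs -rmr" assumes a Hadoop release of that era where it is the recursive delete command.

```shell
#!/bin/sh
# Hypothetical helper (not from MAHOUT-854.patch): remove a stale output
# directory so that a rerun does not trip Hadoop's output-spec check.
clean_output_dir() {
  dir="$1"
  # Refuse empty or root paths so a missing argument cannot delete too much.
  if [ -z "$dir" ] || [ "$dir" = "/" ]; then
    echo "clean_output_dir: refusing to remove '$dir'" >&2
    return 1
  fi
  if [ -n "${USE_HDFS:-}" ]; then
    # Assumption: output lives on HDFS and "hadoop fs -rmr" is available.
    # A missing directory is not treated as an error.
    hadoop fs -rmr "$dir" 2>/dev/null || true
  else
    # Local filesystem run: plain recursive delete; -f ignores a missing path.
    rm -rf "$dir"
  fi
}
```

In the script this might be called as "clean_output_dir /tmp/mahout-work-jenkins/reuters-minhash" just before the MinHash job is launched, assuming that path is where the step writes its output, as the exception message suggests.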
> Add MinHash to build-reuters.sh example
> ---------------------------------------
>
> Key: MAHOUT-854
> URL: https://issues.apache.org/jira/browse/MAHOUT-854
> Project: Mahout
> Issue Type: Improvement
> Components: Clustering, Examples
> Reporter: Varun Thacker
> Assignee: Grant Ingersoll
> Priority: Minor
> Fix For: 0.6
>
> Attachments: MAHOUT-854.patch
>
>
> We can use the Reuters data set for MinHash clustering, so it would be
> nice to add the MinHash algorithm to build-reuters.sh.