[
https://issues.apache.org/jira/browse/MAHOUT-520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12920428#action_12920428
]
Joe Prasanna Kumar commented on MAHOUT-520:
-------------------------------------------
Drew,
Thanks for your comments.
I was just going over the Breiman example and saw that the document references
testdata directory. This directory is not hardcoded in the BreimanExample class
and so it wont be much of an impact to change the documentation of Breiman
Example. Neverthless, I guess its dangerous to just -rmr on testdata. we could
probably create a separate directory like syntethic_control_testdata. But with
changing this directory we need to modify all the Job.java under
org.apache.mahout.clustering.syntheticcontrol.* . I believe it is better to
modify those classes and add an InputOption / OutputOption to get the input and
output directories as parameters instead of the current hardcoded directory
names. should I submit a separate patch for this ? what do you suggest ?
regards
Joe.
> Add example scripts / integration tests for various algorithms.
> ---------------------------------------------------------------
>
> Key: MAHOUT-520
> URL: https://issues.apache.org/jira/browse/MAHOUT-520
> Project: Mahout
> Issue Type: Improvement
> Components: Classification
> Affects Versions: 0.4
> Reporter: Drew Farris
> Assignee: Drew Farris
> Priority: Minor
> Attachments: MAHOUT-520-syntheticcontrol.patch, MAHOUT-520.patch,
> MAHOUT-520.patch
>
>
> Scripts like build-reuters.sh are useful in that they both demonstrate
> typical usage of Mahout from the command-line but also serve as integration
> tests. We should add additional scripts that drive the algorithms so new
> users can quickly run the examples.
> Perhaps these can also be run from hudson as a part of the nightly builds and
> can serve as integration tests.
> As a start towards this goal, provide build-20news-bayes.sh example (in the
> same vein as build-reuters.sh, that follows
> https://cwiki.apache.org/MAHOUT/twenty-newsgroups.html
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.