[
https://issues.apache.org/jira/browse/MAHOUT-56?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Deneche A. Hakim updated MAHOUT-56:
-----------------------------------
Attachment: watchmaker-tsp.patch
I made a (relatively) small modification to CDGA that allows it to cope with
multi-class classification. You can now give it a target class, and it will
(try to) discover the classification rule for this class. If you have N classes,
just run it N times with a different target each time.
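As a rough illustration of the "run it N times" idea, here is a minimal sketch
of how a multi-class problem can be viewed as N binary ones, one per target.
This is not the CDGA code from the patch: the class name OneVsRestSketch, the
method toBinary and the sample labels are all made up for the example, and I am
only assuming that labels are represented as integer indices.

```java
import java.util.Arrays;

// Hypothetical helper for illustration only; not part of the CDGA patch.
class OneVsRestSketch {

    // Map multi-class label indices to a binary view for one target:
    // 1 if the label equals the target index, 0 otherwise.
    static int[] toBinary(int[] labels, int target) {
        int[] binary = new int[labels.length];
        for (int i = 0; i < labels.length; i++) {
            binary[i] = (labels[i] == target) ? 1 : 0;
        }
        return binary;
    }

    public static void main(String[] args) {
        int[] labels = {0, 2, 1, 2, 0}; // made-up labels for N = 3 classes
        int numClasses = 3;             // assumption: classes are indexed 0..N-1

        // One pass per target class, mirroring "run it N times".
        for (int target = 0; target < numClasses; target++) {
            System.out.println("target " + target + ": "
                    + Arrays.toString(toBinary(labels, target)));
        }
    }
}
```

Each pass only has to separate "target" from "everything else", which is why a
rule learner built for two classes can be reused unchanged.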
This modification allowed me to run CDGA over the KDD dataset, but it's
very, very slow. It takes more than 8 minutes to do a single iteration
for one target over the 10% dataset (I didn't have the courage to run it over
the whole dataset). At least now, I have a good dataset to test on a cluster :)
The target class (the index of the value for the LABEL in the info file) is
specified just after the dataset name. The following example runs CDGA over the
WDBC dataset with target 1:
{noformat}
$ ~/hadoop-0.17.0/bin/hadoop jar apache-mahout-0.1-dev-ex.jar \
    org.apache.mahout.ga.watchmaker.cd.CDGA wdbc 1 0.9 1 0.033 0.1 0 100 10
{noformat}
This is the last week of GSoC, so if you have any suggestions about the tests,
the comments, or the code, I think it's time for them :)
> Watchmaker Integration
> ----------------------
>
> Key: MAHOUT-56
> URL: https://issues.apache.org/jira/browse/MAHOUT-56
> Project: Mahout
> Issue Type: Task
> Components: Genetic Algorithms
> Reporter: Deneche A. Hakim
> Assignee: Grant Ingersoll
> Priority: Minor
> Fix For: 0.1
>
> Attachments: libs.zip, libs.zip, libs.zip, tsp-screenshot-1.jpg,
> watchmaker-tsp.patch, watchmaker-tsp.patch, watchmaker-tsp.patch,
> watchmaker-tsp.patch, watchmaker-tsp.patch, watchmaker-tsp.patch,
> watchmaker-tsp.patch, watchmaker-tsp.patch, watchmaker-tsp.patch,
> watchmaker-tsp.patch, watchmaker-tsp.patch, watchmaker-tsp.patch,
> watchmaker-tsp.patch, watchmaker-tsp.patch, watchmaker-tsp.patch
>
>
> The goal of this task is to allow Watchmaker-defined problems to be solved in
> Mahout.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.