[
https://issues.apache.org/jira/browse/MAHOUT-56?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12616573#action_12616573
]
Deneche A. Hakim commented on MAHOUT-56:
----------------------------------------
After meny attemps to load all the informations that you gave me in my
brain-processing-cluster-that-doesnt-work-quit-well, let's see if I understand
it correctly:
The algortihm handles any dataset in a matrix format, where (in my case) the
collumns are the attributes (and one of them is the Label) and the rows are the
datas.
Working with Hadoop, we'll need to pass the dataset in the mapper's input, so
it must be a file (or many files). We'll then need a custom InputFormat to feed
the mappers with the data, and here comes the lovely-named "row-wise splitting
matrix input format".
Now we want to be able to work with any given dataset file format (including
the ARFF and my custom format), and thus the InputFormat needs a decoder that
converts the dataset lines into matrix rows.
> Watchmaker Integration
> ----------------------
>
> Key: MAHOUT-56
> URL: https://issues.apache.org/jira/browse/MAHOUT-56
> Project: Mahout
> Issue Type: Task
> Components: Genetic Algorithms
> Reporter: Deneche A. Hakim
> Assignee: Grant Ingersoll
> Priority: Minor
> Fix For: 0.1
>
> Attachments: libs.zip, libs.zip, libs.zip, tsp-screenshot-1.jpg,
> watchmaker-tsp.patch, watchmaker-tsp.patch, watchmaker-tsp.patch,
> watchmaker-tsp.patch, watchmaker-tsp.patch, watchmaker-tsp.patch,
> watchmaker-tsp.patch, watchmaker-tsp.patch, watchmaker-tsp.patch,
> watchmaker-tsp.patch, watchmaker-tsp.patch, watchmaker-tsp.patch,
> watchmaker-tsp.patch
>
>
> The goal of this task is to allow watchmaker definded problems be solved in
> Mahout.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.