[ https://issues.apache.org/jira/browse/MAHOUT-286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Martin Häger updated MAHOUT-286:
--------------------------------

    Attachment: run.sh
                data.training.arff
                data.arff

Attaching:
 * data.arff - test data in ARFF format
 * data.training.arff - training data in ARFF format
 * run.sh - a script that shows how Mahout was run

> Need to be able to run classifiers from non-text input (such as ARFF data)
> --------------------------------------------------------------------------
>
>                 Key: MAHOUT-286
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-286
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Ted Dunning
>         Attachments: data.arff, data.training.arff, mahout.log, run.sh, weka.log
>
>
> Martin Haeger wrote this:
> {quote}
> We're experimenting a bit with Weka and Mahout. Our input data is a
> relation in ARFF format (see attached data.training.arff), and we'd
> like to classify it using Mahout. However, at first glance it seems
> to us that the Mahout classifier.bayes.interfaces.Algorithm interface
> is centered around documents of text rather than general attribute data.
> As a result, running the classifier causes our ARFF data to be
> interpreted as a document of words, which produces results that are
> not very useful (see attached mahout.log).
> With Weka, we're able to get the results we want (see attached weka.log).
> Any suggestions for how to get this working?
> {quote}
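
One possible workaround for the mismatch described in the quote is to turn each ARFF row into a small "document" of attribute=value tokens before handing it to a text-oriented classifier. The Python sketch below only illustrates that idea. It is not part of Mahout or Weka, the helper names parse_arff and row_to_document are hypothetical, and the sample relation is made up (it is not the attached data.training.arff). It assumes a dense ARFF file with nominal attributes and no quoted values. Numeric attributes would need binning first, and the label-then-tokens line layout is an assumption about what the classifier expects as input.

```python
def parse_arff(text):
    """Return (attribute_names, rows) from a simple dense ARFF string."""
    names, rows, in_data = [], [], False
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines and ARFF comments.
        if not line or line.startswith("%"):
            continue
        lower = line.lower()
        if in_data:
            rows.append([v.strip() for v in line.split(",")])
        elif lower.startswith("@attribute"):
            # The second whitespace-separated field is the attribute name.
            names.append(line.split()[1])
        elif lower.startswith("@data"):
            in_data = True
    return names, rows


def row_to_document(names, row, label_attr):
    """Encode a row as (label, tokens), one 'name=value' token per attribute."""
    label = row[names.index(label_attr)]
    tokens = [f"{n}={v}" for n, v in zip(names, row) if n != label_attr]
    return label, tokens


# Made-up sample relation, used only to exercise the sketch.
SAMPLE = """\
@relation weather
@attribute outlook {sunny, rainy}
@attribute windy {TRUE, FALSE}
@attribute play {yes, no}
@data
sunny,FALSE,no
rainy,TRUE,yes
"""

names, rows = parse_arff(SAMPLE)
documents = [row_to_document(names, r, "play") for r in rows]
for label, tokens in documents:
    # One line per instance: the class label, a tab, then the tokens.
    print(label + "\t" + " ".join(tokens))
```

Prefixing each value with its attribute name keeps identical values from different attributes (for example TRUE under two different columns) from collapsing into the same word.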

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
