[
https://issues.apache.org/jira/browse/MAHOUT-838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13135889#comment-13135889
]
Joe Prasanna Kumar commented on MAHOUT-838:
-------------------------------------------
Lance,
In the attached html file, the confusion matrix has a summary with the labels
and the Correct instance count with the % When I run testClassifier, I couldnt
see this summary in the output. I exec the testclassifier as below ./bin/mahout
testclassifier -m /tmp/mahout-work-joekumar/20news-bydate/bayes-model -d
/tmp/mahout-work-joekumar/20news-bydate/bayes-test-input -type bayes -ng 1
-source hdfs -method mapreduce -cm 20newconfuxion
The summary is printed in the console when we run the testclassifier in a
sequential mode. The toString() of ResultAnalyzer implements the summary info
that is being printed in the console. Should we make the summary info as part
of the ConfusionMatrix ? Should this be a separate JIRA issue ? In the end,
it'd be good to be able to print / write the summary along with the matrix. plz
let me know what the best approach would be.
Joe.
> Make the confusion matrix writable to a file when testing classifiers
> ---------------------------------------------------------------------
>
> Key: MAHOUT-838
> URL: https://issues.apache.org/jira/browse/MAHOUT-838
> Project: Mahout
> Issue Type: Improvement
> Affects Versions: 0.6
> Reporter: Grant Ingersoll
> Priority: Minor
> Attachments: ConfusionMatrix.java, MAHOUT-838.patch,
> MAHOUT-838.patch, MAHOUT-838_mini.patch, MatrixWritable.java, bayes-cm-10.html
>
>
> If you have a lot of labels for a classifier, the confusion matrix is hard to
> fit in terminal window. Would be nice if we could write it out to a file.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira