[ 
https://issues.apache.org/jira/browse/MAHOUT-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13865813#comment-13865813
 ] 

Suneel Marthi commented on MAHOUT-1391:
---------------------------------------

Mansur, could you also include a test case that reproduces the original issue 
as part of the patch?

> Possibility to disable confusion matrix in naive bayes
> ------------------------------------------------------
>
>                 Key: MAHOUT-1391
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1391
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Classification
>    Affects Versions: 0.8
>            Reporter: Mansur Iqbal
>             Fix For: 0.9
>
>         Attachments: MAHOUT-1391.patch
>
>
> Sometimes the confusion matrix is too big and not really necessary.
> There is also a second case for making it optional:
> If you split a dataset with many labels into a test dataset and a training 
> dataset by random selection percentage, the test data may contain 
> classes/labels that do not appear in the training dataset. The label index 
> created when building a model from the training data then lacks those 
> labels. If you test this model with the test data, Mahout tries to create 
> a confusion matrix with labels from the test data that are not included in 
> the label index and throws an exception.
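
The failure mode described in the quoted issue can be sketched without any Mahout classes. The following is a minimal illustration only, not Mahout's implementation: the class `LabelIndexSketch`, its methods, and the sample labels are all hypothetical, and the label index is modelled as a plain map. It shows how a label that exists only in the test split has no entry in an index built from the training split, so a naive confusion matrix cannot place it.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical sketch; none of these names come from Mahout.
public class LabelIndexSketch {

    // Build a label -> row/column index from the labels seen in the training split.
    public static Map<String, Integer> buildLabelIndex(List<String> trainingLabels) {
        Map<String, Integer> index = new HashMap<>();
        for (String label : trainingLabels) {
            if (!index.containsKey(label)) {
                index.put(label, index.size());
            }
        }
        return index;
    }

    // Labels that occur in the test split but have no entry in the label index.
    public static Set<String> missingLabels(Map<String, Integer> labelIndex,
                                            List<String> testLabels) {
        Set<String> missing = new TreeSet<>();
        for (String label : testLabels) {
            if (!labelIndex.containsKey(label)) {
                missing.add(label);
            }
        }
        return missing;
    }

    // Naive confusion matrix: rows are actual labels, columns are predicted labels.
    // Throws if a label is not in the index, mirroring the reported problem.
    public static int[][] confusionMatrix(Map<String, Integer> labelIndex,
                                          List<String> actual,
                                          List<String> predicted) {
        int n = labelIndex.size();
        int[][] matrix = new int[n][n];
        for (int i = 0; i < actual.size(); i++) {
            Integer row = labelIndex.get(actual.get(i));
            Integer col = labelIndex.get(predicted.get(i));
            if (row == null || col == null) {
                throw new IllegalArgumentException(
                    "Label not in label index: " + actual.get(i) + " / " + predicted.get(i));
            }
            matrix[row][col]++;
        }
        return matrix;
    }

    public static void main(String[] args) {
        // Made-up labels: "promo" appears only in the test split.
        List<String> training = Arrays.asList("spam", "ham", "spam");
        List<String> testActual = Arrays.asList("ham", "promo");
        List<String> testPredicted = Arrays.asList("ham", "spam");

        Map<String, Integer> labelIndex = buildLabelIndex(training);
        System.out.println("Missing from label index: "
            + missingLabels(labelIndex, testActual));
        try {
            confusionMatrix(labelIndex, testActual, testPredicted);
        } catch (IllegalArgumentException e) {
            System.out.println("Confusion matrix failed: " + e.getMessage());
        }
    }
}
```

A regression test for the patch could follow the same shape: build a model from a training split that lacks one label, run the test step on data that contains it, and assert that no exception is thrown when the confusion matrix is disabled.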



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
