[ 
https://issues.apache.org/jira/browse/MAHOUT-147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12732572#action_12732572
 ] 

Grant Ingersoll commented on MAHOUT-147:
----------------------------------------

BayesTFIDFMapper has a small bug: it does not properly handle labels that
contain multiple commas, e.g.:
-17th_century_mathematicians|anderson,_alexander,1582
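
To illustrate how a key like the one above can be mis-parsed, here is a minimal sketch. It assumes, purely for illustration, that the key has the form <marker><label>,<feature>, with a one-character marker and a feature that contains no comma. The class LabelFeatureKey and its methods are hypothetical and are not taken from the Mahout source; the actual parsing in BayesTFIDFMapper may differ.

```java
/**
 * Hypothetical helper (not part of Mahout) showing why splitting a
 * "<marker><label>,<feature>" key on every comma breaks when the
 * label itself contains commas.
 */
final class LabelFeatureKey {
    private LabelFeatureKey() {}

    /** Naive approach: drop the marker, then split on every comma. */
    static String[] splitNaive(String key) {
        return key.substring(1).split(",");
    }

    /**
     * Assumed fix: drop the marker, then split on the LAST comma only,
     * so any commas inside the label are preserved.
     * Returns {label, feature}; feature is "" if no comma is present.
     */
    static String[] splitOnLastComma(String key) {
        String body = key.substring(1);
        int idx = body.lastIndexOf(',');
        if (idx < 0) {
            return new String[] { body, "" };
        }
        return new String[] { body.substring(0, idx), body.substring(idx + 1) };
    }
}
```

Under these assumptions, splitNaive yields three fields for the example key, while splitOnLastComma yields the two intended ones (label and feature). If features can also contain commas, a comma-free delimiter would be needed instead.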

> Wikipedia Example improvements
> ------------------------------
>
>                 Key: MAHOUT-147
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-147
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Classification
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.2
>
>
> The Wikipedia example for classification can be improved by:
> 1. streamlining category matching:  Currently, we identify all the categories 
> in the doc and then check to see if there are matches by looping over all the 
> found categories and all the input categories, with the first match winning.
> I am examining this more closely, so I may add more here.
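
One possible way to streamline the matching described in item 1 is sketched below. It assumes that a match means exact string equality between a category found in the document and an input category; the matching rule in the actual Wikipedia example may be different. CategoryMatcher and both methods are hypothetical names used only for this illustration.

```java
import java.util.List;
import java.util.Set;

/** Hypothetical sketch (not Mahout code) of first-match category lookup. */
final class CategoryMatcher {
    private CategoryMatcher() {}

    /**
     * The approach described in the issue: loop over every found category
     * and every input category; the first match wins.
     */
    static String firstMatchNested(List<String> found, List<String> input) {
        for (String f : found) {
            for (String i : input) {
                if (f.equals(i)) {
                    return f;
                }
            }
        }
        return null; // no category matched
    }

    /**
     * Assumed alternative: keep the input categories in a Set so each
     * found category is checked with a single lookup instead of an
     * inner loop. The first found category that matches still wins.
     */
    static String firstMatchWithSet(List<String> found, Set<String> input) {
        for (String f : found) {
            if (input.contains(f)) {
                return f;
            }
        }
        return null; // no category matched
    }
}
```

With a HashSet for the input categories, the cost per document is proportional to the number of found categories instead of the product of the two list sizes. This only applies if exact matching is what is wanted.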

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.