[ https://issues.apache.org/jira/browse/MAHOUT-147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12732572#action_12732572 ]
Grant Ingersoll commented on MAHOUT-147: ---------------------------------------- There is a small bug in BayesTFIDFMapper that doesn't properly handle labels with multiple commas: e.g: -17th_century_mathematicians|anderson,_alexander,1582 > Wikipedia Example improvements > ------------------------------ > > Key: MAHOUT-147 > URL: https://issues.apache.org/jira/browse/MAHOUT-147 > Project: Mahout > Issue Type: Improvement > Components: Classification > Reporter: Grant Ingersoll > Assignee: Grant Ingersoll > Priority: Minor > Fix For: 0.2 > > > The Wikipedia example for classification can be improved by: > 1. streamlining category matching: Currently, we identify all the categories > in the doc and then check to see if there are matches by looping over all the > found categories and all the input categories, with the first match winning. > Am examining a bit closer, so may add more here. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.