[ https://issues.apache.org/jira/browse/MAHOUT-1527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13994074#comment-13994074 ]
Andrew Palumbo commented on MAHOUT-1527:
----------------------------------------

The issue of testnb crashing should probably be addressed in the future, either by rejecting unrecognized labels or by changing TrainNaiveBayesJob's -l option (which doesn't seem to work right now, though I didn't spend much time looking at it).

> Fix wikipedia classifier example
> --------------------------------
>
>                 Key: MAHOUT-1527
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1527
>             Project: Mahout
>          Issue Type: Task
>          Components: Classification, Documentation, Examples
>    Affects Versions: 0.7, 0.8, 0.9
>            Reporter: Sebastian Schelter
>             Fix For: 1.0
>
>         Attachments: MAHOUT-1527.patch
>
>
> The examples package has a classification showcase for predicting the labels
> of Wikipedia pages. Unfortunately, the example is totally broken:
> it relies on the old NB implementation, which has been removed; it suggests
> using the whole of Wikipedia as input, which will not work well on a single
> machine; and the documentation uses commands that have long been removed from
> bin/mahout.
> The example needs to be updated to use the current naive Bayes implementation,
> and documentation for the website needs to be written.