[ https://issues.apache.org/jira/browse/MAHOUT-939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lance Norskog updated MAHOUT-939: --------------------------------- Attachment: 939.patch Update to trunk. This patch lets you reject mail messages if they match a one-line pattern in the headers. Intended for stripping out spam leaker mails like build messages. > ASF Email Classification Examples don't always produce good results > ------------------------------------------------------------------- > > Key: MAHOUT-939 > URL: https://issues.apache.org/jira/browse/MAHOUT-939 > Project: Mahout > Issue Type: Bug > Affects Versions: 0.6 > Reporter: Grant Ingersoll > Assignee: Grant Ingersoll > Labels: MAHOUT_INTRO_CONTRIBUTE > Fix For: 0.7 > > Attachments: 939.patch, MAHOUT-939.patch, MAHOUT-939.patch, > MAHOUT-939.patch, strip_reject.patch > > > The classification examples for the ASF email don't work all that well > currently in terms of quality when it comes to more than a few labels. Also, > need to determine how much memory is required for vectors of cardinality size > 100K. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira