Hello Henry, Wednesday, July 27, 2005, 6:39:22 PM, you wrote:
> [EMAIL PROTECTED] changed: > What |Removed |Added > ---------------------------------------------------------------------------- > Severity|normal |critical > Priority|P5 |P1 > since quite a few of the mass-checkers don't have accounts on that > box, I've also copied the set3 files to these URLs: > http://taint.org/xfer/2005/set3.fn.gz > http://taint.org/xfer/2005/set3.fp.gz > Please download and verify that any mails in the FP set that are > coming from your corpus, are indeed valid ham; and ditto for the FN > set being spam. FN: I spot-checked all FNs with positive scores, and checked every FN with negative scores. Corpus is clean, except: ham: mid=<[EMAIL PROTECTED]> discount: Message-ID: <[EMAIL PROTECTED]> Message-ID: <[EMAIL PROTECTED]> spam newsletter, but this user probably subscribed to it... There are 259 emails from/via constantcontact.com which are treated as spam on my system, have been flagged as spam on my system (scores as high as 30's and 40's), have been encapsulated on delivery, have never been flagged by any user as not-spam, but, for the purposes of a world-wide mass-check, these constantcontact.com emails might be questionable. Note: Not all constantcontact.com is treated as spam here -- quite a few cc.com newsletters are subscribed to and seen as ham by their subscribers and the system. The ones I find above in the fns file are all from a set of eight newsletters which have regularly (almost always) been seen as spam, and no user has ever corrected that classification. Henry: To remove these from the log (if you want to), remove everything where the path is /home/Bob/spamassassin.active/masses/corpus.spam (or corpus.ham), since that identifies my corpus contribution, and where the mid ends in @scheduler. FP: Checked every one. Corpus is clean, except: ham: Message-ID: <[EMAIL PROTECTED]> There are two of these listed. One should be removed. spam: mid=<[EMAIL PROTECTED]> Bob Menschel
