> I think the Genetic Algorithm (GA) assigns all the scores now. > GA's are very > powerful optimization tools, and if the GA lowered those scores, it likely > raised (compensated) other scores that were more common spam signatures. > > The GA is only as good as the population of data it is run on. > Craig posted > an e-mail a few days (weeks?) ago stating he was looking for some sample > non-spam e-mail. I'd be willing to contribute some business oriented > e-mails, as I think the corpus as it stands is leans slightly towards the > tech e-mails. > > Gene
Craig, I would be glad to contribute some emails for this cause. I am currently running procmail so it would be easy to grab a copy of every message that comes in over a weeks time. How many messages would you want? Would they need to be only non-spam or everything? Would they need to have any SA headers stripped from them? Please tell me what you want. I run an ISP in western PA. I would prefer to run the tests for you at my site since I would not feel comfortable uploading other peoples emails offsite. But ya gotta tell me what you need. Let me know, Ed. _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk