> I think the Genetic Algorithm (GA) assigns all the scores now.
> GA's are very
> powerful optimization tools, and if the GA lowered those scores, it likely
> raised (compensated) other scores that were more common spam signatures.
>
> The GA is only as good as the population of data it is run on.
> Craig posted
> an e-mail a few days (weeks?) ago stating he was looking for some sample
> non-spam e-mail.  I'd be willing to contribute some business oriented
> e-mails, as I think the corpus as it stands is leans slightly towards the
> tech e-mails.
>
> Gene

Craig,
I would be glad to contribute some emails for this cause.  I am currently
running procmail so it would be easy to grab a copy of every message that
comes in over a weeks time.  How many messages would you want?  Would they
need to be only non-spam or everything?  Would they need to have any SA
headers stripped from them?  Please tell me what you want.  I run an ISP in
western PA.  I would prefer to run the tests for you at my site since I
would not feel comfortable uploading other peoples emails offsite.  But ya
gotta tell me what you need.

Let me know,
Ed.


_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to