I've set up SpamAssassin with a site-wide Bayes configuration. I have some spamtrap email addresses that feed fresh spam into Bayes for training via a cron job. However, from what I've read, Bayes needs ongoing ham as well as spam for training in order to work well. What's the usual method of supplying the ham? Does that have to be done manually (and if so, how often?), or has anyone come up with a way to supply ham automatically?
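A spamtrap training job of this kind could look roughly like the sketch below. It is only an illustration under assumptions: the mailbox path `/var/mail/spamtrap`, the `train_spamtrap` function name, the `SA_LEARN` and `SPAMTRAP_MBOX` variables, and the `--run` switch are all hypothetical. Only `sa-learn --spam --mbox` is the real SpamAssassin command.

```shell
#!/bin/sh
# Hypothetical spamtrap training script, intended to be run from cron.
# The path and variable names are placeholders, not a real setup.
SA_LEARN="${SA_LEARN:-sa-learn}"
SPAMTRAP_MBOX="${SPAMTRAP_MBOX:-/var/mail/spamtrap}"

train_spamtrap() {
    # Skip quietly if the mbox is missing or empty.
    [ -s "$SPAMTRAP_MBOX" ] || return 0
    # Learn every message in the mbox as spam.
    "$SA_LEARN" --spam --mbox "$SPAMTRAP_MBOX"
}

# Only train when explicitly asked to, e.g. from the cron entry.
if [ "${1:-}" = "--run" ]; then
    train_spamtrap
fi
```

A cron entry would then call the script with `--run` on whatever schedule suits the spam volume. The ham side has no equivalent here, which is exactly the gap the question is about.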
The spamtrap mailboxes receive only spam, but all the real email addresses on the server receive a mix of ham and spam, which is why I need SpamAssassin in the first place :) I can't find anything in the SpamAssassin docs so far that explains a non-manual way of supplying ham. Have I missed something? Is there some sort of service where I can subscribe to an automatically updated ham corpus, like with the ClamAV database? -Steve