On Fri, Aug 14, 2009 at 01:25, Warren Togami<[email protected]> wrote: > On 08/13/2009 07:33 PM, Michael Parker wrote: >> >> Historical accuracy of network tests is key, providing corpora without >> SpamAssassin rules from actual receive time does not help scoring, it >> hurts it. >> >> Michael >> > > Then shouldn't the documentation mention this? My corpora is inconsistently > filtered by spamassassin in the past. Other corpora from other users > processed nightly on my server has never been filtered by spamassassin in > the past.
That statement is mainly Michael's opinion. I'm not sure about it; my opinion is that we're better off with "post-facto" network test results from the mass-check, than with hardly any network test results at all. The status of your corpora is pretty much the status of most contributors' corpora, fwiw. -- --j.
