On 03/02, John Hardin wrote:
> Is the desire for distributed processing stronger than the desire
> for consistent results? I'd suggest at least part of the problem
> could be addressed by uploading the spam corpora and letting the
> central masscheck chew on it. Automating the collect-expire-upload
> process for corpora is easy and is less sensitive to temporary
> outages - so what if your uploaded spam corpus is a week stale due
> to a local failure?

The entire reason for not uploading the spam corpora themselves is
privacy.

I'm pretty sure everybody agrees that, for spamassassin, it would be
better if everybody used the first and easiest method suggested on the
nightly mass-check page, and uploaded their corpora instead of running
mass-check and uploading the results.

http://wiki.apache.org/spamassassin/NightlyMassCheck

However, I'd rather not also need to think about which of my emails
might contain information too sensitive to upload where someone else
might be able to read it. (Even though, really, if it's that sensitive,
it shouldn't be going over SMTP in cleartext.)

But I have wondered about uploading my spam, and running mass-check on
my non-spam.

-- 
"Don't go around saying the world owes you a living. The world owes you
nothing. It was here first." - Mark Twain
http://www.ChaosReigns.com
