Using SpamAssassin to fight comment spam?

Ole Kasper Olsen Wed, 11 Jan 2006 05:36:10 -0800

Hi,

I am a developer on a fairly large community site (30,000-50,000 active users) with blogs, photo albums and forums.

I spent yesterday tinkering with a spam prevention system which runs each new comment on a blog post or on an image in a photo album through SpamAssassin. I take the submitted comment, assemble an RFC 822-compliant message from the user's IP address and the sender's and receiver's registered email addresses, and then run it through Mail::SpamAssassin (the Perl module) with default settings.
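
As a rough illustration of the assembly step only, here is a minimal sketch in Python rather than Perl. The function name, the placeholder host comments.example.invalid, the default subject and the choice of a Received header to carry the commenter's IP address are all assumptions made for the example, not a description of how the site actually builds its messages.

```python
from email.message import EmailMessage
from email.utils import formatdate, make_msgid

# Hypothetical placeholder host name, used only for this illustration.
SITE_HOST = "comments.example.invalid"


def comment_to_message(comment_text, sender_addr, recipient_addr, client_ip,
                       subject="Blog comment"):
    """Wrap a comment in a minimal RFC 822-style message for spam scoring."""
    now = formatdate(localtime=False)
    msg = EmailMessage()
    msg["From"] = sender_addr
    msg["To"] = recipient_addr
    msg["Subject"] = subject
    msg["Date"] = now
    msg["Message-ID"] = make_msgid(domain=SITE_HOST)
    # Assumption: expose the commenter's IP address in a Received-style
    # header so that IP-based checks have something to look at.
    msg["Received"] = f"from [{client_ip}] by {SITE_HOST}; {now}"
    msg.set_content(comment_text)
    return msg.as_string()
```

The resulting string is what would then be handed to the filter; how it is passed on (the Perl module, a daemon, or something else) is outside this sketch.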

This seems to work. At least it catches the test message provided in the SpamAssassin documentation.

This system requires me to have a utility where people can mark a comment as ham when SpamAssassin has wrongly identified a valid comment as spam. I was planning to have this utility teach the Bayesian filter on a community-wide basis, i.e. for all users. Therefore, people cannot mark their own messages as ham. This is to guard against spammers teaching the filter wrongly.
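
The rule that nobody may mark their own comment as ham could be sketched as below, again in Python. The function names and the shape of a report (a tuple of comment id, reporter id and comment author id) are hypothetical and chosen only for this example.

```python
def accept_ham_report(reporter_id, comment_author_id):
    """Return True if a 'this is not spam' report may train the shared filter.

    A report is rejected when the reporter wrote the comment, so that a
    spammer cannot whitelist their own messages.
    """
    return reporter_id != comment_author_id


def reports_for_training(reports):
    """Keep only reports filed by someone other than the comment's author.

    `reports` is an iterable of (comment_id, reporter_id, comment_author_id)
    tuples (a hypothetical layout for this sketch).
    """
    return [r for r in reports if accept_ham_report(r[1], r[2])]
```

Only the reports that survive this check would be passed on to the learning step.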


 - Is learning a good idea at all in this setting?

   - If so, what are the advantages and, more importantly, the disadvantages of having community-wide learning?

   - Should I use autolearning?

 - Is there anything else I should be aware of when implementing SpamAssassin in this setting?

   - Settings
   - Thresholds
   - &c?

After testing this a bit on comments, I hope to expand it to blog posts and forum posts as well, so that moderators get a heads-up when people post spam.


--
Ole Kasper Olsen
Information Systems Developer
Opera Software ASA
