On 2015-12-02 09:14, Sebastian Arcus wrote:
> Perfect - that's exactly the sort of real-life based advice I was looking for. Many thanks!

I run a small shared hosting environment with a global Bayes database for all users, as not enough users are ready/willing/able to take the time to sort ham (although more will press "this is spam"). In general, the results work out well enough.

Sharing Bayes between servers or sites would not seem to be particularly different from a shared Bayes between multiple customers in a shared hosting environment, as long as the "typical end user" is similar. If you have a viagra dealer or diet pill retailer as one of your customers, your mileage may vary and they may need more personalization, but in general, for typical SOHO and SMB customers, spammy spam is spammy spam and pretty widely distributed.

From what I see, it's ham that varies a lot per user. So while we try to train Bayes across a wide range of ham sets, we also do a lot of automated whitelisting based on user behaviour: mail that users send, or mail that users keep in their mailboxes, so that we can skip spam filtering entirely for as much "wanted" mail as possible. We also try to reduce filtering on replies when the "In-Reply-To:" header matches certain formats (such as what our webmail produces, what we add to messages missing this header, and a few other formats), so it's possible that someone else who borrowed our Bayes database would end up seeing a higher false-positive rate.
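The reply-detection idea above could be sketched roughly like this. This is a minimal illustration only; the regex patterns, domain names, and function name are hypothetical placeholders, not the actual Message-ID formats any real system uses:

```python
import re

# Hypothetical Message-ID formats that "our" systems might generate.
# Replies whose In-Reply-To header matches one of these could then
# bypass some or all spam filtering. Patterns are illustrative only.
TRUSTED_REPLY_PATTERNS = [
    # IDs our webmail might produce (assumed format)
    re.compile(r"^<\d+\.\w+@webmail\.example\.com>$"),
    # IDs we might add to messages that arrived without one (assumed format)
    re.compile(r"^<added-[0-9a-f]{16}@mx\.example\.com>$"),
]

def is_reply_to_local_message(in_reply_to: str) -> bool:
    """Return True if the In-Reply-To header matches a format we generate."""
    value = in_reply_to.strip()
    return any(p.match(value) for p in TRUSTED_REPLY_PATTERNS)

# A reply referencing a webmail-generated Message-ID is recognized;
# one referencing an arbitrary external ID is not.
print(is_reply_to_local_message("<1449048000.abc123@webmail.example.com>"))  # True
print(is_reply_to_local_message("<random-id@elsewhere.example.org>"))        # False
```

The point of the borrowed-database caveat is that another site would not generate Message-IDs in these formats, so this shortcut would never fire for them and more of their reply traffic would go through full filtering.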

We avoid training mail from big companies (Amazon, eBay, etc.) as spam even when they spam, as long as it's clearly identified in a blockable way; instead we give users the ability to block those senders outright when applicable.

Sure, there are errors and mistakes, but by and large Bayes works out the details in a shared environment. A multi-server environment shouldn't be too different, as long as the customer base is similar.

--
Dave Warren
http://www.hireahit.com/
http://ca.linkedin.com/in/davejwarren