On Fri, 31 May 2013 12:15:54 -0700 (PDT)
tony wrote:

> now I've given it some more thought it's a certain type of spam that
> gets through. I never get scams or marital aid spam, never. The spam
> mails that get through are consistently to do with hospitality,
> increased sales and business courses. Are there rules I should be
> using to combat this type of spam?

If you don't want to spend a lot of time managing your own local rules,
I'd recommend scoring in a second statistical filter, one that supports
multi-word tokenization. These can do better than Bayes with type of
spam you mentioned because they can match patterns made out of words
that are themselves neutral. I say patterns because once you get
beyond 2-words the tokens can include gaps. 

I use Bogofilter configured for 4-word tokenization and DSpam with
orthogonal sparse bigram tokenization, but I've scored down DSpam
because it was contributing to most of my FPs. 

I recommend trying Bogofilter. No need for a plugin, just pipe your mail
through it and score the x-Bogosity  header.

Reply via email to