Robert Menschel wrote:

> MK> However, these attempts are only going to be effective against the bayes 
> portion
> MK> of SA.
> 
> As I've said before, my opinion is that these attempts are NOT
> effective against SpamAssassin's Bayes system.
> 
> As a rule, we do NOT receive hams which contain such extracted text.
> No matter where the spammers extract their text from, they're going to
> extract words that are not found in ham, and Bayes is going to learn
> that the presence of such words means S P A M.

I agree, mostly, however I have found that SOME emails with extracted text
collide with our ham profile. Not all, not even many, but some do collide.

Really this is entirely a function of how well the spammer can match your ham
profile with his extraction. If he can match it accurately, this technique will
be very effective against your bayes. If they can't match your ham profile, it
won't work at  all.


Just today I got one email with this hit list:

score=17.817, required 5,       autolearn=spam, AB_URI_RBL 1.00, BAYES_10 -0.91,
BLACK_URI_RBL 2.00, DRUGS_ERECTILE 1.00, INFO_GREYLIST_NOTDELAYED -0.00,
RAZOR2_CF_RANGE_51_100 0.20, RAZOR2_CHECK 1.05, RCVD_IN_BL_SPAMCOP_NET 1.50,
RCVD_IN_XBL 4.92, SPAMCOP_URI_RBL 3.00, VIAGRA_ONLINE 4.06


It got the BAYES_10 because the extracted text closely matches the general
language style of my end users. The spam content was 1 line and a url. The
extracted text was 4 lines.



Reply via email to