Re: 'anti' AWL

LuKreme Thu, 30 Apr 2009 15:18:03 -0700

On 30-Apr-2009, at 11:50, Charles Gregory wrote:

On Thu, 30 Apr 2009, LuKreme wrote:
First off, I suppose that if you get real mail from someone who hasonly ever been seen as a spam sender, then yes, the first mailwould be penalized. But is this ever the case?
(nod) Any time someone's address has been used as a spoofed senderbefore that legitimate sender makes first contact with a newcorrespondent. But as I understand your logic, there is no 'rule' todistinguish the 'first' AWL entry as 'special' from all the rest...just that 'others' exist...


Right.

Let's lay out the logic here:
2 AWL is positive or does not exist
a Check for other AWL entries using same address but different hosts.
i If there is an AWL with a negative score, then multiply by-0.2 and
  add to score


So any AWL with a negative score still helps the new mail be negative?
The sender's legit mail helps new spam?

No, the senders AWL HURTS new spam. I fthe score is -2 from the AWLthen -2 * -0.2 = 0.4

ii If there is an AWL with a positive score, under 5.0, thenmultiply by
  0.1 and add
iii If there is an AWL with a positive score over 5.0, thenmultiply it
  by 0.4 and add
So in the unlikely event that spam (from a different server)precedes legitimate mail, the legit sender gets a postitiveadjustment before they have a chance to score negative...

As I understand it the AWL is added after all others, but yes, theFIRST legitimate mail will be penalized.

Note that this logic will also be problematic when sender hasmultiple mail servers. Many senders get a few points positive...

This will only be an issue if those multiple servers have positive AWLscores.

c if total amount added is over some threshold, normalize on thatthreshold
(3 points? 5? 8?)
Now let's presume that the sender is spoofed by spammers on tendifferentIP's, producing ten different AWL entries. How will you distinguishthe legit sender's IP (except by hoping they have scorednegative?)... You will simply add up ALL the IP AWL's and score*any* mail from the sender
with a significant positive adjustment....

As far as I can tell, though it's not easy to be sure, legitimatesenders have negative AWL scores.

3 AWL is negative
{ crickets }
But how often does that really happen? As I said, most people get a*few* points on legit mail.

But it's not the points on the mail, it is only the AWL listing thatwe're looking at.

The idea being that an average score of 0.8 will 'average' with afluke spammy mail and keep the score lower.... But your way isadding those small scores to essentially ALL mail unless the luckysender never mentioned viag.... ooops. There goes *my* score.... LOL

OK, how do we parse out the AWL numbers then so we can see what sortsof AWL numbers exist for legit senders. As I understand it, if anemail comes in from a know sender who was average 0.8 and this emailscores 3.0, a negative AWL will be applied to normalize the emailcloser to 0.8, right? The AWL score is not 0.8, but 3.0 - (AWL value)?

Maybe it makes sense to only do this check if the message has atleast scored positive?
Again, a significant proportion of ham gets a few points.
So yes, if b...@example.com has never emailed me except for a bunchof spam, then yeah, the message is going to get bumped up in itsscore, but how often does that happen? Does that ever happen?
Happens for me all the time. I get dictionary spam with a randomclient's address as sender, and then I get an inquiry from theclient about all these 'bounces' they are receiving. Naturally, theyquote the bounce, which includes some spam sign, and the client isoff to a good start with a moderately spammy mail to me. (smile)
But bob could also e-mail you three or four times, getting a smallpositive score, then you get spammed "from Bob" with high scoresfrom a botnet (and I usually get several copies of a spam likethat), and the next time bob e-mails, he gets logic 2.a.ii sppliedabove for each and every AWL for his address. Could be hefty....

Er.. ok. Perhaps I am misunderstanding the AWL. As I understand it,if a bunch of spam comes in from a server with average scores of 7.0and a new message comes in with a score of 4, it will have a POSITIVEAWL applied to normalize at 7.0. If a message comes from a knowsender with an average score of 2, and this email scores 4, it willget a NEGATIVE AWL score to normalize closer to 2.0, right? Sincethis is a negative AWL 2.a.ii would not apply because the AWL isnegative, so section 2 is skipped entirely and we are at 3. AWL isnegative => {crickets}.

Also, lets say b...@example.com sends a message after a bunch ofspams have been sent, and say that message scores -1.0, plus an AWLadjustment of 5.0 based on the above.
I'm sure there are some people who *would* 'fit your model' and havenegative scores on their legit mail and not be hurt by the proposedrule.

I think we are talking at cross purposes, and that's likely my fault.I am talking about the AWL adjustment being either positive ornegative. Mail that is more spammy than usual will get penalized up.Mail that is less spammy than usual will not be affected.

Which, for any yahoo mailing list will be a different server manytimes.And so if your yahoo list scores slightly positive, all thosedifferent yahoo servers will all add to the score. Ditto hotmail,gmail, etc.

OK, if the value is 0.1 then it would take up to 50 outbound serverswith even distribution to add 5.0 points.

I can see what you *want* to do. I just don't see a practical way todo it.

That's quite possible. As I said initially, it's jut an idea I had tomake the AWL penalize botnets much more. If it can't be done, that'sfine. I think there's some promise here though.

I'm not married to this idea, I just think there's something here thatmight be worth trying.


--
These budget numbers are not just estimates, these are the actual
        results for the fiscal year that ended February the 30th.
        - GWB

Re: 'anti' AWL

Reply via email to