On Tue 14/Feb/2023 23:42:36 +0100 Scott Kitterman wrote:
On Tuesday, February 14, 2023 4:16:00 PM EST Evan Burke wrote:
On Tue, Feb 14, 2023 at 11:44 AM Michael Thomas <m...@mtcc.com> wrote:
On Tue, Feb 14, 2023 at 11:18 AM Michael Thomas <m...@mtcc.com> wrote:
Have you considered something like rate limiting on the receiver side for
things with duplicate msg-id's? Aka, a tar pit, iirc?
I believe Yahoo does currently use some sort of count-based approach to
detect replay, though I'm not clear on the details.
As I recall that technique is sometimes not suggested because (a) we can't
come up with good advice about how long you need to cache message IDs to
watch for duplicates, and (b) the longer that cache needs to live, the
larger the resource burden the technique imposes, and small operators
might not be able to do it well.
At maximum, isn't it just the x= value? It seems to me that if you don't
specify an x= value, or it's essentially infinite, they are saying they
don't care about "replays". Which is fine in most cases and you can just
ignore it. Something that really throttles down x= should be a tractable
problem, right?
The ratio between duplicate count and x= is the spamming speed.
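As arithmetic, that ratio works out like this. A minimal sketch, assuming t= and x= are the usual epoch-second tag values from RFC 6376 and a hypothetical receiver-side duplicate threshold (the function name and threshold are illustrative, not from any draft):

```python
def max_replay_rate(t_tag, x_tag, dup_threshold):
    """Duplicates per second an attacker can send while the signature
    is still valid without exceeding the receiver's threshold.
    t_tag and x_tag are epoch seconds from the t= and x= tags."""
    window = x_tag - t_tag
    if window <= 0:
        return 0.0  # already expired: no replay window at all
    return dup_threshold / window

# A signature valid for one hour, against a receiver tolerating 600
# duplicates, caps a replayer at 10 copies per minute.
rate = max_replay_rate(t_tag=0, x_tag=3600, dup_threshold=600)
```

So a sender who really throttles down x= directly shrinks the rate at which a replay can stay under any fixed duplicate threshold.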
But even at scale it seems like a pretty small database in comparison to
the overall volume. It would be easy for a receiver to just prune it
after a day or so, say.
I think count-based approaches can be made even simpler than that, in fact.
I'm halfway inclined to submit a draft using that approach, as time permits.
I suppose if the thresholds are high enough, it won't hit much in the way of
legitimate mail (as an example, I anticipate this message will hit at least
hundreds of mailboxes at Gmail, but not millions), but of course letting the
first X through isn't ideal.
Scott's message hit my server exactly once. Counting is a no-op for small
operators.
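The count-based approach being discussed could be sketched roughly as below. This is a hedged illustration only: the key (d= domain plus signed Message-ID), the threshold, and the one-day TTL are assumptions standing in for whatever a real receiver or a future draft would pick.

```python
import time

class ReplayCounter:
    """Count deliveries of the same signed message and flag suspected
    replays past a threshold; entries are pruned after a TTL."""

    def __init__(self, ttl_seconds=86400, threshold=1000):
        self.ttl = ttl_seconds          # "a day or so"
        self.threshold = threshold      # illustrative dupe limit
        self.seen = {}                  # (d_domain, msg_id) -> (first_seen, count)

    def observe(self, d_domain, msg_id, now=None):
        """Record one delivery; return True if it looks like a replay."""
        now = time.time() if now is None else now
        key = (d_domain, msg_id)
        first_seen, count = self.seen.get(key, (now, 0))
        self.seen[key] = (first_seen, count + 1)
        return count + 1 > self.threshold

    def prune(self, now=None):
        """Drop entries older than the TTL (run periodically)."""
        now = time.time() if now is None else now
        self.seen = {k: (t, c) for k, (t, c) in self.seen.items()
                     if now - t < self.ttl}
```

For a small operator the table stays tiny, since almost every key is seen exactly once and ages out within a day.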
If I had access to a database of numerically scored IP reputation values (I
don't currently, but I have in the past, so I can imagine this at least), I
think I'd be more inclined to look at the reputation of the domain as a whole
(something like average score of messages from an SPF validated Mail From,
DKIM validated d=, or DMARC pass domain) and the reputation of the IP for a
message from that domain and then if there was sufficient statistical confidence
that the reputation of the IP was "bad" compared to the domain's reputation I
would infer it was likely being replayed and ignore the signature.
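That "sufficient statistical confidence" test could be sketched as a plain two-sample comparison. Everything here is hypothetical: the scoring scale (higher = worse), the sample shapes, and the z threshold are stand-ins, since no concrete reputation database is specified above.

```python
import math

def ip_looks_replayed(ip_scores, domain_scores, z_threshold=3.0):
    """Return True if the IP's mean per-message score is worse than the
    domain's by more than z_threshold standard errors (a Welch-style
    z comparison on two samples of recent message scores)."""
    if len(ip_scores) < 2 or len(domain_scores) < 2:
        return False  # not enough data for any confidence

    def mean_var(xs):
        m = sum(xs) / len(xs)
        v = sum((x - m) ** 2 for x in xs) / (len(xs) - 1)
        return m, v

    m_ip, v_ip = mean_var(ip_scores)
    m_dom, v_dom = mean_var(domain_scores)
    se = math.sqrt(v_ip / len(ip_scores) + v_dom / len(domain_scores))
    if se == 0:
        return m_ip > m_dom
    return (m_ip - m_dom) / se > z_threshold
```

The small-sample guard is the weak point, and it cuts both ways, as the next reply points out: an almost silent MTA never accumulates enough samples to separate it from a replayer with confidence.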
Some random forwarder in Nebraska can be easily mistaken for a spammer that
way. Reputation is affected by email volume. Even large operators have little
knowledge of almost silent MTAs.
Having senders' signatures convey the perceived risk of an author would
contribute an additional evaluation factor here. Rather than discarding
validated signatures, receivers would have an indication of how to weight
them. (In that respect, let me note the usage of ARC as a sort of
second-class DKIM, for when the signer knows nothing about the author.)
I think that approaches the same effect as a "too many dupes" approach without
the threshold problem. It does require reputation data, but I assume any
entity of a non-trivial size either has access to their own or can buy it from
someone else.
DNSWLs exist.
Best
Ale
--
_______________________________________________
Ietf-dkim mailing list
Ietf-dkim@ietf.org
https://www.ietf.org/mailman/listinfo/ietf-dkim