Re: using URIBL on other headers

Rob McEwen Sun, 23 Sep 2018 16:02:31 -0700

On 9/22/2018 5:55 PM, Michael Grant wrote:

The URIBL plugin looks for URLs in the subject and message body.
Is there some way to coax it to look in the other headers as well, forexample the From: Reply-to: or the Received headers?



Michael,

This reminds me of that saying, "just because you can, doesn't mean youshould" - and along those lines, I have some interesting observationsabout this:

(1) some URI/domain blacklists are ONLY intended for blocking on thedomain or IP that is at the base of clickable links inside the body ofthe message. These will often have a small (but critical) uptick infalse positives if used to check against domains found in the SMTPenvelope (FROM, PTR record, HELO), with typically a very small increasein additional spams blocked. SO BE CAREFUL -AND- if you use a URI/domainblacklist in that way and they don't prescribe that type of usage, don'tcomplain to them or anyone about any resulting false positives - becauseit would then be your MIS-usage off their list that caused those falsepositive.

(2) Even so, there really are SOME series of spams that can be safelyblocked based on domains that are in the SMTP envelope (FROM, PTRrecord, HELO). In some cases, these are snowshoe spammers who aresending from their own spammy domain - but where this domain is NOTfound in a clickable link inside the body of the message - they reallyare trying to get the user to hit "reply". So there really is a purposefor this, even if it is is a very small percentage of all spam

(3) However, even with that being a very small percentage of all - LARGEmail hosters LOVE THIS IDEA? Why? Because it is SO EFFICIENT for them tobe able to block MORE spam based on information in the SMTP envelope -BEFORE the "data" command. Sometimes, this helps block messages wherethe domain was in a clickable link inside the body of the message - butit is still MORE EFFICIENT to block that based on the domain also beingin the SMTP envelop.

(4) ABOUT THOSE FALSE POSITIVES: One of the main reasons that this is sorisky for False Positives... is because two things are epidemic inrecent years: (a) web site gets hijacked by criminal spammer, whoinstalled pages there that redirect to pornographic dating sites or pillspam websites -AND/OR- (b) email account on the mail server getscredentials hijacked and starts spewing spam. HERE IS THE PROBLEM:*MOST* of the time, one or the other happens, (a or b) but not both.Therefore, if (a) happens, they are sure to land on traditional URIblacklists like SURBL, URIBL, and ivmURI. But this company - whose website was hacked - might not have a single spam coming from their mailserver. Yet, if you do the SMTP envelope checking against such URIblacklists - you're going to have a substantially higher amount of falsepositives due to blocking ALL of those emails that merely have a "FROM"address ending in that domain name - even though NONE of THOSE messagesare spam.

(5) So which lists *DO* support blocking on the SMTP envelope? Spamhaus'DBL list is designed for this. However, invaluement's ivmURI list is NOTsupposed to be used in this manner. SURBL and URIBL were originallydesigned to not be used in this way - but that might have changed inrecent years? I recommend checking on that. In the meantime, I recommend*ONLY* using Spamhaus' DBL list in this way. (possibly SURBL or URIBLtoo? - but double check on that!)

(6) QUESTION: So why would a list not support both blocking methods? Forexample, why wouldn't ivmURI support this method?

ANSWER: What Spamhaus did with DBL, while interesting, put them at astrategic disadvantage, and there isn't a thing they can do about thatwithout making fundamental changes to their strategy. Recall that falsepositive scenario mentioned earlier, where a hacked web site causing aURI-list blacklisting can lead to substantially more false positives dueto only hitting on legit mails when blocking based on this domain beingin the SMTP envelope? Well.. the OPPOSITE situation ALSO causes morefalse positives. When their email system has a hijacked email account,but their web site was NOT hacked - then domain blacklists thatprescribe BOTH blocking methods and blacklist that domain... are goingto then start blocking ALL messages that have that domain as a hyperlinkinside the body of the message, even if THOSE messages are legit. Thiswill then cause a substantial number of false positives that were notpart of those hijacked outbound messages. So this works both ways. Theproblem with such domain blacklists that prescribe both uses... is thatthey either have to settle for (a) more false positives -OR- (b) morefalse negatives. In other words, the higher collateral damage potentialmeans that there is going to be more collateral damage when they "takethe bait" and blacklist the domain -OR- their desire to limit falsepositives will cause them to defer on the listing - even though it wouldhave been an excellent and justified ratio of spam-to-ham blocked, withlittle collateral damage if the mail systems using that list could haveONLY blocked using one method or the other, NOT both! DBL likely errs onthe side of less collateral damage - so it is should be safe to use DBLfor blocking based on both methods, as they prescribe, especiallyconsidering Spamhaus' reputation for extremely low false positives.Then, other URI lists can pick up the slack on the occasional FalseNegatives.

(7) Given this information, at invaluement, we have solved this problemby creating a new domain blacklist ("ivmSED") that is independent ofivmURI, where ivmSED is a domain blacklist used ONLY for blocking basedon the domains found on the SMTP envelope (FROM, PTR record, HELO) - andwhere ivmSED NOT be used for blocking domains in clickable links in thebody of the message, since that is the job of our ivmURI list. That way,ivmSED and ivmURI are independent, and we then have the flexibility toblock a domain using either method independently, or both together, forthe approach that most surgically targets the spam, keeps collateraldamage to a minimum, and without compromises that lead to more falsenegatives. ivmSED has just recently entering beta testing. (SED ="Sender's Envelope Domain").


--
Rob McEwen
https://www.invaluement.com

Re: using URIBL on other headers

Reply via email to