spamd and network whitelisting

Clint Pachl Fri, 16 Dec 2016 06:22:34 -0800

I would like to share my 45-day experience with running spamd and myobservations and how I'm allowing mail from SMTP clusters to bypassspamd. Feedback and discussion would be greatly appreciated.

I have two domains that I have been using for my businesses: one is 13years old and the other is 8 years old. I have never had a spam problemuntil about six months ago. In October I was getting about 100-200 spamsper day per domain. The spam rate was increasing from month to month.All mail was going directly to my OpenSMTPd. I was not using filteringof any kind so the signal-to-noise was very low, and frustrating.

So I read the spamd and related man pages and enabled spamd on myfirewall on November 1. I was astonished! I literally got 6 spam emailsthat first week for both domains!

However, the big problem was, I also wasn't getting legitimate businessemails that were sent from SMTP clusters/pools. After studying my logs,tweaking spamd(8) flags, looking to external solutions (DNSBL, SPF,reverse IP verification), I had some observations and discovered somepatterns. Here's the solution I'd like to share:

I wrote two very small scripts: spamd-dnsbl and spamclusterd. Thesescripts work together to keep spam to a minimum while passing alllegitimate email (in my case so far).

1) spamd-dnsbl: Queries a DNSBL using the IPs in spamdb(8). If an IP ison a black list it is added as a TRAPPED entry in the spamdb. The scriptonly checks IPs which have been added since last run. Currently, onlythe zen.spamhaus.org DNSBL is queried because I found it to be the mosttrue of all those listed athttp://en.wikipedia.org/wiki/Comparison_of_DNS_blacklists.Alternatively, multiple DNSBLs could be queried and the results could beused in aggregate to determine spam status, thus promoted to TRAPPED.

2) spamclusterd: Queries spamdb(8) for networks to whitelist, which itadds to a pf table that bypasses spamd. So before this script getscarried away allowing IP blocks to bypass spamd, the spamdb(8) is firstpruned of spammers using the spamd-dnsbl script.

I've only been running this setup for about 30 days, but I haven'tmissed an email yet; plus spam is still about 1 per day across bothdomains. I receive emails from all the common SMTP clusters, such asGmail, Microsoft (hotmail.com, outlook.com, msn.com, etc.), and Yahoobut also US government agencies such as, mail.mil, usmc.mil, uscg.mil,irs.gov, etc.


I noticed a pattern of commonalities of these legitimate sending clusters:

1. The envelope's from and to addresses are identical across tuples.

2. The HELOs are very similar, with the TLD from each tuple almostcertainly the same.

3. They make multiple attempts from different IP addresses, however, theIPs differ only by a few bits. (Caveat: I'm only using IPv4)

These 3 points are the basis of spamclusterd. How it works is, if two ormore GREY tuples with matching "to" and "from" addresses, HELOs withmatching TLDs, and IPs with matching network bits (/24), then add the/24 network to the spamd-cluster table in pf, which bypasses spamd.

I was going to get fancy and do an SPF lookup and try to determine theexact network to whitelist, but simply whitelisting a 256 IP block seemsgood enough. Once in awhile the subsequent client IP will be outsidethis block, but the /24 seems to work better than 90% of the time.

Currently, just two client IPs from the same /24 network is enough toget that network whitelisted, which seems like a low bar. However, withthe prior DNSBL pruning, this seems sufficient for now.


## Some other observations ##

Spammers, even if sending from the same IP or IP network and regardlessof theTO address, tend to randomize the FROM and/or HELO. Therefore, in thecase of my spamclusterd script, whitelisting a spammer is less likelywhen ensuring both HELO and FROM match for multiple tuples. These IPswill then continue to deal with spamd, and it's business as usual.

I initially tried setting 1 minute passtime and 12 hour greyexp timesfor spamd (i.e. -G 1:12:864) in hopes to eventually whitelist a clientIP, originating from a cluster, that has reattempted within that largewindow. However, in my first week, I missed a couple of Gmails whichresent for 5+ days and ultimately failed to deliver. What wasinteresting was one of the Google server IPs retried after 12 hours and3 minutes, just missing the grey window, while others retried after 24hours. I now set -G 1:10:1080.

It seems safe to assume a spammer if reverse IP lookup returns NXDOMAINand IPis on at least 1 reputable DNSBL or lookup returns SERVFAIL after twoattempts.

Using SPF seems unreliable as of 11/22/16. Tested SPF on hundreds of IPsin spamdb using the ruby spf gem. More than half the IPs did not specifySPF or it failed in some

way.

If the envelope's "from" is our domain (i.e., to and from addresses arethe same domain), it is definitely a spammer because we only send ourmail to the submission port and never to the smtp port. For example,there are currently 217 grey entries and 31 meet this criteria. However,these spammers almost never resend so not worth it to blacklist themafter the first connection attempt. What would be best is if we couldblacklist these spammers upon first connection (for example, add flag tospamd(8) that doesn't allow email from ourselves because we authenticateand submit mail to submission port 587, which could use domains fromspamd.alloweddomains).

Thank you for reading this far. Please let me know if you would likeclarification or have questions. If there is interest in my scripts, Ican send those as well.

Thanks to all the developers who made spamd; an amazing, simple, clevertool.

spamd and network whitelisting

Reply via email to