Re: clamav-unofficial-sigs not helping in a spam flood

Bill Cole Fri, 25 Mar 2016 21:57:07 -0700

On 24 Mar 2016, at 13:50, Yves Goergen wrote:

Hello,
I'm getting more and more spam every day and SpamAssassin can't handleit. Most of it looks very similar but it isn't filtered out.


Have you tried creating local rules for it?

I can't share the rules I've created for *some* of these families ofmalware-connected spam, but because the worst of them (spreadingransomware) are produced programmatically in bulk, they have very strongsimilarities that make multiline 'rawbody' rules helpful as well ascase-sensitive header checks looking for idiosyncratic combinations ofuncommon minor details.

That's vague on purpose because: spammers are known to change behaviorbased on posts here and on other, even notionally "private", anti-spamlists; these particular spam genera have morphed over time and so needto be treated as moving targets with regular rule adjustments andadditions; and the specific best ruleset I've created for these weredone in an environment where they are legally not mine to share,especially in a place where I know spammers look for ways to evadefilters, making those rules obsolete faster.

I can't speak to the ClamAV issue because I don't use the extra sigs andhave come to expect very little of ClamAV. Maybe ask on a ClamAV list?

What other solutions are there to improve the detection rate ofSpamAssassin? My current spam-to-useful ratio in some mailboxes issomewhere around 10:1.

That implies that you are probably underutilizing spam-control measuresin your MTA. I manage a diverse set of mail systems running multipleMTAs and in all cases the most effective anti-spam measure against ALLspam is delaying the initial greeting banner, which is a mandatoryoption for a MTA to be fit for use exposed to the modern Internet. Laterin the message you say you use Exim, which I believe has such a feature,but I am not sure of that. The ideal delay to use is a matter of debatebecause apparently the subtleties of how the delay is done matters, but5 seconds is usually a reasonable delay to catch most spambots and youdon't start to really impair valid mail due to delays until you go above15s.

Close behind a greeting delay, the use of high-accuracy DNSBLs isindispensable: I use Spamhaus Zen (as well as their DROP+EDROP lists inthe network layer to simple never see the listed nets)ix.dnsbl.manitu.net, and psbl.surriel.com. Note that you CANNOT safelyuse many of these in the same ways on outbound mail submitted by yourown users and inbound mail for local delivery. The same is true of manyof the following measures as well. If you are not strictly segregatinginitial submission to a suitably configured port 587 MSA forauthenticated users so that port 25 SMTP is only inbound mail fromrelative strangers, your spam control will be harder to do safely orwell. Your own authenticated users MIGHT send spam, but some of thetactics that work best before letting SpamAssassin see a message areessentially detection of machines that *should* only be sending mailthough an authenticating MSA, not directly to a remote MTA unfamiliarwith them.

I'm not entirely familiar with the other options Exim offers forrejecting spam, but right behind the banner delay and DNSBLs for me arerefusing mail from hosts that HELO/EHLO badly. Systems differ in whatthey can do in that area, but where I use this most aggressively(Postfix systems) I reject mail from hosts that HELO in strictly invalidways that that use idiosyncratically wrong or spammer-associated ways:remote systems claiming one of my names or IP addresses, using a .localname and most unqualified names (with a whitelist for special cases, IPliterals, and as a variety of valid names whose owners have said nomachine anywhere would ever HELO with the name (e.g. "mail.com") andvarious "generic address" patterns where the hostname is derived fromthe client IP.

Behind that, rejecting mail from sending IP's with no PTR records isalmost entirely safe on the modern net, and it is even getting safer (asmore people use it) to require the PTR names to resolve back to the IPof the client machine. On systems where I can, I only check for anexisting PTR, but on systems where only the stronger check is available,the rejections of valid mail have been declining over the last few yearsand the legit systems who keep that problem for more than a few days arequite rare.

As a result, the mail systems I run reject mail at RCPT time and in somecases at connect time from 50-90% of all of their SMTP connections. Soonly 10-50% of potential mail is even seen by SpamAssassin or any othermessage content filter This makes it feasible to do more expensivefiltering in SA (such as AWL or TxRep, Bayes, complex local rules, andURIBLs) because SA is spared from seeing the bulk of the worst stuff.

That's close to the point of abandoning e-mail and reverting totelephone and snailmail. The rate of spam phone calls is a lot lower,and that's not considering the filter.
Examples of the subjects from the recent days:

   FW: Order RF#391032
   Document2
   FW: Payment Receipt
   Sixt Invoice: 6502444876 from 24.03.2016
   Attached document(s)
   FW: Payment Details - [223434]
   Image9876411149045.pdf
   Voicemail from 07730881627 <07730881627> 00:00:24
   FW: Order Status #022412
   FW: Payment #092161
   FW: Confirmation #388194
All of the messages have attachments, but I can't block allattachments completely.

But you may be able to block some. For example, my favorite tool forhooking SA into Postfix and Sendmail is MIMEDefang, a milter which Ithink rules out use with Exim, but in it a few lines of Perl which couldprobably be converted to a set of SA rules and meta-rules rather simplyreject mail if it contains any of about a dozen Windows filetypes orparticular names that are directly executable (.exe,.com, etc.) or havebeen widespread malware vectors and have no business in mail fromstrangers (.chm, winmail.dat,.js, etc.). Checking the relevant MIMEheaders using a 'full' type rule should allow you to exclude some types.Obviously PDFs and MS Office docs are a headache because they are bothchronic malware vectors AND mailed around all the time innocently, butblocking .js files (recently quite popular as a vector) isn't so bad: ifpeople want to share JavaScript code they should use other means. Toomany MUAs today have failed to learn from MS's blunders and essentiallywill execute scripts received in mail and referenced by HTML in thatmail. Not most desktop MUAs, but webmail (which IS a MUA) is often quitesloppy.

If you are not training and using SA's Bayes component you are cripplingSA. It needs some adjustment (e.g. make the ham autolearn thresholdslightly negative and for most sites reducing the spam autolearnthreshold also helps) and it also needs some initial and routinehuman-driven training: have a means for users to submit spam theyrecognize as spam but SA didn't and if you don't reject spam but rathertag it or deliver to a spam folder, a means for them to submit thosemistakes as well. Depending on the details of your delivered mailstoreand how users use it, it may be possible to identify how they handlespam and how they handle ham, and train on that basis. In rare caseswith just the right sort of users you might even be able to train THEMto handle spam and ham in specific ways so that you can automate findingit and feeding it to the Bayes learner.

If you are not using sa-update daily, start doing so now. Rules getadded, changed, and score-adjusted whenever the project has enough freshham & spam input to trust their automated tools for retuning the rulesto the current nature of ham & spam. This is a huge improvement over thepractice of tweaking the scores of the core public ruleset yourself

Finally: use one of the SA site-specific sender reputation tools: AWLand its successor TxRep. I confess that I have not yet converted anysystems from AWL to the better TxRep, but the same recommendationapplies to both: enable one or the other and after a week or two,especially with a well-trained Bayes DB, you may be able to drop yourspam threshold by a whole point safely.

Does grey-listing still work today?

Reportedly, yes, if it is done correctly. Unfortunately, the originalsimple concept has proven to have a number of edge & corner cases thatcan require you to set up things in a complex mail system that youotherwise would not need to, such as a reliable database with sharedaccess if you have more than one host acting as an MX. I don't use itbecause I've never been desperate enough to make mail routinely delayedat that scale.

Is there an easy way to enable it in either SpamAssassin or Exim? Idon't want to fiddle around with databases and such for days in arunning system.

Simple answer: SA definitely not because SA isn't a greylisting tool. Iwould *GUESS* that Exim can't do it without substantial effort becausesoundly-implemented greylisting is a subtle mechanism that almost neveris directly embedded in an MTA but rather is hooked in externally andjust that process of getting the integration right can be a chore.

Re: clamav-unofficial-sigs not helping in a spam flood

Reply via email to