Found a copy here … http://www.impsec.org/~jhardin/antispam/sa-stats.pl
So finally found the right one. It does seem to be all working ok - at least to my eye. ./Sa_Stats.pl --logdir /var/log --filename spamd.log --num 18 Email: 53 Autolearn: 14 AvgScore: 1.02 AvgScanTime: 6.14 sec Spam: 20 Autolearn: 0 AvgScore: 4.15 AvgScanTime: 5.29 sec Ham: 33 Autolearn: 14 AvgScore: -0.88 AvgScanTime: 6.65 sec Time Spent Running SA: 0.09 hours Time Spent Processing Spam: 0.03 hours Time Spent Processing Ham: 0.06 hours TOP SPAM RULES FIRED ------------------------------------------------------------------------------ RANK RULE NAME COUNT %OFMAIL %OFSPAM %OFHAM AVGSCO ------------------------------------------------------------------------------ 1 HTML_MESSAGE 20 52.83 100.00 24.24 4.15 2 SPF_PASS 17 43.40 85.00 18.18 3.76 3 DCC_CHECK 15 39.62 75.00 18.18 4.33 4 BAYES_50 14 26.42 70.00 0.00 3.86 5 RDNS_NONE 13 24.53 65.00 0.00 4.15 6 SPF_HELO_PASS 13 24.53 65.00 0.00 4.00 7 T_REMOTE_IMAGE 8 15.09 40.00 0.00 3.75 8 DKIM_SIGNED 6 45.28 30.00 54.55 3.17 9 BAYES_999 6 11.32 30.00 0.00 4.83 10 BAYES_99 6 11.32 30.00 0.00 4.83 11 DKIM_VALID 6 45.28 30.00 54.55 3.17 12 RP_MATCHES_RCVD 4 30.19 20.00 36.36 3.25 13 DKIM_VALID_AU 4 37.74 20.00 48.48 3.00 14 HTML_IMAGE_RATIO_02 3 5.66 15.00 0.00 3.67 15 MPART_ALT_DIFF_COUNT 3 5.66 15.00 0.00 6.67 16 MPART_ALT_DIFF 2 3.77 10.00 0.00 6.50 17 FROM_12LTRDOM 2 3.77 10.00 0.00 3.00 18 MORE_SEX 2 3.77 10.00 0.00 5.00 ------------------------------------------------------------------------------ TOP HAM RULES FIRED ------------------------------------------------------------------------------ RANK RULE NAME COUNT %OFMAIL %OFSPAM %OFHAM AVGSCO ------------------------------------------------------------------------------ 1 BAYES_00 32 60.38 0.00 96.97 -0.91 2 HEADER_FROM_DIFFERENT_DOMAINS 29 56.60 5.00 87.88 -0.83 3 DKIM_VALID 18 45.28 30.00 54.55 -0.78 4 DKIM_SIGNED 18 45.28 30.00 54.55 -0.78 5 DKIM_VALID_AU 16 37.74 20.00 48.48 -0.88 6 RP_MATCHES_RCVD 12 30.19 20.00 36.36 -1.08 7 HTML_MESSAGE 8 52.83 100.00 24.24 -0.88 8 DCC_CHECK 6 39.62 75.00 18.18 0.17 9 FREEMAIL_FORGED_FROMDOMAIN 6 11.32 0.00 18.18 -1.17 10 SPF_PASS 6 43.40 85.00 18.18 -1.17 11 FREEMAIL_FROM 6 13.21 5.00 18.18 -1.17 12 UNPARSEABLE_RELAY 3 5.66 0.00 9.09 -1.00 13 DEAR_SOMETHING 2 3.77 0.00 6.06 0.50 14 MSGID_FROM_MTA_HEADER 1 1.89 0.00 3.03 -1.00 15 HTML_FONT_LOW_CONTRAST 1 5.66 10.00 3.03 0.00 16 DKIM_ADSP_CUSTOM_MED 1 1.89 0.00 3.03 -1.00 17 BAYES_05 1 1.89 0.00 3.03 0.00 18 ALL_TRUSTED 1 1.89 0.00 3.03 -2.00 ------------------------------------------------------------------------------ > On 11 Mar 2016, at 15:33, Robert Chalmers <rob...@chalmers.com.au> wrote: > > > Just a note - that server address isn’t responding at the moment. Maybe > later.Hopefully only temporary. > > >> On 11 Mar 2016, at 14:59, Dave Funk <dbf...@engineering.uiowa.edu >> <mailto:dbf...@engineering.uiowa.edu>> wrote: >> >> TL;DR >> You want Dallas Engelken's "sa-stats.pl" NOT the one from SA. >> >> This is confusing because there are two different programs named >> "sa-stats.pl". >> >> The one that comes with SpamAssassin (what you're referring to) is an engine >> stats reporting tool; does not do rule hits analysis. >> >> The tool that Charles Sprickman and I used is the one from Dallas Engelken. >> See: http://wiki.apache.org/spamassassin/StatsAndAnalyzers >> <http://wiki.apache.org/spamassassin/StatsAndAnalyzers> >> be sure to search that page for reference to Dallas Engelken. >> >> >> >> On Fri, 11 Mar 2016, Robert Chalmers wrote: >> >>> The sa-stats.pl I refer to is here. >>> https://spamassassin.apache.org/full/3.0.x/dist/tools/sa-stats.pl >>> <https://spamassassin.apache.org/full/3.0.x/dist/tools/sa-stats.pl>. It’s >>> not the same as the ones shown in other posts. I don’t know what >>> that is. >>> and has an output like this. >>> zeus:~ robert$ perl sa-stats.pl >>> Report Title : SpamAssassin - Spam Statistics >>> Report Date : 2016-03-11 >>> Period Beginning : Fri 11 Mar 00:00:00 2016 >>> Period Ending : Sat 12 Mar 00:00:00 2016 >>> Reporting Period : 24.00 hrs >>> -------------------------------------------------- >>> Note: 'ham' = 'nonspam' >>> Total spam detected : 22 ( 51.16%) >>> Total ham accepted : 21 ( 48.84%) >>> ------------------- >>> Total emails processed : 43 ( 2/hr) >>> Average spam threshold : 3.00 >>> Average spam score : 4.46 >>> Average ham score : -2.10 >>> Spam kbytes processed : 397 ( 17 kb/hr) >>> Ham kbytes processed : 147 ( 6 kb/hr) >>> Total kbytes processed : 545 ( 23 kb/hr) >>> Spam analysis time : 339 s ( 14 s/hr) >>> Ham analysis time : 366 s ( 15 s/hr) >>> Total analysis time : 706 s ( 29 s/hr) >>> Statistics by Hour >>> ---------------------------------------------------- >>> Hour Spam Ham >>> ------------- ----------------- -------------- >>> 2016-03-11 00 0 ( 0%) 13 (100%) >>> 2016-03-11 01 0 ( 0%) 0 ( 0%) >>> 2016-03-11 02 2 (100%) 0 ( 0%) >>> 2016-03-11 03 4 (100%) 0 ( 0%) >>> 2016-03-11 04 4 ( 57%) 3 ( 42%) >>> 2016-03-11 05 6 ( 75%) 2 ( 25%) >>> 2016-03-11 06 6 (100%) 0 ( 0%) >>> 2016-03-11 07 0 ( 0%) 3 (100%) >>> 2016-03-11 08 0 ( 0%) 0 ( 0%) >>> 2016-03-11 09 0 ( 0%) 0 ( 0%) >>> 2016-03-11 10 0 ( 0%) 0 ( 0%) >>> 2016-03-11 11 0 ( 0%) 0 ( 0%) >>> 2016-03-11 12 0 ( 0%) 0 ( 0%) >>> 2016-03-11 13 0 ( 0%) 0 ( 0%) >>> 2016-03-11 14 0 ( 0%) 0 ( 0%) >>> 2016-03-11 15 0 ( 0%) 0 ( 0%) >>> 2016-03-11 16 0 ( 0%) 0 ( 0%) >>> 2016-03-11 17 0 ( 0%) 0 ( 0%) >>> 2016-03-11 18 0 ( 0%) 0 ( 0%) >>> 2016-03-11 19 0 ( 0%) 0 ( 0%) >>> 2016-03-11 20 0 ( 0%) 0 ( 0%) >>> 2016-03-11 21 0 ( 0%) 0 ( 0%) >>> 2016-03-11 22 0 ( 0%) 0 ( 0%) >>> 2016-03-11 23 0 ( 0%) 0 ( 0%) >>> Done. Report generated in 1 sec by sa-stats.pl, version 6256. >>> >>> On 10 Mar 2016, at 21:38, Erickarlo Porro <epo...@earthcam.com >>> <mailto:epo...@earthcam.com>> wrote: >>> I would like to know how to get these stats too. >>> From: Robert Chalmers [mailto:rob...@chalmers.com.au >>> <mailto:rob...@chalmers.com.au>] Sent: Tuesday, March 08, 2016 5:25 AM >>> To: users@spamassassin.apache.org <mailto:users@spamassassin.apache.org> >>> Subject: Re: Missed spam, suggestions? >>> Can I ask, how are you getting these stats please? >>> Thanks >>> On 8 Mar 2016, at 05:11, David B Funk <dbf...@engineering.uiowa.edu >>> <mailto:dbf...@engineering.uiowa.edu>> wrote: >>> On Mon, 7 Mar 2016, Charles Sprickman wrote: >>> >>> I’ve been running with some daily training for a little over a week >>> and I’m seeing less spam in my inbox. I’ve >>> seen a few things slip through because bayes tipped them below the >>> default score, these were two phishing emails. >>> >>> Here’s some rule stats for anyone interested: >>> >>> TOP SPAM RULES FIRED >>> >>> RANK RULE NAME COUNT %OFRULES %OFMAIL %OFSPAM >>> %OFHAM >>> >>> 1 TXREP 13171 8.47 40.38 91.00 >>> 72.91 >>> 2 HTML_MESSAGE 12714 8.18 38.98 87.85 >>> 90.80 >>> 3 DCC_CHECK 10593 6.81 32.48 >>> 73.19 33.78 >>> 4 RDNS_NONE 10269 6.60 31.48 >>> 70.95 5.63 >>> 5 SPF_HELO_PASS 10070 6.48 30.87 69.58 >>> 23.41 >>> 6 URIBL_BLACK 9711 6.25 29.77 67.10 >>> 1.58 >>> 7 BODY_NEWDOMAIN_FMBLA 9550 6.14 29.28 >>> 65.98 1.64 >>> 8 FROM_NEWDOMAIN_FMBLA 9483 6.10 29.07 >>> 65.52 1.36 >>> 9 BAYES_99 8486 5.46 26.02 >>> 58.63 1.18 >>> 10 BAYES_999 8141 5.24 24.96 >>> 56.25 1.06 >>> >>> TOP HAM RULES FIRED >>> >>> RANK RULE NAME COUNT %OFRULES %OFMAIL %OFSPAM >>> %OFHAM >>> >>> 1 HTML_MESSAGE 16473 9.13 50.51 87.85 >>> 90.80 >>> 2 DKIM_SIGNED 13776 7.64 42.24 13.81 >>> 75.93 >>> 3 TXREP 13228 7.33 40.56 91.00 >>> 72.91 >>> 4 DKIM_VALID 12962 7.19 39.74 11.93 >>> 71.44 >>> 5 RCVD_IN_DNSWL_NONE 9941 5.51 30.48 8.08 >>> 54.79 >>> 6 DKIM_VALID_AU 8711 4.83 26.71 7.99 >>> 48.01 >>> 7 BAYES_00 8390 4.65 25.72 >>> 1.84 46.24 >>> 8 RCVD_IN_JMF_W 7369 4.09 22.59 2.54 >>> 40.62 >>> 9 RCVD_IN_MSPIKE_WL 6713 3.72 20.58 >>> 4.39 37.00 >>> 10 BAYES_50 6201 3.44 19.01 >>> 25.56 34.18 >>> Based upon your stats it looks like you need more Bayes training. Your >>> Bayes 00/99 hits should rank higher in the rules-fired >>> stats and BAYES_50 shouldn't be in the top-10 at all. >>> (of course if you've only been training for a week that would explain it). >>> For example, here's my top-10 hits (for a one month interval). >>> TOP SPAM RULES FIRED >>> ---------------------------------------------------------------------- >>> RANK RULE NAME COUNT %OFMAIL %OFSPAM %OFHAM S/O >>> ---------------------------------------------------------------------- >>> 1 T__BOTNET_NOTRUST 114907 60.32 86.81 42.66 >>> 0.5755 >>> 2 BAYES_99 109138 32.98 82.45 0.01 >>> 0.9998 >>> 3 BAYES_999 104903 31.70 79.25 0.01 >>> 0.9999 >>> 4 HTML_MESSAGE 90850 79.41 68.63 86.59 >>> 0.3456 >>> 5 URIBL_BLACK 90845 27.61 68.63 0.27 >>> 0.9942 >>> 6 T_QUARANTINE_1 90640 27.40 68.47 0.02 >>> 0.9996 >>> 7 URIBL_DBL_SPAM 79152 24.02 59.79 0.17 >>> 0.9956 >>> 8 KAM_VERY_BLACK_DBL 74301 22.45 56.13 0.00 >>> 1.0000 >>> 9 L_FROM_SPAMMER1k 73667 22.26 55.65 0.00 >>> 1.0000 >>> 10 T__RECEIVED_1 72413 42.60 54.70 34.54 >>> 0.5135 >>> OP HAM RULES FIRED >>> ---------------------------------------------------------------------- >>> RANK RULE NAME COUNT %OFMAIL %OFSPAM %OFHAM S/O >>> ---------------------------------------------------------------------- >>> 1 BAYES_00 182674 56.03 2.11 91.97 >>> 0.0150 >>> 2 HTML_MESSAGE 171992 79.41 68.63 86.59 >>> 0.3456 >>> 3 SPF_PASS 136623 63.08 54.52 68.78 >>> 0.3457 >>> 4 T_RP_MATCHES_RCVD 130879 53.75 35.54 65.89 >>> 0.2644 >>> 5 T__RECEIVED_2 125492 53.76 39.62 63.18 >>> 0.2947 >>> 6 DKIM_SIGNED 114808 38.57 9.72 57.80 >>> 0.1008 >>> 7 DKIM_VALID 105385 34.70 7.16 53.06 >>> 0.0825 >>> 8 RCVD_IN_DNSWL_NONE 92951 29.90 4.56 46.80 >>> 0.0609 >>> 9 T__BOTNET_NOTRUST 84741 60.32 86.81 42.66 >>> 0.5755 >>> 10 KHOP_RCVD_TRUST 84623 26.44 2.19 42.60 >>> 0.0331 >>> Note how highly BAYES 00/99 ranked. What you don't see is that BAYES_50 is >>> way down in the mud (below 50 rank). >>> BTW, this is with a Bayes that is mostly fed via auto-learning. I >>> occasionally >>> hand feed corner cases that get mis-classified (usually things like >>> phishes, or conference announcments that can look shakey). >>> -- >>> Dave Funk University of Iowa >>> <dbfunk (at) engineering.uiowa.edu <http://engineering.uiowa.edu/>> >>> College of Engineering >>> 319/335-5751 FAX: 319/384-0549 1256 Seamans Center >>> Sys_admin/Postmaster/cell_admin Iowa City, IA 52242-1527 >>> #include <std_disclaimer.h> >>> Better is not better, 'standard' is better. B{ >>> >>> Robert Chalmers >>> rob...@chalmers.com.au <mailto:rob...@chalmers.com.au> Quantum Radio: >>> http://tinyurl.com/lwwddov <http://tinyurl.com/lwwddov> >>> Mac mini 6.2 - 2012, Intel Core i7,2.3 GHz, Memory:16 GB. El-Capitan 10.11. >>> XCode 7.2.1 >>> 2TB: Drive 0:HGST HTS721010A9E630. Upper bay. Drive 1:ST1000LM024 >>> HN-M101MBB. Lower Bay >>> Robert Chalmers >>> rob...@chalmers.com.au <mailto:rob...@chalmers.com.au> Quantum Radio: >>> http://tinyurl.com/lwwddov <http://tinyurl.com/lwwddov> >>> Mac mini 6.2 - 2012, Intel Core i7,2.3 GHz, Memory:16 GB. El-Capitan 10.11. >>> XCode 7.2.1 >>> 2TB: Drive 0:HGST HTS721010A9E630. Upper bay. Drive 1:ST1000LM024 >>> HN-M101MBB. Lower Bay >>> >> >> -- >> Dave Funk University of Iowa >> <dbfunk (at) engineering.uiowa.edu <http://engineering.uiowa.edu/>> >> College of Engineering >> 319/335-5751 FAX: 319/384-0549 1256 Seamans Center >> Sys_admin/Postmaster/cell_admin Iowa City, IA 52242-1527 >> #include <std_disclaimer.h> >> Better is not better, 'standard' is better. B{ > > Robert Chalmers > rob...@chalmers.com <mailto:rob...@chalmers.com>.au Quantum Radio: > http://tinyurl.com/lwwddov <http://tinyurl.com/lwwddov> > Mac mini 6.2 - 2012, Intel Core i7,2.3 GHz, Memory:16 GB. El-Capitan 10.11. > XCode 7.2.1 > 2TB: Drive 0:HGST HTS721010A9E630. Upper bay. Drive 1:ST1000LM024 HN-M101MBB. > Lower Bay > > > > Robert Chalmers rob...@chalmers.com <mailto:rob...@chalmers.com>.au Quantum Radio: http://tinyurl.com/lwwddov Mac mini 6.2 - 2012, Intel Core i7,2.3 GHz, Memory:16 GB. El-Capitan 10.11. XCode 7.2.1 2TB: Drive 0:HGST HTS721010A9E630. Upper bay. Drive 1:ST1000LM024 HN-M101MBB. Lower Bay