Thanks guys. What I am going to ask might be a longshot.
But is it possible for anyone who is running a mailserver to give a list of source of SPAM (recent , anytime this year)and the SA score associated? It will be extremely useful for my research and credit would be given. Example:- efetunisie.org,6.3 abcxcf.com,5.7 . . . You might think that there will be privacy issues, but I am asking only for SPAM mails which would be filtered anyways. I need a large corpus of mails for evaluating my technique. On Tue, May 31, 2016 at 8:55 AM, Bowie Bailey <bowie_bai...@buc.com> wrote: > On 5/31/2016 1:38 AM, @lbutlr wrote: > >> On May 30, 2016, at 11:06 PM, Shivram Krishnan <rorryk...@gmail.com> >> wrote: >> >>> 2) I have set a threshold of -10 to see how spamassassin assigns a score >>> for every mail. >>> >> No. Do not do this. >> > > Instead, set this option in your local.cf file: > > add_header all Report _REPORT_ > > This will make SA add a report header to all emails so you can see how > they score. As a plus, you will also see whether it's marked as ham or > spam, which you lose by artificially lowering the threshold. > > You can also use this if you want more info on Bayes scoring (should be > all one line): > > add_header all Bayes bayes=_BAYES_, > N=_BAYESTC_(_BAYESTCLEARNED_-_BAYESTCHAMMY_+_BAYESTCSPAMMY_), > ham=(_HAMMYTOKENS(5,short)_), spam=(_SPAMMYTOKENS(5,short)_) > > -- > Bowie > >