Re: Scoring question

2018-10-08 Thread Chuck McManis
Ok this is an interesting followup. Since I *know* that the gr_domain.cf
file is being read (I needed to change the '15' scores in there to 1.5 to
have a shot at receiving mail from providers like Mailchimp and Constant
Contact) I added the

score HTML_FONT_LOW_CONTRAST 10

To *that* file, recompiled rules, and restarted spamd and voila, it is now
scored at 10 rather the 0.1

And per other comments here a number of senders do the low contrast thing
so what I *really* want are the ones where it is practically *no* contrast,
is COLORx / (COLORx+1) LSB in one of the colors. (looking at source pages i
see background #ff and text #fe for example.

That suggests that local.cf is either not being read, or something comes
along and sets the parameters after it sets them? I note that gr_domain
comes lexically *before* local.cf so that is kind of confusing to me.

--Chuck


On Mon, Oct 8, 2018 at 11:19 AM John Hardin  wrote:

> On Mon, 8 Oct 2018, Chuck McManis wrote:
>
> > Hi,
> >
> > I restart spamd everytime I change local.cf and unfortunately John
> changing
> > the value as you've suggested and as I mentioned I had already tried,
> > doesn't actually change the score it assigns. Still trying to figure this
> > out.
>
> ...whoops, I didn't catch that, sorry...
>
> SA supports multiple config files, they are parsed in
> sorted-by-filename-order. Are there multiple .cf files in your local
> config folder and is the score for that rule being set in one that sorts
> after the one you're editing?
>
> Also: if the score was zero it would be suppressed entirely. The score is
> probably 0.001 (advisory).
>
> The base scores are:
>
> score HTML_FONT_LOW_CONTRAST 0.713 0.001 0.786 0.001
>
> ...so depending on your config that score may be coming from base, and
> your local changes *should* be overriding it.
>
> And - silly question - are you *sure* you're editing the *correct* local
> config file?
>
>
> > --Chuck
> >
> > On Mon, Oct 8, 2018 at 8:33 AM Kevin A. McGrail 
> wrote:
> >
> >> I think he might also need to restart spamd/amavisd/whatever to have the
> >> local.cf change take place.
> >> --
> >> Kevin A. McGrail
> >> VP Fundraising, Apache Software Foundation
> >> Chair Emeritus Apache SpamAssassin Project
> >> https://www.linkedin.com/in/kmcgrail - 703.798.0171
> >>
> >>
> >> On Mon, Oct 8, 2018 at 11:18 AM John Hardin  wrote:
> >>
> >>> On Mon, 8 Oct 2018, Chuck McManis wrote:
> >>>
>  I have been trying to tune scores to achieve better matches with spam
> >>> that
>  is getting through. And one test which shows up is
> >>> HTML_FONT_LOW_CONTRAST
>  which is being scored for 0. Doing a scan of my incoming mail flow
> this
> >>> is
>  a huge signal, perhaps even a disqualifying one as I have yet to find
> a
>  legitimate piece of email where this is true and it isn't spam.
> 
>  So in my local.cf file I added:
> 
>  score HTML_FONT_LOW_CONTRAST 10
> 
>  To have it add 10 points to the spam score, recompiled rules and
> >>> restarted
>  spamd. Yet it still gives it 0 points.
> 
>  0.0 HTML_FONT_LOW_CONTRAST BODY: HTML font color similar or identical
> to
> background
> 
>  How do I change this value?
> >>>
> >>> In your local config file:
> >>>
> >>>score   HTML_FONT_LOW_CONTRAST   1.000   # or whatever score you
> prefer
> >>>
> >>>
> >>> The reason it scores so low in the base ruleset is the S/O is *very*
> low
> >>> in the masscheck corpora - 0.114 - 4% spam hits vs. 31% ham hits.
> >>>
> >>> You might want to be careful if you intend to treat that as a poison
> pill
> >>> by itself...
> >>>
> >>> I'll take a look at whether its performance can be improved.
> >>>
> >>> --
> >>>   John Hardin KA7OHZ
> http://www.impsec.org/~jhardin/
> >>>   jhar...@impsec.orgFALaholic #11174 pgpk -a
> jhar...@impsec.org
> >>>   key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
> >>> ---
> >>>Politicians never accuse you of "greed" for wanting other people's
> >>>money, only for wanting to keep your own money.-- Joseph Sobran
> >>> ---
> >>>   557 days since the first commercial re-flight of an orbital booster
> >>> (SpaceX)
> >>>
> >>
> >
>
> --
>   John Hardin KA7OHZhttp://www.impsec.org/~jhardin/
>   jhar...@impsec.orgFALaholic #11174 pgpk -a jhar...@impsec.org
>   key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
> ---
>The call to let 16-year-olds vote is a call to amplify the votes
>of teachers' unions. If you think political indoctrination in the
>schools is bad now, wait until it has the direct power to tip
>election results.   -- Robert Tracinski
> -

Re: Scoring question

2018-10-08 Thread John Hardin

On Mon, 8 Oct 2018, Chuck McManis wrote:


Hi,

I restart spamd everytime I change local.cf and unfortunately John changing
the value as you've suggested and as I mentioned I had already tried,
doesn't actually change the score it assigns. Still trying to figure this
out.


...whoops, I didn't catch that, sorry...

SA supports multiple config files, they are parsed in 
sorted-by-filename-order. Are there multiple .cf files in your local 
config folder and is the score for that rule being set in one that sorts 
after the one you're editing?


Also: if the score was zero it would be suppressed entirely. The score is 
probably 0.001 (advisory).


The base scores are:

   score HTML_FONT_LOW_CONTRAST 0.713 0.001 0.786 0.001

...so depending on your config that score may be coming from base, and 
your local changes *should* be overriding it.


And - silly question - are you *sure* you're editing the *correct* local 
config file?




--Chuck

On Mon, Oct 8, 2018 at 8:33 AM Kevin A. McGrail  wrote:


I think he might also need to restart spamd/amavisd/whatever to have the
local.cf change take place.
--
Kevin A. McGrail
VP Fundraising, Apache Software Foundation
Chair Emeritus Apache SpamAssassin Project
https://www.linkedin.com/in/kmcgrail - 703.798.0171


On Mon, Oct 8, 2018 at 11:18 AM John Hardin  wrote:


On Mon, 8 Oct 2018, Chuck McManis wrote:


I have been trying to tune scores to achieve better matches with spam

that

is getting through. And one test which shows up is

HTML_FONT_LOW_CONTRAST

which is being scored for 0. Doing a scan of my incoming mail flow this

is

a huge signal, perhaps even a disqualifying one as I have yet to find a
legitimate piece of email where this is true and it isn't spam.

So in my local.cf file I added:

score HTML_FONT_LOW_CONTRAST 10

To have it add 10 points to the spam score, recompiled rules and

restarted

spamd. Yet it still gives it 0 points.

0.0 HTML_FONT_LOW_CONTRAST BODY: HTML font color similar or identical to
   background

How do I change this value?


In your local config file:

   score   HTML_FONT_LOW_CONTRAST   1.000   # or whatever score you prefer


The reason it scores so low in the base ruleset is the S/O is *very* low
in the masscheck corpora - 0.114 - 4% spam hits vs. 31% ham hits.

You might want to be careful if you intend to treat that as a poison pill
by itself...

I'll take a look at whether its performance can be improved.

--
  John Hardin KA7OHZhttp://www.impsec.org/~jhardin/
  jhar...@impsec.orgFALaholic #11174 pgpk -a jhar...@impsec.org
  key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
---
   Politicians never accuse you of "greed" for wanting other people's
   money, only for wanting to keep your own money.-- Joseph Sobran
---
  557 days since the first commercial re-flight of an orbital booster
(SpaceX)







--
 John Hardin KA7OHZhttp://www.impsec.org/~jhardin/
 jhar...@impsec.orgFALaholic #11174 pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
---
  The call to let 16-year-olds vote is a call to amplify the votes
  of teachers' unions. If you think political indoctrination in the
  schools is bad now, wait until it has the direct power to tip
  election results.   -- Robert Tracinski
---
 557 days since the first commercial re-flight of an orbital booster (SpaceX)


Re: Scoring question

2018-10-08 Thread Chuck McManis
On Mon, Oct 8, 2018 at 9:52 AM RW  wrote:

> Are you sure that you are actually using spamd? You wouldn't be the
> first to run an unnecessary spamd instance while the actual
> classifications are done in a different daemon, such as amavisd, using
> the spamassassin libraries.
>

That is a fair question, I believe I am and I'll give you two different
ways that I can reason to that conclusion.

The first is that I'm running Postfix. My master.cf in Postfix  has the
following chain:
smtp -> smtpd - o content_filter=spamfilter
spamfilter -> pipe /usr/local/bin/spamck -f ${sender} ${recipient}

/usr/local/spamck has in it:
cat | /usr/local/bin/spamc -u filter > checked file.

And spamc sends mail through spamd in order to check it.

Second set of reasoning.
When I first started using spamassassin I discovered that the gr_domain.cf
file was poison pilling the mail list vendors Constant Contact and
Mailchimp (among others). Since I have users that actually get mail they
want from those providers I went in and manually dialed back the scores.
That followed by sa-compile then restart, and  the new scores show up in
the spam header and they stopped flagging a false positives.

That said, and it reminded me that I had also tried whitelisting some
domains in local.cf (being victims of this aggressive scoring on the
gr_domain.cf file part) and that whitelisting did *not* work. So I suppose
it is possible that my local.cf file isn't actually being used by spamd.
(which seems odd, but I'm not sure how to check that definitively)

--Chuck


Re: Scoring question

2018-10-08 Thread RW
On Mon, 8 Oct 2018 09:11:17 -0700
Chuck McManis wrote:

> Hi,
> 
> I restart spamd everytime I change local.cf and unfortunately John
> changing the value as you've suggested and as I mentioned I had
> already tried, doesn't actually change the score it assigns. Still
> trying to figure this out.

Are you sure that you are actually using spamd? You wouldn't be the
first to run an unnecessary spamd instance while the actual
classifications are done in a different daemon, such as amavisd, using
the spamassassin libraries. 


Re: Scoring question

2018-10-08 Thread Chuck McManis
Hi,

I restart spamd everytime I change local.cf and unfortunately John changing
the value as you've suggested and as I mentioned I had already tried,
doesn't actually change the score it assigns. Still trying to figure this
out.

--Chuck

On Mon, Oct 8, 2018 at 8:33 AM Kevin A. McGrail  wrote:

> I think he might also need to restart spamd/amavisd/whatever to have the
> local.cf change take place.
> --
> Kevin A. McGrail
> VP Fundraising, Apache Software Foundation
> Chair Emeritus Apache SpamAssassin Project
> https://www.linkedin.com/in/kmcgrail - 703.798.0171
>
>
> On Mon, Oct 8, 2018 at 11:18 AM John Hardin  wrote:
>
>> On Mon, 8 Oct 2018, Chuck McManis wrote:
>>
>> > I have been trying to tune scores to achieve better matches with spam
>> that
>> > is getting through. And one test which shows up is
>> HTML_FONT_LOW_CONTRAST
>> > which is being scored for 0. Doing a scan of my incoming mail flow this
>> is
>> > a huge signal, perhaps even a disqualifying one as I have yet to find a
>> > legitimate piece of email where this is true and it isn't spam.
>> >
>> > So in my local.cf file I added:
>> >
>> > score HTML_FONT_LOW_CONTRAST 10
>> >
>> > To have it add 10 points to the spam score, recompiled rules and
>> restarted
>> > spamd. Yet it still gives it 0 points.
>> >
>> > 0.0 HTML_FONT_LOW_CONTRAST BODY: HTML font color similar or identical to
>> >background
>> >
>> > How do I change this value?
>>
>> In your local config file:
>>
>>score   HTML_FONT_LOW_CONTRAST   1.000   # or whatever score you prefer
>>
>>
>> The reason it scores so low in the base ruleset is the S/O is *very* low
>> in the masscheck corpora - 0.114 - 4% spam hits vs. 31% ham hits.
>>
>> You might want to be careful if you intend to treat that as a poison pill
>> by itself...
>>
>> I'll take a look at whether its performance can be improved.
>>
>> --
>>   John Hardin KA7OHZhttp://www.impsec.org/~jhardin/
>>   jhar...@impsec.orgFALaholic #11174 pgpk -a jhar...@impsec.org
>>   key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
>> ---
>>Politicians never accuse you of "greed" for wanting other people's
>>money, only for wanting to keep your own money.-- Joseph Sobran
>> ---
>>   557 days since the first commercial re-flight of an orbital booster
>> (SpaceX)
>>
>


Re: Scoring question

2018-10-08 Thread Kevin A. McGrail
I think he might also need to restart spamd/amavisd/whatever to have the
local.cf change take place.
--
Kevin A. McGrail
VP Fundraising, Apache Software Foundation
Chair Emeritus Apache SpamAssassin Project
https://www.linkedin.com/in/kmcgrail - 703.798.0171


On Mon, Oct 8, 2018 at 11:18 AM John Hardin  wrote:

> On Mon, 8 Oct 2018, Chuck McManis wrote:
>
> > I have been trying to tune scores to achieve better matches with spam
> that
> > is getting through. And one test which shows up is HTML_FONT_LOW_CONTRAST
> > which is being scored for 0. Doing a scan of my incoming mail flow this
> is
> > a huge signal, perhaps even a disqualifying one as I have yet to find a
> > legitimate piece of email where this is true and it isn't spam.
> >
> > So in my local.cf file I added:
> >
> > score HTML_FONT_LOW_CONTRAST 10
> >
> > To have it add 10 points to the spam score, recompiled rules and
> restarted
> > spamd. Yet it still gives it 0 points.
> >
> > 0.0 HTML_FONT_LOW_CONTRAST BODY: HTML font color similar or identical to
> >background
> >
> > How do I change this value?
>
> In your local config file:
>
>score   HTML_FONT_LOW_CONTRAST   1.000   # or whatever score you prefer
>
>
> The reason it scores so low in the base ruleset is the S/O is *very* low
> in the masscheck corpora - 0.114 - 4% spam hits vs. 31% ham hits.
>
> You might want to be careful if you intend to treat that as a poison pill
> by itself...
>
> I'll take a look at whether its performance can be improved.
>
> --
>   John Hardin KA7OHZhttp://www.impsec.org/~jhardin/
>   jhar...@impsec.orgFALaholic #11174 pgpk -a jhar...@impsec.org
>   key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
> ---
>Politicians never accuse you of "greed" for wanting other people's
>money, only for wanting to keep your own money.-- Joseph Sobran
> ---
>   557 days since the first commercial re-flight of an orbital booster
> (SpaceX)
>


Re: Scoring question

2018-10-08 Thread John Hardin

On Mon, 8 Oct 2018, Chuck McManis wrote:


I have been trying to tune scores to achieve better matches with spam that
is getting through. And one test which shows up is HTML_FONT_LOW_CONTRAST
which is being scored for 0. Doing a scan of my incoming mail flow this is
a huge signal, perhaps even a disqualifying one as I have yet to find a
legitimate piece of email where this is true and it isn't spam.

So in my local.cf file I added:

score HTML_FONT_LOW_CONTRAST 10

To have it add 10 points to the spam score, recompiled rules and restarted
spamd. Yet it still gives it 0 points.

0.0 HTML_FONT_LOW_CONTRAST BODY: HTML font color similar or identical to
   background

How do I change this value?


In your local config file:

  score   HTML_FONT_LOW_CONTRAST   1.000   # or whatever score you prefer


The reason it scores so low in the base ruleset is the S/O is *very* low 
in the masscheck corpora - 0.114 - 4% spam hits vs. 31% ham hits.


You might want to be careful if you intend to treat that as a poison pill 
by itself...


I'll take a look at whether its performance can be improved.

--
 John Hardin KA7OHZhttp://www.impsec.org/~jhardin/
 jhar...@impsec.orgFALaholic #11174 pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
---
  Politicians never accuse you of "greed" for wanting other people's
  money, only for wanting to keep your own money.-- Joseph Sobran
---
 557 days since the first commercial re-flight of an orbital booster (SpaceX)


Re: Scoring question

2007-08-14 Thread Matt Kettler
Rick Zeman wrote:
> Does this score:
>
> 0.001 BAYES_50Bayesian spam probability is 40 to 60%
>
> seem to be rather low for something with a 50% probablity of being spam?
>   
No, as it has a 50% probability of being nonspam too.

50% is the "exactly undecided" mark.




Re: Scoring question

2007-08-14 Thread arni

Rick Zeman schrieb:

Does this score:

0.001   BAYES_50Bayesian spam probability is 40 to 60%

seem to be rather low for something with a 50% probablity of being spam?

SA 3.2.1 run within Maia with autolearning on.

Tnx


  
BAYES_50 means that bayes thinks that its 50% chance to be ham and 50% 
chance to be spam - so bayes should stay neutral because it has no 
"opinion" on this message.


arni


Re: Scoring question

2007-08-14 Thread Daryl C. W. O'Shea

On 8/14/2007 3:49 PM, Rick Zeman wrote:

Does this score:

0.001   BAYES_50Bayesian spam probability is 40 to 60%

seem to be rather low for something with a 50% probablity of being spam?


Anything higher would seem to be a little high for something with a 50% 
probability of being ham.


Daryl



Re: scoring question

2006-11-11 Thread Matt Kettler
Miles Fidelman wrote:
> Hi,
>
> I got the following in a message from our list management software:
>
> *X-Spam-Status: * Yes, hits=9.7 tagged_above=0.0 required=6.3
> tests=AWL, BAYES_20, NO_RELAYS
> *X-Spam-Level: * *
> *X-Spam-Flag: * YES
>
> Basic configuration:
> Debian Sarge
> Postfix
> amavisd-new
> spamassassin 3.001003
> standard ruleset, plus updates from
> - default channel
> - saupdates.openprotect.com
>
> The thing is, that if I'm reading things correctly, the scores for the
> listed tests are:
> AWL 1 (default)
Nope... the AWL has a variable score. It's the "Automatic whitelist"
which is really more of a "History-tracking score averager" than
anything else. It's only called AWL because its most common effect is to
push down scores when a normally low-scoring sender sends a message that
gets a high score. In this case, it went the other way. A sender that
was high-scoring in the past sent a low scoring message and got pushed up.
> 50_scores.cf:score BAYES_20 0.0001 0.0001 -0.740 -0.740
> 50_scores.cf:score NO_RELAYS -0.001
>
> Which should add up to .259 (net tests and Bayes turned on).
>
> So... why is this showing hits=9.7?  What am I missing?
See above, the variable score for the AWL would have been on the order
of +9.45 or so.

Apparently the past average for this sender is somewhere around +20,
causing the AWL to add a lot to this message.

The AWL score is based on the current pre-awl score, and the past
average for that sender.

Basically the AWL always looks at the difference between the current
score, and the past average. It then adds half that difference in.

See  http://wiki.apache.org/spamassassin/AutoWhitelist

>
>
>
>