Re: Disabling autolearn on given rule

2021-09-23 Thread Bill Cole
On 2021-09-22 at 05:19:48 UTC-0400 (Wed, 22 Sep 2021 11:19:48 +0200) Bert Van de Poel is rumored to have said: I for one have no idea how I would submit a fix to SA once I've written it, to give a concrete example. I'm guessing I just paste the patch to a Bugzilla comment and hope someone merg

Re: Disabling autolearn on given rule

2021-09-23 Thread Matus UHLAR - fantomas
On 9/22/2021 8:11 AM, Kevin A. McGrail wrote: So I'd recommend a different take.  Autolearn is an abomination we never should have published.  It is, in effect, a switch to allow a inherent bias in the modelling to grow and continue. On 22.09.21 10:39, Jared Hall wrote: Agreed, predictable Ga

Re: Disabling autolearn on given rule

2021-09-22 Thread Jared Hall
On 9/22/2021 8:11 AM, Kevin A. McGrail wrote: Morning all, So I'd recommend a different take.  Autolearn is an abomination we never should have published.  It is, in effect, a switch to allow a inherent bias in the modelling to grow and continue. Agreed, predictable Garbage Out (FP) become

Re: Disabling autolearn on given rule

2021-09-22 Thread Matus UHLAR - fantomas
On 21.09.21 13:11, Matt Corallo wrote: I recently noticed my bayes was rarely matching any spam, and it turns out this was due to autolearn=ham'ing occurring on lots of list traffic that I only occasionally read, some of which was blatant spam. Sadly, list traffic can be pretty hard to categori

Re: Disabling autolearn on given rule

2021-09-22 Thread Benny Pedersen
On 2021-09-22 14:11, Kevin A. McGrail wrote: Morning all, So I'd recommend a different take. Autolearn is an abomination we never should have published. It is, in effect, a switch to allow a inherent bias in the modelling to grow and continue. Disable autolearn, wipe your Bayes store, and man

Re: Disabling autolearn on given rule

2021-09-22 Thread Kevin A. McGrail
Morning all, So I'd recommend a different take. Autolearn is an abomination we never should have published. It is, in effect, a switch to allow a inherent bias in the modelling to grow and continue. Disable autolearn, wipe your Bayes store, and manually train from hand classified ham and spam.

Re: Disabling autolearn on given rule

2021-09-22 Thread Martin Gregorie
On Tue, 2021-09-21 at 18:57 -0700, Loren Wilton wrote: > > Well, from the few I've seen, they all seem to have a relatively > constant structure. Someone pointed you to a plugin that is at least > dealing in this having a better suggestion. > > While I wrote a little Perl a decade ago I've forgot

Re: Disabling autolearn on given rule

2021-09-22 Thread Bert Van de Poel
This is complete news to me! Based on the activity on the dev list, I had assumed there were still 10-20 people devoting some of their time to developing SA. If you are the only one, that of course changes my view very much, and would be something worth communicating in some spot. When I asked

Re: Disabling autolearn on given rule

2021-09-22 Thread Henrik K
On Wed, Sep 22, 2021 at 10:45:43AM +0200, Bert Van de Poel wrote: > > I hope I'm not passing on too much of a negative message. It would be great > of someone had a look at the Bayes autolearn code. I think it would be a > great service to the community! The fact is that there really aren't any ac

Re: Disabling autolearn on given rule

2021-09-22 Thread Bert Van de Poel
I think having a look at the code itself is a good idea. I'm not sure if it's up-to-date but you can find some information on https://cwiki.apache.org/confluence/display/SPAMASSASSIN/DevelopmentStuff I've found that just reporting issues on SA's bugzilla is completely useless since it's just u

Re: Disabling autolearn on given rule

2021-09-22 Thread Henrik K
On Tue, Sep 21, 2021 at 06:57:22PM -0700, Loren Wilton wrote: > > I guess one thing you might be able to do is implement a tflags flag of > absolutely_no_autolearn or some such that would force-disable the autolearn > decision if the rule had hit, but that might be something that would have to > be

Re: Disabling autolearn on given rule

2021-09-21 Thread Loren Wilton
(2) where would I go to look at building a plugin for this? Ideally something that ends up upstream, but though I can write code, I know no perl :). Well, from the few I've seen, they all seem to have a relatively constant structure. Someone pointed you to a plugin that is at least dealing in

Re: Disabling autolearn on given rule

2021-09-21 Thread Matt Corallo
On 9/21/21 18:01, Loren Wilton wrote: None of these seem to accomplish disabling learning for a specific rule I think the problem is that I believe Bayes works off of the total score, and probably only sees rule names as more tokens, if it sees them at all. If it indeed works off the total

Re: Disabling autolearn on given rule

2021-09-21 Thread Loren Wilton
None of these seem to accomplish disabling learning for a specific rule I think the problem is that I believe Bayes works off of the total score, and probably only sees rule names as more tokens, if it sees them at all. If it indeed works off the total score, about all you can do is somehow tw

Re: Disabling autolearn on given rule

2021-09-21 Thread Matt Corallo
On 9/21/21 15:53, Benny Pedersen wrote: On 2021-09-21 22:11, Matt Corallo wrote: "tflags MAILING_LIST_MULTI noautolearn" doesn't seem like quite what I want, it just reduces the score used to decide whether to learn. There's some old bugzilla mentions asking for this feature, but it seems the r

Re: Disabling autolearn on given rule

2021-09-21 Thread Benny Pedersen
On 2021-09-21 22:11, Matt Corallo wrote: "tflags MAILING_LIST_MULTI noautolearn" doesn't seem like quite what I want, it just reduces the score used to decide whether to learn. There's some old bugzilla mentions asking for this feature, but it seems the response was "write a plugin". Is there a

Disabling autolearn on given rule

2021-09-21 Thread Matt Corallo
Hi! I recently noticed my bayes was rarely matching any spam, and it turns out this was due to autolearn=ham'ing occurring on lots of list traffic that I only occasionally read, some of which was blatant spam. Sadly, list traffic can be pretty hard to categorize and ends up getting through due