Re: Training spamassassin past 5,000 emails

2021-03-09 Thread Kris Deugau
RW wrote:
> On Tue, 09 Mar 2021 08:52:28 -0500 Steve Dondley wrote:
> > I will also be allowing users to flag their own spam using the roundcube webmail client.
> If you do that you should review the submissions.

This. SO much this. ALL THE THIS. If you're using the "Mark as Junk" or "Mark as Jun

Re: Training spamassassin past 5,000 emails

2021-03-09 Thread RW
On Tue, 09 Mar 2021 08:52:28 -0500 Steve Dondley wrote:
> On 2021-03-09 08:42 AM, RW wrote:
> >
> > If you keep a full archive of what's been trained, I think it makes
> > sense to trim out old mail occasionally and recreate the database -
> > particularly if it's a single user Bayes.
>
> I

Re: Training spamassassin past 5,000 emails

2021-03-09 Thread Bill Cole
On 9 Mar 2021, at 7:49, Steve Dondley wrote:
> I've read through https://spamassassin.apache.org/full/3.1.x/doc/sa-learn.html which states that "anything over about 5000 messages does not improve accuracy significantly in our tests."

Did you read the section on expiration? https://spamassassi

Re: Training spamassassin past 5,000 emails

2021-03-09 Thread Steve Dondley
On 2021-03-09 08:28 AM, Greg Troxel wrote:
> Steve Dondley writes:
> > I've read through https://spamassassin.apache.org/full/3.1.x/doc/sa-learn.html which states that "anything over about 5000 messages does not improve accuracy significantly in our tests."
> I would take that with a grain of salt.

Re: Training spamassassin past 5,000 emails

2021-03-09 Thread RW
On Tue, 09 Mar 2021 07:49:38 -0500 Steve Dondley wrote:
> I've read through
> https://spamassassin.apache.org/full/3.1.x/doc/sa-learn.html which
> states that "anything over about 5000 messages does not improve
> accuracy significantly in our tests."
>
> So once I hit 5,000, what do I do? Do I run -

Re: Training spamassassin past 5,000 emails

2021-03-09 Thread Greg Troxel
Steve Dondley writes:
> I've read through
> https://spamassassin.apache.org/full/3.1.x/doc/sa-learn.html which
> states that "anything over about 5000 messages does not improve
> accuracy significantly in our tests."

I would take that with a grain of salt. Based on my experience running SA fo

Training spamassassin past 5,000 emails

2021-03-09 Thread Steve Dondley
I've read through https://spamassassin.apache.org/full/3.1.x/doc/sa-learn.html which states that "anything over about 5000 messages does not improve accuracy significantly in our tests." So once I hit 5,000, what do I do? Do I run --forget on say the 500 oldest emails, delete those from my ham/spa
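For anyone who wants to try the "--forget the oldest 500" idea from the original question, here is a rough sketch of what it might look like. It assumes one message per file in a corpus directory and uses file modification time as a stand-in for message age; both are assumptions, not something stated in the thread. The helper names `oldest_messages` and `forget_command` are made up for illustration. The sketch only builds the `sa-learn --forget` command line and does not run it. Note that replies in the thread point to the expiration section of the docs and to rebuilding the database from a trimmed archive, so this may well not be the recommended route.

```python
from pathlib import Path


def oldest_messages(corpus_dir, count):
    """Return the `count` oldest regular files in corpus_dir, oldest first.

    Age is approximated by file mtime (an assumption); ties are broken by name.
    """
    files = [p for p in Path(corpus_dir).iterdir() if p.is_file()]
    files.sort(key=lambda p: (p.stat().st_mtime, p.name))
    return files[:count]


def forget_command(paths):
    """Build, but do not execute, an `sa-learn --forget` argument list."""
    return ["sa-learn", "--forget", *[str(p) for p in paths]]
```

After reviewing the selection, something like `forget_command(oldest_messages("corpus/spam", 500))` could be handed to `subprocess.run`, where `corpus/spam` is a placeholder path.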