On 10/05, Jim Popovitch wrote:
> On Wed, Oct 5, 2011 at 17:41, RW <rwmailli...@googlemail.com> wrote:
> > The usual reason for a hiatus is that too much spam or ham has aged-out
> > in the corpora, and a top-up is needed.

I think it's more accurate to say the usual reason is that too many people
have stopped automatically submitting data via masscheck, and we need
more people to submit data.

I have a graphical representation of the problem here:
http://www.chaosreigns.com/dnswl/tot.svg
Green is spam, red is non-spam.  They both need to be above the blue line
(150,000 emails each) for score generation to run to create the rule updates.
Counts as of the last (net) run:  
Non-spams: 136261  (90.8% of the minimum)
Spams:     351950 (234.6% of the minimum)

> So, how do we get it top-up'ed?

You contribute your data:
http://wiki.apache.org/spamassassin/NightlyMassCheck
The more we have, the more accurately we can calculate optimal rule
scores, always.  Unfortunately the Project Management Committee has a habit
of never responding to requests for masscheck accounts.


But the current situation appears to be abnormal.  For some reason RuleQA
/ score generation isn't including data submitted by uploading full emails
(normally just rule hit stats are uploaded).  

There is an open bug about that problem here:
https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6671

It seems there is nobody with the access, knowledge of the system,
and time required to fix the problem.

There was supposed to be a SpamAssassin v3.4.0 Release Candidate released
19 days ago, which seems to be primarily held up by this rule update
problem.  Which nobody is working on.

-- 
"Go forth, and be excellent to one another." - http://www.jhuger.com/fredski.php
http://www.ChaosReigns.com

Reply via email to