https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6386

           Summary: Limit corpora message age in score generation
           Product: Spamassassin
           Version: SVN Trunk (Latest Devel Version)
          Platform: Other
        OS/Version: All
            Status: NEW
          Severity: major
          Priority: P5
         Component: Score Generation
        AssignedTo: [email protected]
        ReportedBy: [email protected]


[I'm marking this as major severity since it could have a major effect on the
scores of all network tests.  Feel free to adjust as appropriate.]

Justin mentioned that when a ham sample is first introduced, its network test
hits (false positives) from that original score generation run are carried
forward through time whenever new scores are generated.  This seems
inappropriate, especially for network tests, since the data behind them tends
to change over time.  In particular, an FP on an old network test may no
longer be an FP against current network test data, i.e., the entry that caused
the FP may have been removed after the original scoring run.  As a result,
such retrospective FPs under the existing score generation system may not
reflect actual FPs from current network test data, leading to a lower than
appropriate score for the affected test.

One solution would be to put a time limit on network test results, so that
hits older than the limit are not used in score generation.  Some
blacklist/blocklist data are highly dynamic and tend to change from day to
day, so an expiration time on the order of a few days may be appropriate.
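
As a rough sketch of what such a limit could look like (this is not existing
SpamAssassin code; the HamResult record, the ham_fp_counts function, and the
rule names used with it are all hypothetical), the FP count for each rule
could skip network test hits whose results are older than the expiration
time, while still counting hits on non-network rules:

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class HamResult:
    """Stored test results for one ham message (hypothetical record layout)."""
    scanned_at: datetime  # when the network tests were run on this message
    hits: frozenset       # names of the rules that hit


def ham_fp_counts(results, network_rules, now: datetime, max_age: timedelta):
    """Count ham hits (FPs) per rule, ignoring expired network test hits.

    Hits on rules outside network_rules always count.  Hits on rules in
    network_rules count only when the result is no older than max_age.
    """
    counts = Counter()
    for result in results:
        fresh = (now - result.scanned_at) <= max_age
        for rule in result.hits:
            if fresh or rule not in network_rules:
                counts[rule] += 1
    return counts
```

This only covers the FP side.  A real change would presumably also have to
leave the expired messages out of the totals used for that rule, or re-run
the network tests on them, so that the hit rates stay consistent.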
