On 2014.12.03 05.45, Mark Martinec wrote:
> listsb-spamassas...@bitrate.net wrote:
>> i was testing with a sample message, and noticed that when running
>> manually with --debug, there seem to be numerous differences in the
>> results, such as scores for the same tests differing, visual ordering
>> of results differing [is this significant?], and bayes not being
>> listed when using --debug.  am i doing something wrong?  are my
>> expectations misguided?  i'm doing these tests as the user named
>> amavis, which the amavis software runs as.
>>
>>> spamassassin --test-mode --debug < message3.txt
>>  1.6 RCVD_IN_BRBL_LASTEXT   RBL: No description available.
> [...]
> 
>>> spamassassin --test-mode < message3.txt
>>  1.4 RCVD_IN_BRBL_LASTEXT   RBL: No description available.
>>                             [94.73.46.5 listed in 
>> bb.barracudacentral.org]
>> -1.9 BAYES_00               BODY: Bayes spam probability is 0 to 1%
>>                             [score: 0.0000]
> 
> 
> Apparently in the first case a score set 1 was chosen, and in the second
> case a score set 3. Availability of a bayes scanner choses between the two.

i'm ignorant here - what is a score set?  is there documentation i can read up 
on?

> Could it be that you have a fresh bayes database which had less than 200
> spam and 200 ham entries in the first attempt, but became populated
> and functional by the time of the second attempt?

i don't believe so - here's another exercise, with bayes info before and after 
each test.  

>sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0      22642          0  non-token data: nspam
0.000          0       2254          0  non-token data: nham
0.000          0     258660          0  non-token data: ntokens
0.000          0 1416781285          0  non-token data: oldest atime
0.000          0 1417643830          0  non-token data: newest atime
0.000          0 1417643689          0  non-token data: last journal sync atime
0.000          0 1417632951          0  non-token data: last expiry atime
0.000          0     691200          0  non-token data: last expire atime delta
0.000          0       2775          0  non-token data: last expire reduction 
count

 >spamassassin --test-mode --debug < message3.txt
Content analysis details:   (16.0 points, 5.0 required)

 pts rule name              description
---- ---------------------- --------------------------------------------------
 1.7 URIBL_WS_SURBL         Contains an URL listed in the WS SURBL blocklist
                            [URIs: ialloansystems.com]
 2.5 URIBL_DBL_SPAM         Contains a spam URL listed in the DBL blocklist
                            [URIs: ialloansystems.com]
 1.6 RCVD_IN_BRBL_LASTEXT   RBL: No description available.
                            [94.73.46.5 listed in bb.barracudacentral.org]
 1.7 URIBL_BLACK            Contains an URL listed in the URIBL blacklist
                            [URIs: ialloansystems.com]
 0.1 URIBL_SBL_A            Contains URL's A record listed in the SBL blocklist
                            [URIs: www.ialloansystems.com]
-0.0 T_RP_MATCHES_RCVD      Envelope sender domain matches handover relay
                            domain
-0.0 SPF_HELO_PASS          SPF: HELO matches SPF record
-0.0 SPF_PASS               SPF: sender matches SPF record
 2.4 RAZOR2_CF_RANGE_E8_51_100 Razor2 gives engine 8 confidence level
                            above 50%
                            [cf: 100]
 0.4 RAZOR2_CF_RANGE_51_100 Razor2 gives confidence level above 50%
                            [cf: 100]
 2.0 PYZOR_CHECK            Listed in Pyzor (http://pyzor.sf.net/)
 1.7 RAZOR2_CHECK           Listed in Razor2 (http://razor.sf.net/)
 0.0 DIGEST_MULTIPLE        Message hits more than one network digest check
 5.0 KAM_VERY_BLACK_DBL     Email that hits both URIBL Black and Spamhaus DBL
-3.1 AWL                    AWL: adjust score towards average for this sender

>sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0      22642          0  non-token data: nspam
0.000          0       2254          0  non-token data: nham
0.000          0     258660          0  non-token data: ntokens
0.000          0 1416781285          0  non-token data: oldest atime
0.000          0 1417643830          0  non-token data: newest atime
0.000          0 1417643689          0  non-token data: last journal sync atime
0.000          0 1417632951          0  non-token data: last expiry atime
0.000          0     691200          0  non-token data: last expire atime delta
0.000          0       2775          0  non-token data: last expire reduction 
count

>spamassassin --test-mode < message3.txt
Content analysis details:   (17.0 points, 5.0 required)

 pts rule name              description
---- ---------------------- --------------------------------------------------
 1.7 URIBL_BLACK            Contains an URL listed in the URIBL blacklist
                            [URIs: ialloansystems.com]
 1.6 URIBL_WS_SURBL         Contains an URL listed in the WS SURBL blocklist
                            [URIs: ialloansystems.com]
 2.5 URIBL_DBL_SPAM         Contains a spam URL listed in the DBL blocklist
                            [URIs: ialloansystems.com]
 1.4 RCVD_IN_BRBL_LASTEXT   RBL: No description available.
                            [94.73.46.5 listed in bb.barracudacentral.org]
 0.1 URIBL_SBL_A            Contains URL's A record listed in the SBL blocklist
                            [URIs: www.ialloansystems.com]
 3.5 BAYES_99               BODY: Bayes spam probability is 99 to 100%
                            [score: 1.0000]
-0.0 SPF_HELO_PASS          SPF: HELO matches SPF record
-0.0 T_RP_MATCHES_RCVD      Envelope sender domain matches handover relay
                            domain
-0.0 SPF_PASS               SPF: sender matches SPF record
 0.2 BAYES_999              BODY: Bayes spam probability is 99.9 to 100%
                            [score: 1.0000]
 1.4 PYZOR_CHECK            Listed in Pyzor (http://pyzor.sf.net/)
 0.5 RAZOR2_CF_RANGE_51_100 Razor2 gives confidence level above 50%
                            [cf: 100]
 1.9 RAZOR2_CF_RANGE_E8_51_100 Razor2 gives engine 8 confidence level
                            above 50%
                            [cf: 100]
 0.9 RAZOR2_CHECK           Listed in Razor2 (http://razor.sf.net/)
 5.0 KAM_VERY_BLACK_DBL     Email that hits both URIBL Black and Spamhaus DBL
 0.3 DIGEST_MULTIPLE        Message hits more than one network digest check
-4.0 AWL                    AWL: adjust score towards average for this sender

>sa-learn --dump magic
0.000          0          3          0  non-token data: bayes db version
0.000          0      22642          0  non-token data: nspam
0.000          0       2256          0  non-token data: nham
0.000          0     258777          0  non-token data: ntokens
0.000          0 1416781285          0  non-token data: oldest atime
0.000          0 1417643866          0  non-token data: newest atime
0.000          0 1417643689          0  non-token data: last journal sync atime
0.000          0 1417632951          0  non-token data: last expiry atime
0.000          0     691200          0  non-token data: last expire atime delta
0.000          0       2775          0  non-token data: last expire reduction 
count

Reply via email to