On Sun, 19 Jan 2025, Matus UHLAR - fantomas wrote:

should I assume that RuleQA works with different corpus than masscheck?

The following is my understanding absent having ever done a deep dive through the scoring code...

Nope. masscheck runs the current ruleset against the contributors' ham and spam corpora and records which rules hit which message. The results are uploaded to the RuleQA host where those results are analysed and scores are generated.

They are different phases in the same process, and we're pretty loose about using the terms interchangably to refer to the entire thing.

On 15.01.25 09:23, Bill Cole wrote:
It is worth noting that the 72_scores.cf file is what the RuleQA process produces.

50_scores.cf should be generated by mass-checking, where H2 generated
no score (why?)

Bill was correct. 72_scores.cf is generated by the masscheck/ruleqa process. The masscheck/ruleqa process does not evaluate or generate scores for all rules, though.

50_scores.cf is fixed scores set either manually or (historically) from a different analysis process that is run *very* infrequently. It is not generated from the masscheck/ruleqa process.

There is no score for RCVD_IN_MSPIKE_H2 in 50_scores.cf. There are scores for the other MSPIKE rules there and the ones appearing there do NOT appear in 72_scores.cf so it looks like they are not being evaluated for scoring. The score for _H2 was apparently accidentally removed by this commit:

https://svn.apache.org/viewvc/spamassassin/trunk/rules/50_scores.cf?r1=1213550&r2=1213551&diff_format=h

I've added a score for RCVD_IN_MSPIKE_H2 back into 50_scores.cf to see whether that affects things. It looks like that is the only MSPIKE rule that masscheck is generating scores for and I can't really see why apart from it not appearing in 50_scores.cf, and at the moment I don't want to go spelunking in the code to verify that's the override...



--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 [email protected]                         pgpk -a [email protected]
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  Politicians never accuse you of "greed" for wanting other people's
  money, only for wanting to keep your own money.    -- Joseph Sobran
-----------------------------------------------------------------------
 4 days until John Moses Browning's 170th Birthday

Reply via email to