https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6473
Summary: Making Bayes Learn RelayCountry Metadata
Product: Spamassassin
Version: unspecified
Platform: PC
OS/Version: FreeBSD
Status: NEW
Severity: enhancement
Priority: P2
Component: Plugins
AssignedTo: [email protected]
ReportedBy: [email protected]
Created an attachment (id=4794)
--> (https://issues.apache.org/SpamAssassin/attachment.cgi?id=4794)
Patch to add Bayes-specific Relaycountry metadata
Bayes doesn't learn tokens shorter than 3 characters and so discards all the
two-letter country codes in the RelayCountry metadata.
As the existing format is well suited to header rules, and to avoid breaking
existing local rules, I suggest adding additional metadata specifically for
Bayes.
I've attached a patch. It produces a token for the first trusted country, plus
a token for each country change e.g.
"US US CA NG" becomes "Trusted_US USCA CANG"
I think this is better than simply having a token per country as that loses all
information about ordering e.g. if you are running SA in the UK then "TW" and
"CZ TW" might be all spam, but "GB TW" and "US TW" could be less spammy due to
travellers using TW IP addresses to connect their submission servers.
Ordered pairs are also more resistant to forged headers. If a spammer adds
extra received headers as bayes poison and sends it though a foreign country,
it will show as a spammy pair rather than a hammy country code e.g CNGB is
spammy because the ordering is wrong.
--
Configure bugmail:
https://issues.apache.org/SpamAssassin/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.