W dniu 20.03.2019 o 15:27, Dominic Raferd pisze: > On Wed, 20 Mar 2019 at 13:14, piecka <teplav...@gmail.com> wrote: >> >> Hello >> >> We've encountered a high false positive rate with MIXED_ES rule for emails >> written in Czech language. Czech naturally uses all of the e,ě and é. >> >> The situation is similar for Slovak language, which includes e and é. >> >> It seems the same with Greek >> (https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7691). >> >> Email messages written in one of the above mentioned (probably even other) >> languages have a much higher false positive rate than I would consider >> acceptable. >> >> Additionally, the default score for the rule is 3.999 which is quite high. >> >> I don't think the rule is suitable for the default ruleset in the current >> form. > > I have seen similar problems and agree. I reduced its score with this > line in /etc/spamassassin/local.cf: > score MIXED_ES 0.499 >
MIXED_ES has hits in ham in masscheck https://ruleqa.spamassassin.org/20190317-r1855682-n/MIXED_ES/detail part of ham mails in corpus which trigger MIXED_ES is in polish language.