Another thing: How well does language recognition in SA work? I'd like
to give mail that's written in Slovene, say, -20 points, since it's
almost never spam.
It's done using a dictionary of letter triplets, IIRC. However, I don't
think there's any support for any kind of "white" language rules.
You could theoretically hack the rule and make the
UNDESIRED_LANGUAGE_BODY rule into a DESIRED_LANGUAGE_BODY rule instead,
by negating it and reversing the score. However, you'd have to eliminate
the original rule to do it (otherwise everything that isn't Slovene
would get positive points).
Could you give me an example of what to modify, if it isn't too much
work? :)
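
A rough sketch of what such a change might look like in local.cf; it is
not a tested recipe. It assumes an SA version that supports meta rules
and the ok_languages setting, and that the stock rule is named
UNDESIRED_LANGUAGE_BODY as mentioned above (other releases may use a
different name). DESIRED_LANGUAGE_BODY is a made-up name. Note that this
variant does not delete the stock rule; it keeps it at a negligible
score so the meta rule can still refer to it, which is a substitute for
the removal suggested above.

```
# local.cf (sketch, untested)

# Assumption: "sl" is the language code for Slovene.
ok_languages sl

# Keep the stock rule running, but make its score negligible.
# A score of exactly 0 would disable the rule, and the meta rule
# below would then have nothing to depend on.
score UNDESIRED_LANGUAGE_BODY 0.001

# Hypothetical inverted rule: fires when the stock rule did NOT fire.
meta DESIRED_LANGUAGE_BODY !UNDESIRED_LANGUAGE_BODY
describe DESIRED_LANGUAGE_BODY Body not flagged as an undesired language
score DESIRED_LANGUAGE_BODY -20
```

Running "spamassassin --lint" afterwards should catch syntax errors.
One caveat: the inverted rule would presumably also fire on mail whose
language can't be identified at all (very short bodies, for instance),
so -20 may be more generous than you want.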