https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7091

--- Comment #3 from Mark Martinec <[email protected]> ---
> > body CRAZY_EURO /€uro/
> > header SUBJ_CREDIT_FR Subject =~ /crédit/
> 
> So... how do we make rules aware of whether or not normalize_charset is
> enabled?

The same way you would make them aware of the original encoding of a text:
you can't.

I have been asking myself the same question, and I think the question
is wrong. From a rule's viewpoint there is no difference between text
that was originally encoded as UTF-8 (or plain US-ASCII) and text that
normalize_charset transcoded into UTF-8 from some other character set.
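
To illustrate the point, here is a minimal Python sketch (not SpamAssassin
code). The transcoding step only stands in for what normalize_charset is
assumed to do, and the variable names are hypothetical. It shows that a
pattern written in UTF-8 sees identical bytes whether the text started out
as UTF-8 or was transcoded from ISO-8859-1, and that it misses the raw
ISO-8859-1 bytes.

```python
import re

word = "crédit"

# Case 1: the sender encoded the text as UTF-8 to begin with.
native_utf8 = word.encode("utf-8")

# Case 2: the sender used ISO-8859-1 and a normalization step
# (a stand-in for normalize_charset) transcoded it into UTF-8.
latin1_bytes = word.encode("iso-8859-1")
transcoded_utf8 = latin1_bytes.decode("iso-8859-1").encode("utf-8")

# After transcoding, nothing is left to tell the two cases apart.
same_bytes = native_utf8 == transcoded_utf8

# A rule pattern stored as UTF-8 bytes, as in /crédit/ in a UTF-8 rules file.
pattern = re.compile("crédit".encode("utf-8"))

matches_native = pattern.search(native_utf8) is not None
matches_transcoded = pattern.search(transcoded_utf8) is not None
# Without normalization the raw ISO-8859-1 bytes do not match.
matches_raw_latin1 = pattern.search(latin1_bytes) is not None

print(same_bytes, matches_native, matches_transcoded, matches_raw_latin1)
```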

Also, each MIME part or mail header could be in a different character set.
Rules work (mostly) on all text in one go. Making a rule aware of a
character set or encoding would require this information to be propagated
with each line or paragraph of text, which is simply unsustainable.
A conditionalized rule does not help here either, as its condition is
evaluated at startup time, before any message is seen.
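
A rough Python sketch of why the charset information gets lost (again not
SpamAssassin code; the `parts` list and `rule_input` are hypothetical
stand-ins for the decoded MIME parts and the text a body rule runs against):

```python
import re

# Hypothetical message: each part declares its own charset.
parts = [
    ("utf-8", "Prix en €uro".encode("utf-8")),
    ("iso-8859-1", "offre de crédit".encode("iso-8859-1")),
    ("us-ascii", b"plain text"),
]

# Normalization decodes every part with its own charset ...
decoded = [raw.decode(charset) for charset, raw in parts]

# ... and the rule then sees one combined text, with no record of
# which line came from which charset.
rule_input = "\n".join(decoded)

euro_hit = re.search("€uro", rule_input) is not None
credit_hit = re.search("crédit", rule_input) is not None

print(euro_hit, credit_hit)
```

To tell the parts apart, a rule would need the charset label carried along
with every line of `rule_input`, which is the propagation described above.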

-- 
You are receiving this mail because:
You are the assignee for the bug.
