http://issues.apache.org/SpamAssassin/show_bug.cgi?id=3787


[EMAIL PROTECTED] changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Keywords|triage                      |




------- Additional Comments From [EMAIL PROTECTED]  2007-04-27 02:59 -------
OK, it's been a long time since this bug was opened, so here's a quick
summary for the new arrivals coming from bugs 5440 and 5437.

There's a Perl bug in matching ISO-8859-1 patterns against UTF-8 strings:
http://rt.perl.org/rt3/Public/Bug/Display.html?id=37950 .  Here are the
options:

- Apparently this bug is fixed in perl 5.8.8, so you could upgrade to that.

- Alternatively, you could rebuild your current perl from source using the
  patch in this bug, or the one at that rt.perl.org bug.

- Or you could fix the rules to avoid the bug: see comments 54, 69 and 70.
  (The easiest way is to remove the /i modifier at the end of each affected
  rule and change its pattern to use explicit classes such as /[iI]/ instead
  of just /i/.)
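
As a rough illustration of that last workaround, a rewritten rule might look
something like the fragment below. The rule name and pattern are made up for
this example and are not taken from any SARE ruleset; the actual fixes are
described in comments 54, 69 and 70.

```
# Hypothetical rule for illustration only; not from any SARE ruleset.
# The two definitions are alternatives: keep one or the other, not both.

# Before: case-insensitivity comes from the trailing /i modifier.
body EXAMPLE_PHRASE  /free offer/i

# After: no /i; each letter is matched by an explicit two-case class.
body EXAMPLE_PHRASE  /[fF][rR][eE][eE] [oO][fF][fF][eE][rR]/
```

The rewritten pattern is longer and harder to read, but it no longer depends
on the /i modifier.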

SARE guys -- any chance the rules from comment 10 could be fixed in the
distributed SARE rulesets to include the workaround?  This is going to be a
major FAQ once 3.2.0 is released, since perl 5.8.8 is still not that common.

I'll attach a demo of one SARE rule fixed:

spamassassin -Lt -p rule.cf < badmsg 2>&1 | grep 'Malformed UTF-8 character' | wc -l
872
spamassassin -Lt -p rule_fixed.cf < badmsg 2>&1 | grep 'Malformed UTF-8 character' | wc -l
0




------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
