On Sunday 13 February 2005 18:46, Rogério Brito wrote: > I would also appreciate if the Debian shipped spamassassin would include > the fix (if it is indeed correct),
I'm not going to pretend being an expert on SA or UTF8 in Perl (allthough I have read that chapter in the Camel book! :-) ). Nor do I know the HTML::Parser code in detail. Reviewing this bug, it is clearly HTML::Parser that's complaining, and the documentation for the error we're seeing, at http://search.cpan.org/~gaas/HTML-Parser-3.45/Parser.pm#DIAGNOSTICS says "The parser can process raw undecoded UTF-8 sanely if the utf8_mode is enabled" The proposed patch simply enables utf8_mode and so, it seems trivial and very safe to me. Upstream is probably not going to apply it, since it depends on a 5.8, but it seems ideal as a Debian specific patch. I'm not following the SA lists, so I don't know if they are planning a 3.0.3 release, but I see they are reviewing a few bugs now, so it could be forthcoming, in which case I hope the SA maintainers will consider it, and if it is not released, it would be nice to have this patch in Sarge anyway... :-) Cheers, Kjetil