Re: Preliminary design proposal for charset normalization support in SpamAssassin

John Gardiner Myers Tue, 23 Aug 2005 11:52:05 -0700

Matt Sergeant wrote:

Wasn't there unicode normalisation in the original email parser that I submitted to the project (that Theo turned into the current parser)?
Certainly it would make sense to use that if you could. It works very well on a very large set of test data.

That code only deals with MIME-labeled charsets. It has no provision for charset detection.

The code puts charset normalization inside of Mail::SpamAssassin::Message::Node::decode(). I don't think charset normalization is appropriate for the decode call that is used in parsing message/rfc822 objects.
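
To illustrate the difference between relying on a MIME charset label and detecting the charset, here is a rough sketch in Python rather than SpamAssassin's Perl. The function names and the candidate list are hypothetical, chosen only for illustration; they are not part of Mail::SpamAssassin or of the proposal, and trying a fixed list of encodings is only a crude stand-in for real charset detection.

```python
import codecs

def decode_labeled(raw, charset):
    """Decode using only the MIME charset label; return None if that fails."""
    if not charset:
        return None  # no label at all: a label-only decoder has nothing to go on
    try:
        codecs.lookup(charset)        # unknown label raises LookupError
        return raw.decode(charset)    # mislabeled bytes raise UnicodeDecodeError
    except (LookupError, UnicodeDecodeError):
        return None

def decode_with_detection(raw, charset, candidates=("utf-8", "iso-8859-1")):
    """Try the label first, then fall back to guessing (hypothetical heuristic)."""
    text = decode_labeled(raw, charset)
    if text is not None:
        return text
    for cand in candidates:
        try:
            return raw.decode(cand)
        except UnicodeDecodeError:
            continue
    # Last resort: keep the ASCII part and mark everything else as undecodable.
    return raw.decode("ascii", errors="replace")
```

With a missing or bogus label, decode_labeled() returns None, while decode_with_detection() still produces text. Keeping such a step separate from the raw transfer decoding would leave the bytes of a nested message/rfc822 part untouched for the parser.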
