Re: Preliminary design proposal for charset normalization support in SpamAssassin

Matt Sergeant Wed, 24 Aug 2005 08:53:45 -0700

On 23 Aug 2005, at 14:51, John Gardiner Myers wrote:

Matt Sergeant wrote:
Wasn't there unicode normalisation in the original email parser thatI submitted to the project (that Theo turned into the current parser)?
Certainly it would make sense to use that if you could. It works verywell on a very large set of test data.
That code only deals with MIME-labeled charsets. It has no provisionfor charset detection.

Really? I must have written that later in my local version of the code.I can probably provide some code for charset detection - it's fairlysimple once you have the heuristics figured out.


Matt.


______________________________________________________________________
This email has been scanned by the MessageLabs Email Security System.

For more information please visit http://www.messagelabs.com/email______________________________________________________________________

Re: Preliminary design proposal for charset normalization support in SpamAssassin

Reply via email to