http://bugzilla.spamassassin.org/show_bug.cgi?id=3348
------- Additional Comments From [EMAIL PROTECTED] 2004-05-04 11:11 -------
Subject: Re: base64 encoded html messages seem to confuse get_uri_list
> So... Anyone have thoughts about this? Part of me is inclined to leave
> things as they are, and the other part of me says we should emulate
> Apple Mail here and make as much valid/visible text as possible.
Maybe we should start decoding where base64 appears to begin. I think
we should generally follow the common behavior, but this is one case
where we probably avoid doing the thing that doesn't let us catch the
spam. Perhaps something like:
first non-blank line:
if line is a legal MIME boundary
treat as a MIME boundary
anything else
treat as base64
all remaining lines:
treat as base64
Thankfully, "-" is not in base64. However, that could open us up to
some stupid spammer using a fake MIME boundary that is indeed decoded by
some mailers that skip non-base64 characters (like "-") and produces a
line of spam text.
So, a more robust technique would be:
first non-blank line:
if line is a legal MIME boundary
decode line (raw, our MIME decoding routine simulates the more
common behavior of skipping non-base64 characters)
if decoded line is binary garbage
treat the line as a MIME boundary
else
treat it like base64
anything else
treat as base64
all remaining lines:
treat as base64
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.