Related to the second part of Werner's message being scrubbed with the message:
An embedded and charset-unspecified text was scrubbed...
Poking in the email package (on python 2.4.4) shows:
def get_content_charset(self, failobj=None):
"""Return the charset parameter of the Content-Type header.
The returned string is always coerced to lower case. If there is no
Content-Type header, or if that header has no charset parameter,
failobj is returned.
"""
This seems to violate section 5.2 of RFC 2045 which says parts lacking
a Content-type header should be assumed to be text/plain with a
charset of us-ascii. The get_content_type method in email.Message
does mention RFC 2045 and uses text/plain if the content-type is
invalid. Would it be appropriate to set failobj="us-ascii" when
calling this method in Scrubber.py?
BTW, if it's easier for you guys to track this in bugzilla, let me
know and I'll file on one SF.
--
Todd OpenPGP -> KeyID: 0xBEAF0CE3 | URL: www.pobox.com/~tmz/pgp
======================================================================
The marvel of all history is the patience with which men and women
submit to burdens unnecessarily laid upon them by their governments.
-- William H. Borah
pgpP97opAbw6b.pgp
Description: PGP signature
------------------------------------------------------ Mailman-Users mailing list [email protected] http://mail.python.org/mailman/listinfo/mailman-users Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/ Unsubscribe: http://mail.python.org/mailman/options/mailman-users/archive%40jab.org Security Policy: http://www.python.org/cgi-bin/faqw-mm.py?req=show&file=faq01.027.htp
