Hi,
I think this bug is fixed. Please get a patch from
http://sourceforge.net/tracker/index.php?func=detail&aid=670167&group_id=103&atid=300103
Cheers,
David Eisner wrote:
I checked out the latest sources from CVS this afternoon and
discovered the following bug in Scrubber.py ( I think ).
I. PROBLEM
If I send a message with an attachment to a list from Outlook (XP),
the pipermail archived version of the message is missing the
content of the message, although the attahcment is there.
Here's an example:
--snip--
Skipped content of type multipart/alternative-- next part --
A non-text attachment was scrubbed...
Name: test.doc
Type: application/msword
Size: 23040 bytes
Desc: not available
Url : http://calcetalk.umd.edu/pipermail/test/attachments/20030203/fb37e2d4/test-0001.doc
--snip--
If, however, I send a message without an attachment, it works correctly:
--snip--
This is Test 6, from Outlook, with no attachment.
-David
-- next part --
An HTML attachment was scrubbed...
URL: http://calcetalk.umd.edu/pipermail/test/attachments/20030203/1d720250/attachment.htm
--snip--
II. ANALYSIS
I poked around Scrubber.py and added some syslog statements. The
problem is occuring in the process() method.
a. The structure of an Outlook message with an attachment looks like this:
multipart/mixed
multipart/alternative
text/plain
text/html
application/msword
Here's are the result of the syslog statements I put in process():
Feb 03 20:48:01 2003 (32291) Processing message part multipart/mixed
Feb 03 20:48:01 2003 (32291) Processing message part multipart/alternative
Feb 03 20:48:01 2003 (32291) Processing message part text/plain
Feb 03 20:48:01 2003 (32291) Processing message part text/html
Feb 03 20:48:01 2003 (32291) Processing message part application/msword
Feb 03 20:48:01 2003 (32291) Out of for loop, final sanitizing
Feb 03 20:48:01 2003 (32291) Processing payload part multipart/alternative
Feb 03 20:48:01 2003 (32291)continuing
Feb 03 20:48:01 2003 (32291) Processing payload part text/plain
In the final sanitizing, the text/plain and text/html subparts within
the multipart/alternatvie are lost. The last text/plain part is
what was the word attachment, before it was scrubbed.
b. The structure of an Outlook message with no attachment looks like this:
multipart/alternative
text/plain
text/html
The syslog statements:
Feb 03 20:48:59 2003 (32291) Processing message part multipart/alternative
Feb 03 20:48:59 2003 (32291) Processing message part text/plain
Feb 03 20:48:59 2003 (32291) Processing message part text/html
Feb 03 20:48:59 2003 (32291) Out of for loop, final sanitizing
Feb 03 20:48:59 2003 (32291) Processing payload part text/plain
Feb 03 20:48:59 2003 (32291) Processing payload part text/plain
In this case, there's no problem, because the text/plain and text/html
subparts are returned directly by msg.get_payload(). In part a., the
first thing returned by msg.get_payload() is the multipart/alternative
part, which is skipped since it's not 'text/plain'.
-David
+--+
David Eisner| E-mail: [EMAIL PROTECTED] |
CALCE EPSC | Phone: 301-405-5341 |
University of Maryland | Fax:301-314-9269 |
+--+
--
Tokio Kikuchi, tkikuchi@ is.kochi-u.ac.jp
http://weather.is.kochi-u.ac.jp/
--
Mailman-Users mailing list
[EMAIL PROTECTED]
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
This message was sent to: archive@jab.org
Unsubscribe or change your options at
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org