Re: [Mailman-Users] buildup of attachment files...

2009-11-17 Thread Troy Campbell

Thanks Mark,

The list's Non-digest options - scrub-nondigest is No.

In the Content Filtering -Details for pass_mime_types field
I show the following:

multipart/mixed
multipart/alternative
text/plain

Would that be sufficient to do what you are suggesting if I
turn Edit filter_content on? 


Could I then remove the attachments subdirectories?

Regards,
Troy


on 11/16/2009 07:23 PM Mark Sapiro said the following:

Troy Campbell wrote:

  

On one mailing list I'm seeing alot of attachment files building up
in the archive directory:


[...]
  

Basically there are many directories in the attachments
directory that go from 20070808 to 20091116.

I'm running 2.1.9.  I just need to know how to cleanup and turnoff
if possible.




It is Scrubber.py that saves these. Depending on settings, you may get
one or two copies of each attachment which is either not text/plain or
text/plain with an unknown character set.

If the list's Non-digest options - scrub-nondigest is Yes, you will
get one saved attachment when the attachment is removed from the
message and replaced by a link to the saved attachment. Otherwise, you
get two. One when the attachment is scrubbed for the archive and one
when the attachment is scrubbed from the plain format digest.

You can avoid almost all of this by removing all non-plain text with
content filtering.

If you don't remove them with content filtering, you can avoid the
'digest copies' by setting Digest options - digestable to No. You can
avoid the 'archive copies' by turning off archiving for the list.

  

--
Mailman-Users mailing list Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://wiki.list.org/x/AgA3
Security Policy: http://wiki.list.org/x/QIA9
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org


Re: [Mailman-Users] buildup of attachment files...

2009-11-17 Thread Mark Sapiro
Troy Campbell wrote:

In the Content Filtering -Details for pass_mime_types field
I show the following:

multipart/mixed
multipart/alternative
text/plain

Would that be sufficient to do what you are suggesting if I
turn Edit filter_content on? 


I suggest the following in pass_mime_types

multipart
message/rfc822
text/html
text/plain

plus collapse_alternatives and convert_html_to_plaintext = Yes

This will allow the sub-parts of any multipart message including
multipart/related and multipart/signed to be examined. It will also
allow plain text (and HTML) from attached messages and will ultimately
discard all but the first alternative from multipart/alternative and
convert any remaining HTML to plain text.

This will allow very little that will ultimately be scrubbed. Only
text/plain attachments with unspecified character sets.


Could I then remove the attachments subdirectories?


You can remove the attachments directories anyway. They will be
recreated if needed. The problem with removal is there are messages in
the HTML archive with links to scrubbed attachments and if you remove
the directory or files, you break the links. Whether or not this is
important is up to you.

-- 
Mark Sapiro m...@msapiro.netThe highway is for gamblers,
San Francisco Bay Area, Californiabetter use your sense - B. Dylan

--
Mailman-Users mailing list Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://wiki.list.org/x/AgA3
Security Policy: http://wiki.list.org/x/QIA9
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org


[Mailman-Users] buildup of attachment files...

2009-11-16 Thread Troy Campbell

On one mailing list I'm seeing alot of attachment files building up
in the archive directory:

# pwd
/var/lib/mailman/archives/private
# ls listname*
listname:
attachments  index.html

listname.mbox:

Looking down a couple directory levels in attachements:
# pwd
/var/lib/mailman/archives/private/listname/attachments/20091116/4763b4e9
# file *
attachment.obj: gzip compressed data, was ErrorReport.21234.txt, from 
Unix, last modified: Mon Nov 16 11:03:59 2009

[r...@albers 4763b4e9]# ls
attachment.obj

Basically there are many directories in the attachments
directory that go from 20070808 to 20091116.

I'm running 2.1.9.  I just need to know how to cleanup and turnoff
if possible.

Thanks!
Troy




--
Mailman-Users mailing list Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://wiki.list.org/x/AgA3
Security Policy: http://wiki.list.org/x/QIA9
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org


Re: [Mailman-Users] buildup of attachment files...

2009-11-16 Thread Mark Sapiro
Troy Campbell wrote:

On one mailing list I'm seeing alot of attachment files building up
in the archive directory:
[...]

Basically there are many directories in the attachments
directory that go from 20070808 to 20091116.

I'm running 2.1.9.  I just need to know how to cleanup and turnoff
if possible.


It is Scrubber.py that saves these. Depending on settings, you may get
one or two copies of each attachment which is either not text/plain or
text/plain with an unknown character set.

If the list's Non-digest options - scrub-nondigest is Yes, you will
get one saved attachment when the attachment is removed from the
message and replaced by a link to the saved attachment. Otherwise, you
get two. One when the attachment is scrubbed for the archive and one
when the attachment is scrubbed from the plain format digest.

You can avoid almost all of this by removing all non-plain text with
content filtering.

If you don't remove them with content filtering, you can avoid the
'digest copies' by setting Digest options - digestable to No. You can
avoid the 'archive copies' by turning off archiving for the list.

-- 
Mark Sapiro m...@msapiro.netThe highway is for gamblers,
San Francisco Bay Area, Californiabetter use your sense - B. Dylan

--
Mailman-Users mailing list Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://wiki.list.org/x/AgA3
Security Policy: http://wiki.list.org/x/QIA9
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org