[Mailman-Users] broken mbox files: No blank lines before From

2006-01-22 Thread Bastian Friedrich
Hi,

I've been browsing the archives, bug lists and google for a while but 
was unable to find a solution for my problem, so I'm asking here.

The mbox archive of one of my lists (~ 15.000 mails in 2 years) is a 
little broken:  In approx. 5% (guessed) of the cases, two mails are not 
correctly separated by a blank line, i.e. the From  header of the 
latter email is directly under the last (non-empty) line of the 
preceding letter. This results in some problems:

1) I am in the process of converting the mbox file to a different format 
- this will not work with broken mbox files like this.
2) List archives are not correctly rebuilt by ~mailman/bin/arch

Do you know how this problem occurs? Should I report this to mailman's 
bug tracker?

Do you have any idea how to fix the problem? I'd be happy to hack a 
newline in all necessary places - but the box is a little big...

Do you have any knowledge of some fix-my-mbox-program that does the 
job? Can anyone help me with a perl one-liner or sth?

Thx,
   Bastian

-- 
 Bastian Friedrich  [EMAIL PROTECTED]
 Adress  Fon available on my HP   http://www.bastian-friedrich.de/
\~\
\ A chicken is an egg's way of producing more eggs.

--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp


Re: [Mailman-Users] broken mbox files: No blank lines before From

2006-01-22 Thread Mark Sapiro
Bastian Friedrich wrote:

The mbox archive of one of my lists (~ 15.000 mails in 2 years) is a 
little broken:  In approx. 5% (guessed) of the cases, two mails are not 
correctly separated by a blank line, i.e. the From  header of the 
latter email is directly under the last (non-empty) line of the 
preceding letter. This results in some problems:


Do you have any knowledge of some fix-my-mbox-program that does the 
job? Can anyone help me with a perl one-liner or sth?


bin/cleanarch

should do the job.

-- 
Mark Sapiro [EMAIL PROTECTED]   The highway is for gamblers,
San Francisco Bay Area, Californiabetter use your sense - B. Dylan

--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp


Re: [Mailman-Users] broken mbox files: No blank lines before From

2006-01-22 Thread Mark Sapiro
Mark Sapiro wrote:

bin/cleanarch

should do the job.


bin/cleanarch seems to be broken. A patch is attached that will fix it,
or you can download a fixed version at
http://cvs.sourceforge.net/viewcvs.py/mailman/mailman/bin/cleanarch?view=log

-- 
Mark Sapiro [EMAIL PROTECTED]   The highway is for gamblers,
San Francisco Bay Area, Californiabetter use your sense - B. Dylan

--- test-mailman-2.1/bin/cleanarch  2005-08-26 19:40:16.0 -0700
+++ test-mailman/bin/cleanarch  2006-01-22 12:22:55.15625 -0800
@@ -59,7 +60,7 @@
 # From RFC 2822, a header field name must contain only characters from 33-126
 # inclusive, excluding colon.  I.e. from oct 41 to oct 176 less oct 072.  Must
 # use re.match() so that it's anchored at the beginning of the line.
-fre = re.compile(r'[\041-\071\073-\0176]+')
+fre = re.compile(r'[\041-\071\073-\176]+')
 
 
 
--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp

Re: [Mailman-Users] broken mbox files: No blank lines before From

2006-01-22 Thread Bastian Friedrich
Hi,

On Sunday 22 January 2006 19:55, Mark Sapiro wrote:
 Bastian Friedrich wrote:
  not correctly separated by a blank line, i.e. the From  header of
  the latter email is directly under the last (non-empty) line of the
  preceding letter.

 bin/cleanarch

There it was... directly at my hands... :)

Thx a lot, cleanarch did a perfect job (although my fscked up SuSE Linux 
installation will not execute it... *grrr*)

   Bastian

-- 
 Bastian Friedrich  [EMAIL PROTECTED]
 Adress  Fon available on my HP   http://www.bastian-friedrich.de/
\~\
\ My keyboard has an F1 key. Where is the NASCAR key?

--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp


Re: [Mailman-Users] broken mbox files: No blank lines before From

2006-01-22 Thread Thomas Hochstein
Bastian Friedrich schrieb:

 Do you have any knowledge of some fix-my-mbox-program that does the 
 job? Can anyone help me with a perl one-liner or sth?

Look for a typical From-Line like
| From [EMAIL PROTECTED] Fri Sep 8 13:21:43 2003
(I'd use a regular expression for that) with a non-empty line before
it and replace that $line with \n$line.

-thh
--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=showamp;file=faq01.027.htp