Re: [Mailman-Users] Cleaning up archive that aren't in mbox format?

2007-09-05 Thread Mark Sapiro
Dennis Carr wrote:
>Mark Sapiro wrote:
>
>> However, there should be an archives/private/ffml.mbox/ffml.mbox file
>> which contains all posts to the ffml list. Look at that file and see
>> if the From_ line in the body of the Fri Jul 6 00:13:27 PDT 2007
>> Happosai's Life of DOOM #6 post is escaped with a leading '>' or not.
>> It should be.
>
>It isn't named with .mbox, but it is the single file with all the
>messages.  What I'm looking at are the messages, but in some cases you
>get the following:
>
>>From [EMAIL PROTECTED]  Fri Jul  6 00:11:24 2007
>From: [EMAIL PROTECTED] ()
>Date: Fri, 06 Jul 2007 07:11:24 -
>Subject: No subject
>Message-ID: <[EMAIL PROTECTED]>
>
>...between parts of the entry, with one line of the message omitted and
>replaced with the above.


Are you sure you aren't looking at the .txt files (e.g.
archives/private/ffml/2007-September.txt) that contain the messages
for the month. This is not what you want. The messages in this file
have been heavily processed by the archiver. It is really just a text
version of the monthly HTML archive.

In addition to the archives/private/ffml directory which contains the
archives there should be an archives/private/ffml.mbox directory that
contains the archives/private/ffml.mbox/ffml.mbox file which has all
(not just one month's) posts as they were delivered to the archiver.

The only reasons why you wouldn't have this is if ARCHIVE_TO_MBOX has
been given some value other than the default 2 or 1 (actually 0 is the
only value other than 2 that would write the html archive), or there
is some permissions issue preventing it from being created or written
at all, but in this latter case, Mailman's 'error' log would be full
of messages about this.

-- 
Mark Sapiro <[EMAIL PROTECTED]>   The highway is for gamblers,
San Francisco Bay Area, Californiabetter use your sense - B. Dylan

--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=show&file=faq01.027.htp


Re: [Mailman-Users] Cleaning up archive that aren't in mbox format?

2007-09-05 Thread Dennis Carr
Mark Sapiro wrote:
> Dennis Carr wrote:
> 
>> My list server, as near as I can tell, does not appear to be parsing out
>>from mbox format.
>> (To backtrack, early last month, I had asked the list for help on
>> correctly parsing out the From information.  It is apparently doing this
>> correctly, as in putting the > in front of the From.)
> 
> 
> What are you looking at that has the properly escaped >From_?
> 
> 
>> Now, though, I have the 2007-July file that has the broken messages
>> still in here.
>>
>> In short, did I run the archiver correctly?  Or for that matter, where
>> should I be looking/running this file/whatever?
> 
> 
> You never answered the questions I asked (see
> ).

Oh, bugger all, sorry.  Python 2.4.4, and there is an email package in
pythonlib.

> However, there should be an archives/private/ffml.mbox/ffml.mbox file
> which contains all posts to the ffml list. Look at that file and see
> if the From_ line in the body of the Fri Jul 6 00:13:27 PDT 2007
> Happosai's Life of DOOM #6 post is escaped with a leading '>' or not.
> It should be.

It isn't named with .mbox, but it is the single file with all the
messages.  What I'm looking at are the messages, but in some cases you
get the following:

>From [EMAIL PROTECTED]  Fri Jul  6 00:11:24 2007
From: [EMAIL PROTECTED] ()
Date: Fri, 06 Jul 2007 07:11:24 -
Subject: No subject
Message-ID: <[EMAIL PROTECTED]>

...between parts of the entry, with one line of the message omitted and
replaced with the above.


> 
> If it is and if
> 
> bin/cleanarch -n < archives/private/ffml.mbox/ffml.mbox
> 
> doesn't report any problems, then
> 
> bin/arch --wipe ffml
> 
> should rebuild the archive.

I'll give that a whirl and see what happens, thanks Mark.   

-Dennis

--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=show&file=faq01.027.htp


Re: [Mailman-Users] Cleaning up archive that aren't in mbox format?

2007-09-03 Thread Mark Sapiro
Dennis Carr wrote:

>My list server, as near as I can tell, does not appear to be parsing out
>from mbox format.
>
>(To backtrack, early last month, I had asked the list for help on
>correctly parsing out the From information.  It is apparently doing this
>correctly, as in putting the > in front of the From.)


What are you looking at that has the properly escaped >From_?


>Now, though, I have the 2007-July file that has the broken messages
>still in here.
>
>In short, did I run the archiver correctly?  Or for that matter, where
>should I be looking/running this file/whatever?


You never answered the questions I asked (see
).

However, there should be an archives/private/ffml.mbox/ffml.mbox file
which contains all posts to the ffml list. Look at that file and see
if the From_ line in the body of the Fri Jul 6 00:13:27 PDT 2007
Happosai's Life of DOOM #6 post is escaped with a leading '>' or not.
It should be.

If it is and if

bin/cleanarch -n < archives/private/ffml.mbox/ffml.mbox

doesn't report any problems, then

bin/arch --wipe ffml

should rebuild the archive.

If there are problems with the ffml.mbox file, then you can run

bin/cleanarch < archives/private/ffml.mbox/ffml.mbox > tmpfile
mv tmpfile archives/private/ffml.mbox/ffml.mbox

before running bin/arch as above.

If there are problems, then I think you must somehow have the wrong
python email library. Refer to the questions in my above referenced
reply to your original post.

-- 
Mark Sapiro <[EMAIL PROTECTED]>   The highway is for gamblers,
San Francisco Bay Area, Californiabetter use your sense - B. Dylan

--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=show&file=faq01.027.htp


[Mailman-Users] Cleaning up archive that aren't in mbox format?

2007-09-02 Thread Dennis Carr
My list server, as near as I can tell, does not appear to be parsing out
from mbox format.

(To backtrack, early last month, I had asked the list for help on
correctly parsing out the From information.  It is apparently doing this
correctly, as in putting the > in front of the From.)

Now, though, I have the 2007-July file that has the broken messages
still in here.

In short, did I run the archiver correctly?  Or for that matter, where
should I be looking/running this file/whatever?

-Dennis Carr
--
Mailman-Users mailing list
Mailman-Users@python.org
http://mail.python.org/mailman/listinfo/mailman-users
Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py
Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-users/archive%40jab.org

Security Policy: 
http://www.python.org/cgi-bin/faqw-mm.py?req=show&file=faq01.027.htp