Re: maildir over mbox?

2002-01-26 Thread David Clarke

On Tue, 22 Jan 2002, David Clarke wrote:
> Don't know why but for me there isn't much of a difference between them,
> everyone else seems to be getting a big difference.  I was however

I just noticed the partition I was testing on was actually ext3, which
probably explains my results.

David.

-- 
All generalizations are false, including this one. -- Mark Twain
-
David Clarke <[EMAIL PROTECTED]> | David Clarke 
Key Fingerprint :  869B 53DD 5E80 E1F0 93F6  9871 0508 0296 5957 F723



Re: maildir over mbox?

2002-01-22 Thread David T-G

Matthew --

...and then Matthew D. Fuller said...
% 
% On Mon, Jan 21, 2002 at 09:26:55PM -0500 I heard the voice of
% Derek D. Martin, and lo! it spake thus:
% > 
% > In which case I would ask, dude, why?  I thought my counterpart at
% > work was a pack rat...  ;-)
% 
% Hmmm
% Well, my current archives have...   *does some quick scripting*
% (ttyp4):{920}% cat temp | dc
% 817181
% 
% So not quite a million messages, but still far more inodes than I'd care
% to eat on /home.

Ouch.  Yeah.


% 
% I rotate my folders manually every few weeks; generally, once a mailbox
% (in Maildir, being an active mailbox) reached 4000-7000 messages, and
% starts taking more than 6 or 7 seconds to open, I tag-all and drop it
% into a mbox, then slot that mbox into my archives.

Makes sense.


% 
% So, does that make ME a packrat?   ;)
% For reference:
% (ttyp4):{921}% du -sh .
% 2.5G.

Well, ...

  [zero] [9:05am] ~/Mail>  cat `find . -type f -print | egrep -v /Z/ | \
  egrep '/F\.'` | egrep '^From ' | wc -l
38124

  [zero] [9:09am] ~/Mail>  cat `find . -type f -print | egrep -v /Z/ | \
  egrep -v '/F\.'` | egrep '^From ' | wc -l
31237

  [zero] [9:13am] ~/Mail>  zcat `find . -type f -print | egrep /Z/ | \
  egrep '/F\.'` | egrep '^From ' | wc -l
   168069

  [zero] [9:16am] ~/Mail>  zcat `find . -type f -print | egrep /Z/ | \
  egrep -v '/F\.'` | egrep '^From ' | wc -l
28940

  [zero] [9:16am] ~/Mail>  echo "0 38124+ 31237+ 168069+ 28940+ pq" | dc
  266370

  [zero] [9:16am] ~/Mail>  du -sh .
  1.1G.

I suppose it does, since I figure I am.  I honestly thought I'd at least
come close, but you're running nearly four times as many messages as I.
Of course, you're only taking up twice as much disk space; you must have
smaller messages overall ;-)

  [zero] [9:18am] ~/Mail>  ls -lFRt | tail
  -rw---   1 davidtg   829 Jan 12  1994 alan.z

How old is your oldest? :-)


% 
% -- 
% Matthew Fuller (MF4839) |[EMAIL PROTECTED]
% Unix Systems Administrator  |[EMAIL PROTECTED]
% Specializing in FreeBSD |http://www.over-yonder.net/
% 
% "The only reason I'm burning my candle at both ends, is because I
%   haven't figured out how to light the middle yet"


:-D
-- 
David T-G  * It's easier to fight for one's principles
(play) [EMAIL PROTECTED] * than to live up to them. -- fortune cookie
(work) [EMAIL PROTECTED]
http://www.justpickone.org/davidtg/Shpx gur Pbzzhavpngvbaf Qrprapl Npg!




msg23553/pgp0.pgp
Description: PGP signature


Re: maildir over mbox?

2002-01-22 Thread David Clarke

On Mon, 21 Jan 2002, Alexander Skwar wrote:
> FWIW, I just did some testing using mutt 1.3.25i on my Athlon 800 MHz,
> 786 MB RAM, and a IBM DDRS-39130W SCSI UW hard drive running reiserfs.
> The Maildir/mbox I tested, had 84.533 messages and about 321 MB.
> Opening the mbox beast took 2:53 minutes, while opening the same
> converted to Maildir (with mutt) took more than 25 minutes.  Go
> figure...

A little different on my system, the mailbox I tested had 13028 emails,
and was only fairly small, 36 meg.  On a MAXTOR 6L040J2 running over
ata66 on a reiser partition.  Athlon 1000MHz, with 384MB ram.

mbox 2.88s user 0.25s system 93% cpu 3.357 total

maildir 3.10s user 0.82s system 94% cpu 4.014 total

Don't know why but for me there isn't much of a difference between them,
everyone else seems to be getting a big difference.  I was however
surprised with the cpu use.  Looks like both are fairly heavy on my
system.  The times are fairly average, I ran it a few times and it made
little difference, around 0.05s, and not much difference to the cpu
usage.

-- 
All generalizations are false, including this one. -- Mark Twain
-
David Clarke <[EMAIL PROTECTED]> | David Clarke 
Key Fingerprint :  869B 53DD 5E80 E1F0 93F6  9871 0508 0296 5957 F723



Re: maildir over mbox?

2002-01-21 Thread Matthew D. Fuller

On Mon, Jan 21, 2002 at 09:26:55PM -0500 I heard the voice of
Derek D. Martin, and lo! it spake thus:
> 
> Um...  Oh, are you European?  I seem to recall that Europeans switch
> the meaning of '.' and ',' in numbers, as compared to us US types...
> So perhaps you meant eighty-four thousand five hundred thirty-three
> messages?
> 
> In which case I would ask, dude, why?  I thought my counterpart at
> work was a pack rat...  ;-)

Hmmm
Well, my current archives have...   *does some quick scripting*
(ttyp4):{920}% cat temp | dc
817181

So not quite a million messages, but still far more inodes than I'd care
to eat on /home.

I rotate my folders manually every few weeks; generally, once a mailbox
(in Maildir, being an active mailbox) reached 4000-7000 messages, and
starts taking more than 6 or 7 seconds to open, I tag-all and drop it
into a mbox, then slot that mbox into my archives.

So, does that make ME a packrat?   ;)
For reference:
(ttyp4):{921}% du -sh .
2.5G.



-- 
Matthew Fuller (MF4839) |[EMAIL PROTECTED]
Unix Systems Administrator  |[EMAIL PROTECTED]
Specializing in FreeBSD |http://www.over-yonder.net/

"The only reason I'm burning my candle at both ends, is because I
  haven't figured out how to light the middle yet"



Re: maildir over mbox?

2002-01-21 Thread Thomas Hurst

* Alexander Skwar ([EMAIL PROTECTED]) wrote:

> With mutt, I get the same kind of results.  However, other MUAs behave
> differently.  For instance, when I did the testing, Evolution opened
> the same Maildir way faster than it handled the mbox file (which
> contained the same messages). mutt seems to be *extremely* optimized
> for mbox.

Evolution is aimed at maildir, and so does metadata caching.  Probably
worth a look by anyone interested in adding similar functionality to
mutt (hello Mr Elkins :)

And yes, mutt's optimized for mbox style stuff, in that it does minimal
writes 'n' stuff which lesser clients probably don't. 'n' stuff.

> reiserfs.

I wonder what sort of interesting things you could do with mailfs..
having the underlying filesystem know something about the format of
messages and how they're accessed would probably be good for those users
who absolutely demand support for insanely huge mailspools.

> The Maildir/mbox I tested, had 84.533 messages and about 321 MB.
> Opening the mbox beast took 2:53 minutes, while opening the same
> converted to Maildir (with mutt) took more than 25 minutes.  Go
> figure...

I think the obvious answer here is "don't use mailboxes of any format
that big and expect any sort of decent performance outside some very
specific situations" :)

Metadata caching would probably be good on really large mboxes too, but
tbh I think the effort would be better spent on a decent mail archiving
tool that handles all the different formats, compression etc, so us
must-not-delete-anything obsessives can keep our working folders small
:)



-- 
Thomas 'Freaky' Hurst  -  [EMAIL PROTECTED]  -  http://www.aagh.net/
-
Vax Vobiscum



Re: maildir over mbox?

2002-01-21 Thread Derek D. Martin

At some point hitherto, Alexander Skwar hath spake thusly:
> The Maildir/mbox I tested, had 84.533 messages and about 321 MB.  Opening

Eh?  How does one have .533 messages in a mailbox?  Perhaps fractional
messages are some feature of Maildir that I was unaware of?

Um...  Oh, are you European?  I seem to recall that Europeans switch
the meaning of '.' and ',' in numbers, as compared to us US types...
So perhaps you meant eighty-four thousand five hundred thirty-three
messages?

In which case I would ask, dude, why?  I thought my counterpart at
work was a pack rat...  ;-)


-- 
Derek Martin   [EMAIL PROTECTED]
-
I prefer mail encrypted with PGP/GPG!
GnuPG Key ID: 0x81CFE75D
Retrieve my public key at http://pgp.mit.edu
Learn more about it at http://www.gnupg.org



msg23499/pgp0.pgp
Description: PGP signature


Re: maildir over mbox?

2002-01-21 Thread Christian Ordig

On Mon, Jan 21, 2002 at 10:37:01PM +0100, Alexander Skwar wrote:
> running reiserfs.
well ... as tests showed, ReiserFS seems to be a _really_ slow
beast when it comes to read Maildir folders ... tried with Ext2/3?
Should be really faster.

-- 
Christian Ordig
Germany



Re: maildir over mbox?

2002-01-21 Thread Alexander Skwar

So sprach »Christian Ordig« am 2002-01-21 um 11:22:04 +0100 :
> opening times might be ... but think about updating times and the

Well, that's true, however, updating times aren't *that* important for
me.  When I receive new mail, I let procmail sort it into appropriate
mailfiles; each list has got its own mailfile plus some other personal
files.  Now, when new mail arrives, mutt notices that there's new mail
in one of the files, and I'll change to that file.  So I find myself
*very* often opening mailfiles.  And since the difference is that
tremendously bad for Maildir on reiserfs on my machine, I can't stand to
wait that extremely long.

Futher, updating times in mbox with mutt aren't that bad, after all.
This may be due to the fact, that I much more often delete messages
which are at the end of the file, which seems to be faster than
deleteing huge/a lot of messages from the middle of the mbox.

> "no locking needed" goodies :-)

I don't use NFS, so I don't need this goody.

Alexander Skwar
-- 
How to quote:   http://learn.to/quote (german) http://quote.6x.to (english)
Homepage:   http://www.iso-top.de  | Jabber: [EMAIL PROTECTED]
   iso-top.de - Die günstige Art an Linux Distributionen zu kommen
   Uptime: 7 days 0 hours 12 minutes



Re: maildir over mbox?

2002-01-21 Thread Alexander Skwar

So sprach »Benjamin Michotte« am 2002-01-20 um 19:33:10 +0100 :
> > What are the benefits of using one type over the other?
> opening a mbox with ± 7000 mails : less than 10 seconds.
> opening the same in Maildir : 3 minutes...

With mutt, I get the same kind of results.  However, other MUAs behave
differently.  For instance, when I did the testing, Evolution opened the
same Maildir way faster than it handled the mbox file (which contained
the same messages).  mutt seems to be *extremely* optimized for mbox.

FWIW, I just did some testing using mutt 1.3.25i on my Athlon 800 MHz,
786 MB RAM, and a IBM DDRS-39130W SCSI UW hard drive running reiserfs.
The Maildir/mbox I tested, had 84.533 messages and about 321 MB.  Opening
the mbox beast took 2:53 minutes, while opening the same converted to
Maildir (with mutt) took more than 25 minutes.  Go figure...

Alexander Skwar
-- 
How to quote:   http://learn.to/quote (german) http://quote.6x.to (english)
Homepage:   http://www.iso-top.de  | Jabber: [EMAIL PROTECTED]
   iso-top.de - Die günstige Art an Linux Distributionen zu kommen
   Uptime: 6 days 1 hour 14 minutes



Re: maildir over mbox?

2002-01-21 Thread Andy Davidson

I wrote:
>%  And it appears that the archive mailboxes *have* to be mboxes.
>% If I try to save a message to an existing maildir folder, mutt objects.

At 12:17 PM 1/21/02 -0500, David T-G wrote:
>Um, that shouldn't be the case.  mutt will happily read and write mbox,
>Maildir, MMDF, and MH mailboxes with no complaints.  Are you sure you're
>using the trailing slash (dunno if it's really necessary)?  What is the
>objection?  Can you give us more details?

It actually didn't object; it simply created it as an mbox.  But the answer 
was even simpler than the trailing slash, which I had.  What I didn't have 
--- and thought I did --- was the set mbox_type="Maildir" in .muttrc

Ooops!  All is well now

andy



Andy Davidson   --- Pheon Research
   If you know what you're doing, you're not learning anything.



Re: maildir over mbox?

2002-01-21 Thread David T-G

Andy --

...and then Andy Davidson said...
% 
% At 04:50 AM 1/21/02 -0600, Matthew D. Fuller wrote:
% >
% >Personally, I use maildir for all my 'active' mailboxes (read: the ones
...
% >I use mbox for my archive mailboxes, because it's simpler and more compact
...
% 
% I am in the process of converting to using mutt, so I've been playing

Welcome!


% a bit.  And it appears that the archive mailboxes *have* to be mboxes.
% If I try to save a message to an existing maildir folder, mutt objects.

Um, that shouldn't be the case.  mutt will happily read and write mbox,
Maildir, MMDF, and MH mailboxes with no complaints.  Are you sure you're
using the trailing slash (dunno if it's really necessary)?  What is the
objection?  Can you give us more details?


HTH & HAND

:-D
-- 
David T-G  * It's easier to fight for one's principles
(play) [EMAIL PROTECTED] * than to live up to them. -- fortune cookie
(work) [EMAIL PROTECTED]
http://www.justpickone.org/davidtg/Shpx gur Pbzzhavpngvbaf Qrprapl Npg!




msg23462/pgp0.pgp
Description: PGP signature


Re: maildir over mbox?

2002-01-21 Thread Will Yardley

Michael Elkins wrote:
> 
> I'd be curious to get some feedback on my header caching patch for maildir
> folders (can be found at http://www.sigpipe.org:8080/mutt/).

ok a little more feedback. overall performance is a little zippier, but
if i leave a folder open and it receives messages, i get the 'folder was
externally modified' message, and sometimes the message indicator seems
to have jumped up (although maybe i'm just hallucinating).

this makes sense, but it might be nice if there were a way around it.

w



Re: maildir over mbox?

2002-01-21 Thread Andy Davidson

At 04:50 AM 1/21/02 -0600, Matthew D. Fuller wrote:
>
>Personally, I use maildir for all my 'active' mailboxes (read: the ones
>that mail gets delivered to and I read) because it's that much safer,
>easier and more efficient to alter, and roughly similar in speed to open.
>I use mbox for my archive mailboxes, because it's simpler and more compact
>(I don't need to blow a few million inodes on mail archives, thank you
>very much), and it's faster on the mailboxes with tens or hundreds of
>thousands of messages.

I am in the process of converting to using mutt, so I've been playing
a bit.  And it appears that the archive mailboxes *have* to be mboxes.
If I try to save a message to an existing maildir folder, mutt objects.

>And, having spent rather some time lately writing code to parse mbox's,
>I'd like to make the following general comment:
>Bah.

Agreed.

andy



Re: maildir over mbox?

2002-01-21 Thread Derek D. Martin

At some point hitherto, Thomas Hurst hath spake thusly:
> * Derek D. Martin ([EMAIL PROTECTED]) wrote:
> 
> > At some point hitherto, Will Yardley hath spake thusly:
> > > our office mail machine is (unfortunately) linux with ext2, and i
> > > can attest to the fact that Maildir is pretty slow on ext2.
> >
> > And most other filesystems...  Try it on FAT.  =8^)
> 
> I think the overhead of opening and closing tonnes of small files is
> inherintly going to be slower than one big file on reading.

Right...

> -% mutt -Re 'push q' -f test-mbox
> 6.80s user 0.75s system 85% cpu 8.810 total
> 
> -% mutt -Re 'push q' -f test-maildir
> 7.39s user 2.38s system 32% cpu 29.882 total

Yeah, that's what I mean.  And that's with your optimized filesystem
and a pretty beefy system.  

Now don't get me wrong, I'm not saying that maildir doesn't have
advantages.  It does.  For me though, this is a significant enough
trade-off to make sticking with mbox worthwhile.  I'm impatient! :)

-- 
Derek Martin   [EMAIL PROTECTED]
-
I prefer mail encrypted with PGP/GPG!
GnuPG Key ID: 0x81CFE75D
Retrieve my public key at http://pgp.mit.edu
Learn more about it at http://www.gnupg.org



msg23443/pgp0.pgp
Description: PGP signature


Re: maildir over mbox?

2002-01-21 Thread Thomas Hurst

* Derek D. Martin ([EMAIL PROTECTED]) wrote:

> At some point hitherto, Will Yardley hath spake thusly:
> > our office mail machine is (unfortunately) linux with ext2, and i
> > can attest to the fact that Maildir is pretty slow on ext2.
>
> And most other filesystems...  Try it on FAT.  =8^)

I think the overhead of opening and closing tonnes of small files is
inherintly going to be slower than one big file on reading.

Maildir is a simpler format for the MUA; one of mutt's advantages is with
mbox it does minimal writes, so for instance when it changes a status
flag, it just overwrites the old one, where as a lesser client may well
end up rewriting the entire mailbox from that point.  With Maildir, even
if the MUA rewrites the entire message each time the change is always
limited to one small file.

Maildir's advantages are in reliability (it's harder to corrupt 1000
small files than 1 big one), and modification cost
(moving/deleting/editing a message is a constant time operation, where
as mbox will potentially get slower as it gets bigger.  This isn't so
bad when the mailbox is small, or when you're modifying messages near
the end, but things like message archiving is going to be faster (and
easier) with maildir.

> > i haven't tried copying one of my large Maildir folders over to one
> > of my machines (FreeBSD, UFS filesystem with soft-updates) to see
> > how much faster it is, but i've heard that the difference is pretty
> > large.
>
> But how does it compare to mbox on the same FS?  I'll bet it's still
> significantly slower.

Well, I'm running FreeBSD 4.5 + UFS+FFS+SU and dirhashing, let's see..

I did try this a few months ago, btw, Maildir added about 25% to opening
time; good metadata caching could well make the difference here.
Personally, though, I'd use a minimal mbox type file over a dbm for that;
dbm's involve a lot of seeking, which is what we're trying to reduce :)

Anyway;

-% cat archive/2001/lists/cvs-all/11-2001.cvs-all
archive/2001/lists/cvs-all/12-2001.cvs-all >test-mbox
-% grep -c '^From ' test-mbox
7870

-% time mutt -Re 'push q' -f test-mbox
6.75s user 0.80s system 99% cpu 7.618 total

It took about a minute to convert to a mailbox;

-% mutt -Re 'push q' -f test-maildir
6.96s user 1.56s system 97% cpu 8.784 total

Those are cached values, though, so they're not really measuring
filesystem performance, s, let's malloc 300MB and fill it with
zero's, and:

-% mutt -Re 'push q' -f test-mbox
6.80s user 0.75s system 85% cpu 8.810 total

-% mutt -Re 'push q' -f test-maildir
7.39s user 2.38s system 32% cpu 29.882 total

Witness the overhead of all that seeking.

This is a Dual 466MHz Celeron w/ 384MB and a 20GB UDMA/33 drive.

-- 
Thomas 'Freaky' Hurst  -  [EMAIL PROTECTED]  -  http://www.aagh.net/
-
You've been telling me to relax all the way here,
and now you're telling me just to be myself?
-- The Return of the Secaucus Seven



Re: maildir over mbox?

2002-01-21 Thread Matthew D. Fuller

On Sun, Jan 20, 2002 at 06:45:54PM -0800 I heard the voice of
Michael Elkins, and lo! it spake thus:
> 
> Mutt attempts to compensate for this by using quoted-printable encoding when
> it detects things that might break a signature, thus escaping the problem.
> But yes, mbox format is more susceptable to corruption of this form.

Mutt also bypasses the problem by not doing From_-escaping when the
message has its handy-dandy Content-Length: header (which makes it fun to
parse the files if you don't think of it...  "Why does this 8-message
folder show 12 messages?").

Personally, I use maildir for all my 'active' mailboxes (read: the ones
that mail gets delivered to and I read) because it's that much safer,
easier and more efficient to alter, and roughly similar in speed to open.
I use mbox for my archive mailboxes, because it's simpler and more compact
(I don't need to blow a few million inodes on mail archives, thank you
very much), and it's faster on the mailboxes with tens or hundreds of
thousands of messages.

And, having spent rather some time lately writing code to parse mbox's,
I'd like to make the following general comment:
Bah.



-- 
Matthew Fuller (MF4839) |[EMAIL PROTECTED]
Unix Systems Administrator  |[EMAIL PROTECTED]
Specializing in FreeBSD |http://www.over-yonder.net/

"The only reason I'm burning my candle at both ends, is because I
  haven't figured out how to light the middle yet"



Re: maildir over mbox?

2002-01-21 Thread Christian Ordig

On Sun, Jan 20, 2002 at 09:39:52PM -0500, Derek D. Martin wrote:
> But how does it compare to mbox on the same FS?  I'll bet it's still
> significantly slower.
opening times might be ... but think about updating times and the
"no locking needed" goodies :-)

-- 
Christian Ordig
Germany



Re: maildir over mbox?

2002-01-20 Thread Will Yardley

one last thing - if you're using the patch by david champion to count
attachments, it won't work with this patch.  this is because mutt
doesn't look at the message file at all... so all files show up as if
they had one attachment.

w



Re: maildir over mbox?

2002-01-20 Thread Will Yardley

Michael Elkins wrote:
> 
> I'd be curious to get some feedback on my header caching patch for
> maildir folders (can be found at http://www.sigpipe.org:8080/mutt/). 

(thanks to michael for helping me to get this to compile)... anyway
finally got this to work.  note that you have to put '--enable-cache' in
your configure args.

also, the libdb2 include file on debian didn't have 'db_create' (which
configure uses to test if the db stuff is working);
s/db_create/db_open/g fixed that.. so if you have and older berkley db
include file, you might need to do that.  me figured it out so i think
he is going to put something more generic in the patch perhaps.

the header caching thing seems to speed things up by about double
it does take a fair amount longer the first time (while it's building
the db, i assume)

zugzug% time /usr/bin/mutt -f mail/Trash-010202/
/usr/bin/mutt -f mail/Trash-010202/  1.06s user 0.27s system 69% cpu
1.922 total

zugzug% time ./src/build/mutt-1.3.26/mutt  -f mail/Trash-010202/
./src/build/mutt-1.3.26/mutt -f mail/Trash-010202/  2.32s user 0.44s
system 87% cpu 3.165 total

if i remove the db file, time is:

/usr/bin/mutt -f mail/Trash-010202/  3.15s user 1.06s system 81% cpu
5.163 total

this is for a folder with about 6k messages. the subjective time it
takes to open is a bit longer than the time reported here.

in any event, there's definitely a lot of improvement here.  personally
i think it would be great if this made it into the main source tree
before 1.4, but i realize some people might object to the caching thing.

w



Re: maildir over mbox?

2002-01-20 Thread Benjamin Michotte

On Mon, Jan 21, 2002 at 02:29:22AM, Christian Ordig wrote:
> well ... promises of ReiserFS should even tell us "it's optimized for 
> filesystems holding thousands of small files" ... 
well, I think it's "it's optimized for... hum, nothing" ;)
> 
> Number of messages: 9089 (mutt-users archive of 2001)
7939 (mutt-users & mutt-dev)
> Hardware: P100, 128MB RAM, Socket7 ASUS Mainboard, 2.5GB Western Digital
>   IDE Harddrive
P2-350, 390MB RAM, ASUS Mainboard (P2-B), 20GB Quantum IDE HDD
> System: OpenBSD 2.9
Linux 2.4.17 on Slackware 8.0 (the best linux distribution ;p (yep, it's
a troll ;p)
> Mutt-Version: 1.2.5 compile options: 
> -DOMAIN
> -DEBUG
> -HOMESPOOL  -USE_SETGID  -USE_DOTLOCK  -USE_FCNTL  +USE_FLOCK
> -USE_IMAP  -USE_GSS  -USE_SSL  -USE_POP  +HAVE_REGCOMP  -USE_GNU_REGEX  
> +HAVE_COLOR  +HAVE_PGP  -BUFFY_SIZE -EXACT_ADDRESS  +ENABLE_NLS
Mutt 1.3.26i (2002-01-18)
Compile options:
-DOMAIN
-DEBUG
-HOMESPOOL  -USE_SETGID  +USE_DOTLOCK  -DL_STANDALONE  
+USE_FCNTL  -USE_FLOCK
-USE_POP  +USE_IMAP  -USE_GSS  -USE_SSL  -USE_SASL  
+HAVE_REGCOMP  -USE_GNU_REGEX  
+HAVE_COLOR  +HAVE_START_COLOR  +HAVE_TYPEAHEAD  +HAVE_BKGDSET  
+HAVE_CURS_SET  +HAVE_META  +HAVE_RESIZETERM  
+HAVE_PGP  -BUFFY_SIZE -EXACT_ADDRESS  -SUN_ATTACHMENT  
+ENABLE_NLS  -LOCALES_HACK  +HAVE_WC_FUNCS  +HAVE_LANGINFO_CODESET
+HAVE_LANGINFO_YESEXPR  
+HAVE_ICONV  -ICONV_NONTRANS  +HAVE_GETSID  +HAVE_GETADDRINFO
 
> Filesystem: UFS, mounted sync
ext3 since yesteray ;)

> ... strange ...
Maybe Linux is a really poor kernel ;)

> Christian Ordig
---end quoted text---

cu,
binny

-- 

question = ( to ) ? be : ! be;
   -- Wm. Shakespeare

°v°  Benjamin Michotte<[EMAIL PROTECTED]>
_o_  web  : http://www.baby-linux.net



msg23403/pgp0.pgp
Description: PGP signature


Re: maildir over mbox?

2002-01-20 Thread Benjamin Michotte

On Mon, Jan 21, 2002 at 02:31:56AM, Christian Ordig wrote:
> 
> little question: cached or first opening ?
the first opening takes about 20 seconds (for this 7900 mails) and when
it's cached, it takes about 10-15 seconds, which is really more
acceptable ;)

> Christian Ordig
---end quoted text---

binny

-- 

question = ( to ) ? be : ! be;
   -- Wm. Shakespeare

°v°  Benjamin Michotte<[EMAIL PROTECTED]>
_o_  web  : http://www.baby-linux.net



msg23400/pgp0.pgp
Description: PGP signature


Re: maildir over mbox?

2002-01-20 Thread Samuel Padgett

Derek D. Martin <[EMAIL PROTECTED]> writes:

> No manager I've ever worked for would tolerate waiting 3 minutes to
> open their inbox...

That's funny because where I work, we use Lotus Notes, and I'm
sure many managers routinely wait this long for Notes to open
their inboxes (particularly if they don't replicate).  Remarkable
what you can get used to, eh? 

It's always a relief to come home a use a *real* mail client ;-)

Sam "OT--Sorry" Padgett



Re: maildir over mbox?

2002-01-20 Thread Will Yardley

Derek D. Martin wrote:
> 
> But how does it compare to mbox on the same FS?  I'll bet it's still
> significantly slower.

but with mbox, the entire file has to be stated every time the file is
read or modified. with a large file, this can be pretty resource
intensive, and can also be time consuming.

this is a big problem with POP3 - and this is a big advantage for
machines where most users are connecting remotely.  Maildir keeps the
load lower, and in many cases is faster to boot.

your assumption may also be faulty because the problem ext2 has is with
opening all those little files; so it doesn't necessarily follow that
mbox will also be faster on other filesystems.  in other words, Maildir
is (i'm pretty sure) inherently faster, but is limited by file system
performance.

there was an article somewhere (i don't have the link) on this
(specifically on the fact that ext2 and Maildir could be slow); i'll
see if i can dig up the link.

i'm excited to check out me's header caching patch - i didn't realize
there was a patch available - so thanks for letting us know!

w




Re: maildir over mbox?

2002-01-20 Thread budsz

On Sun, Jan 20, 2002 at 09:21:44PM -0500, Derek D. Martin wrote:
>It's not any safer if you do it RIGHT.  In computer science, you want
>to tend to optimize for the common case, and the common case when
>reading e-mail is wanting quick access.  :)

Absolutely, I mean in my experience, I'll choise maildir format to separate
message from someone, because it's more difficult if your type mailbox is mbox
(1 file for many message) compare with maildir (1 message is 1 file). hey it's
my opinion OK, I don't know about the other, IMHO..:)


-- 
budsz



msg23397/pgp0.pgp
Description: PGP signature


Re: maildir over mbox?

2002-01-20 Thread Michael Elkins

Derek D. Martin wrote:
> I'm no expert, but it strikes me that OPENING maildir mailboxes on ANY
> filesystem will ALWAYS be slower than mbox, because of what you need
> to do.  An mbox mailbox will generally have little fragmentation on

I'd be curious to get some feedback on my header caching patch for maildir
folders (can be found at http://www.sigpipe.org:8080/mutt/).  Initial
reports are that it greatly speeds things up since after the first time you
load a mailbox, you only have to traverse the list of files in a directory
without actually opening them up for reading.  I'm not familiar with the
structure of inodes and how fast it is to traverse the list of files in a
directory, but I suspect it will be significantly faster than the current
method.



Re: maildir over mbox?

2002-01-20 Thread Derek D. Martin

At some point hitherto, Will Yardley hath spake thusly:
> Christian Ordig wrote:
> > 
> > Filesystem: UFS, mounted sync
> [...] 
> > Are there others having such poor performance with Maildir as Benjamin
> > has? And with which filesystem OS combinations?
> 
> our office mail machine is (unfortunately) linux with ext2, and i can
> attest to the fact that Maildir is pretty slow on ext2.

And most other filesystems...  Try it on FAT.  =8^)

> i haven't tried copying one of my large Maildir folders over to one of
> my machines (FreeBSD, UFS filesystem with soft-updates) to see how much
> faster it is, but i've heard that the difference is pretty large.

But how does it compare to mbox on the same FS?  I'll bet it's still
significantly slower.


-- 
Derek Martin   [EMAIL PROTECTED]
-
I prefer mail encrypted with PGP/GPG!
GnuPG Key ID: 0x81CFE75D
Retrieve my public key at http://pgp.mit.edu
Learn more about it at http://www.gnupg.org



msg23394/pgp0.pgp
Description: PGP signature


Re: maildir over mbox?

2002-01-20 Thread Michael Elkins

Andy Davidson wrote:
> As a slight aside on this discussion, I had read somewhere --- citation
> lost --- that the munging of mboxes to escape lines beginning "From " in a
> message to ">From " messed up PGP signing.  Is this valid? [I suspect not,
> because I see lots of signed messages and you can't *all* be using
> maildirs, can you? :-)]

Mutt attempts to compensate for this by using quoted-printable encoding when
it detects things that might break a signature, thus escaping the problem.
But yes, mbox format is more susceptable to corruption of this form.



Re: maildir over mbox?

2002-01-20 Thread Derek D. Martin

At some point hitherto, Roman Neuhauser hath spake thusly:
> This format can get _very_ slow with large mailboxes on filesystems that do
> not handle directoris with many files in them. This should include the
> Linux ext2fs.
>
> FreeBSD post-4.4 FFS with softupdates and dirhash should shine with this
> format.

I'm no expert, but it strikes me that OPENING maildir mailboxes on ANY
filesystem will ALWAYS be slower than mbox, because of what you need
to do.  An mbox mailbox will generally have little fragmentation on
most Unix-like filesystems, so when you open it (in the mailbox sense,
as opposed to the filesystem sense, i.e. to get all the headers), it
will be something like this:

  open
  read
  read
  read
  read
  close

Whereas a maildir format mailbox has many small files, which could be
anywhere on the disk partition.  So when you open one, you have to go
through these contortions:

  open
  read
  close

  seek (the disk heads, not the syscall)

  open
  read
  close

  seek 

  open
  read
  close

  seek

I can't see how this could ever be fast...


-- 
Derek Martin   [EMAIL PROTECTED]
-
I prefer mail encrypted with PGP/GPG!
GnuPG Key ID: 0x81CFE75D
Retrieve my public key at http://pgp.mit.edu
Learn more about it at http://www.gnupg.org



msg23392/pgp0.pgp
Description: PGP signature


Re: maildir over mbox?

2002-01-20 Thread Will Yardley

Christian Ordig wrote:
> 
> Filesystem: UFS, mounted sync
[...] 
> Are there others having such poor performance with Maildir as Benjamin
> has? And with which filesystem OS combinations?

our office mail machine is (unfortunately) linux with ext2, and i can
attest to the fact that Maildir is pretty slow on ext2.

i haven't tried copying one of my large Maildir folders over to one of
my machines (FreeBSD, UFS filesystem with soft-updates) to see how much
faster it is, but i've heard that the difference is pretty large.

if you look at past threads, you'll see a lot of mention of Maildir
being slow; this is usually from linux-centric people who don't realize
the problem is with ext2 :>

w



Re: maildir over mbox?

2002-01-20 Thread Derek D. Martin

At some point hitherto, budsz hath spake thusly:
> >yes, I know. I tried to convert my mbox to Maildirs, but about 3 minutes
> >to open a folder is really awfull, so I keep mbox
> 
> If we look in speed to read right..? how about savety...? let's say I want
> to copy paste 1000 email to some place, I think maildir format is better
> (could separate each other message). 

It's not any safer if you do it RIGHT.  In computer science, you want
to tend to optimize for the common case, and the common case when
reading e-mail is wanting quick access.  :)

No manager I've ever worked for would tolerate waiting 3 minutes to
open their inbox...


-- 
Derek Martin   [EMAIL PROTECTED]
-
I prefer mail encrypted with PGP/GPG!
GnuPG Key ID: 0x81CFE75D
Retrieve my public key at http://pgp.mit.edu
Learn more about it at http://www.gnupg.org



msg23390/pgp0.pgp
Description: PGP signature


Re: maildir over mbox?

2002-01-20 Thread Andy Davidson

As a slight aside on this discussion, I had read somewhere --- citation
lost --- that the munging of mboxes to escape lines beginning "From " in a
message to ">From " messed up PGP signing.  Is this valid? [I suspect not,
because I see lots of signed messages and you can't *all* be using
maildirs, can you? :-)]

andy



Andy Davidson   --- Pheon Research
   If you know what you're doing, you're not learning anything.



Re: maildir over mbox?

2002-01-20 Thread Christian Ordig

> Oh my good... convert my reiserfs partition to ext3...
> about 10-15 seconds to open my mutt Maildir now !!!
ooops ... I should have already read this mail before answering the
last subthread ,-)

little question: cached or first opening ?

> Absolutly... reiserfs sucks.
*g* 

-- 
Christian Ordig
Germany



Re: maildir over mbox?

2002-01-20 Thread Christian Ordig

On Sun, Jan 20, 2002 at 09:39:05PM +0100, Benjamin Michotte wrote:
> On Sun, Jan 20, 2002 at 09:01:39PM, Christian Ordig wrote:
> > uhhh ... what kind of system did you use for measurement??
> P2-350 with a 20Gb HDD running Linux 2.4.17 on a Slackware 8.0.
> My ~/mail is on a 600 Mb reiserfs partition. I think I will convert my
> /home dir to ext3 and then try Maildir to compare.
well ... promises of ReiserFS should even tell us "it's optimized for 
filesystems holding thousands of small files" ... 

> > on my P100 with a quite old HDD running OpenBSD 2.9 it takes
> > about one minute to open the whole mutt archive of last year
> > (about 9900 messages)...
> It takes about 7 seconds to open my mutt archive with 7914 messages.
ok this system should be quite faster per default than the stuff I use,
but should UFS be really _that_ faster?
I took a watch and measured time.
First opening the folder (sorting by threads and displaying) takes
57seconds. (in words: fifty-seven  :-)
Number of messages: 9089 (mutt-users archive of 2001)
Hardware: P100, 128MB RAM, Socket7 ASUS Mainboard, 2.5GB Western Digital
  IDE Harddrive
System: OpenBSD 2.9
Mutt-Version: 1.2.5 compile options: 
-DOMAIN
-DEBUG
-HOMESPOOL  -USE_SETGID  -USE_DOTLOCK  -USE_FCNTL  +USE_FLOCK
-USE_IMAP  -USE_GSS  -USE_SSL  -USE_POP  +HAVE_REGCOMP  -USE_GNU_REGEX  
+HAVE_COLOR  +HAVE_PGP  -BUFFY_SIZE -EXACT_ADDRESS  +ENABLE_NLS

Filesystem: UFS, mounted sync

> yes, I know. I tried to convert my mbox to Maildirs, but about 3 minutes
> to open a folder is really awfull, so I keep mbox
sure it is... but I think it should really be faster with your hardware,
shouldn't it? (I just guess so, since the HDD alone should already be
faster than my old IDE drive ...)

... strange ...

maybe I'll copy a big folder over to my Dual PIII system with a 
UDMA100 drive having Linux and Ext2 and verify the poor results.

Are there others having such poor performance with Maildir as Benjamin
has? And with which filesystem OS combinations?

confused greetings ... :-)

-- 
Christian Ordig
Germany



Re: maildir over mbox?

2002-01-20 Thread budsz

On Sun, Jan 20, 2002 at 09:39:05PM +0100, Benjamin Michotte wrote:
>On Sun, Jan 20, 2002 at 09:01:39PM, Christian Ordig wrote:
>> uhhh ... what kind of system did you use for measurement??
>P2-350 with a 20Gb HDD running Linux 2.4.17 on a Slackware 8.0.
>My ~/mail is on a 600 Mb reiserfs partition. I think I will convert my
>/home dir to ext3 and then try Maildir to compare.
>
>> on my P100 with a quite old HDD running OpenBSD 2.9 it takes
>> about one minute to open the whole mutt archive of last year
>> (about 9900 messages)...
>It takes about 7 seconds to open my mutt archive with 7914 messages.
>
>> the advantage of Maildir over mbox I see: deleting a single message
>> in a really big mailbox is nothing more than simply deleting one
>> file with mbox it means writing the whole folder again leaving out
>> this one message .
>yes, I know. I tried to convert my mbox to Maildirs, but about 3 minutes
>to open a folder is really awfull, so I keep mbox

If we look in speed to read right..? how about savety...? let's say I want
to copy paste 1000 email to some place, I think maildir format is better
(could separate each other message). 


-- 
budsz



msg23384/pgp0.pgp
Description: PGP signature


Re: maildir over mbox?

2002-01-20 Thread Benjamin Michotte

On Sun, Jan 20, 2002 at 07:33:10PM, Benjamin Michotte wrote:
> > What are the benefits of using one type over the other?
> opening a mbox with ± 7000 mails : less than 10 seconds.
> opening the same in Maildir : 3 minutes...
Oh my good... convert my reiserfs partition to ext3...
about 10-15 seconds to open my mutt Maildir now !!!
Absolutly... reiserfs sucks.

>
---end quoted text---

cu,
binny which is going to convert all his HDD to ext3 now ;)

-- 

question = ( to ) ? be : ! be;
   -- Wm. Shakespeare

°v°  Benjamin Michotte<[EMAIL PROTECTED]>
_o_  web  : http://www.baby-linux.net



msg23380/pgp0.pgp
Description: PGP signature


Re: maildir over mbox?

2002-01-20 Thread Roman Neuhauser

> Date: Sun, 20 Jan 2002 12:08:44 -0600
> From: David Rock <[EMAIL PROTECTED]>
> To: Mutt Users List <[EMAIL PROTECTED]>
> Subject: maildir over mbox?
> 
> I was just wondering what the real differences were between maildir and
> mbox formats? I know mbox is an appended file while maildir is a
> separate directory for each mail (each what, exactly)? 
> 
> What are the benefits of using one type over the other?



mbox

This is the traditional mailbox format. It is a simple plain text
file with the messages in it. Each message has this structure:
   
From [EMAIL PROTECTED] date
Headers

Body


Since MUAs recognize the beginning of a message in an mbox by a line
starting with "From " (this is commonly called "the From_ line"),
any lines in message bodies starting with this string will get
rewritten as ">From " by your MDA.

It is not a good idea to use this mailbox format if you have your
mail on a NFS share. Many NFS implementations have buggy locking,
and you can easily have your mailboxes stomped.

See mbox(5) for more info (different man pages installed at least
with mutt and qmail).
   
maildir

This is a newer format, introduced by the qmail MTA. Each mailbox is
a directory containing three subdirectories: tmp/, new/, and cur/.
When your MDA delivers a message to a maildir mailbox, it writes it
to the tmp/ directory, and when the message is succesfully written,
it moves it to the new/ directory. Since this is atomical operation,
the mailbox is safe from curruption.

It is quite safe to use over NFS.

Also, it can be easier when you want to process your mail using regular
tools provided in the base system: you can use a small shell script to
move old mail away from your "active" mailboxes to an "archive" using
find(1), xargs(1), and alike.
   
This format can get _very_ slow with large mailboxes on filesystems that do
not handle directoris with many files in them. This should include the
Linux ext2fs.
   
FreeBSD post-4.4 FFS with softupdates and dirhash should shine with this
format.

See maildir(5) (installed at least with qmail) for more info.



-- 
FreeBSD 4.4-STABLE
9:47PM up 4:11, 8 users, load averages: 2.25, 2.44, 2.60



Re: maildir over mbox?

2002-01-20 Thread Benjamin Michotte

On Sun, Jan 20, 2002 at 09:01:39PM, Christian Ordig wrote:
> uhhh ... what kind of system did you use for measurement??
P2-350 with a 20Gb HDD running Linux 2.4.17 on a Slackware 8.0.
My ~/mail is on a 600 Mb reiserfs partition. I think I will convert my
/home dir to ext3 and then try Maildir to compare.

> on my P100 with a quite old HDD running OpenBSD 2.9 it takes
> about one minute to open the whole mutt archive of last year
> (about 9900 messages)...
It takes about 7 seconds to open my mutt archive with 7914 messages.

> the advantage of Maildir over mbox I see: deleting a single message
> in a really big mailbox is nothing more than simply deleting one
> file with mbox it means writing the whole folder again leaving out
> this one message .
yes, I know. I tried to convert my mbox to Maildirs, but about 3 minutes
to open a folder is really awfull, so I keep mbox

> Christian Ordig
---end quoted text---

binny

-- 

question = ( to ) ? be : ! be;
   -- Wm. Shakespeare

°v°  Benjamin Michotte<[EMAIL PROTECTED]>
_o_  web  : http://www.baby-linux.net



msg23376/pgp0.pgp
Description: PGP signature


Re: maildir over mbox?

2002-01-20 Thread Christian Ordig

On Sun, Jan 20, 2002 at 07:33:10PM +0100, Benjamin Michotte wrote:
> opening a mbox with ± 7000 mails : less than 10 seconds.
> opening the same in Maildir : 3 minutes...
uhhh ... what kind of system did you use for measurement??

on my P100 with a quite old HDD running OpenBSD 2.9 it takes
about one minute to open the whole mutt archive of last year
(about 9900 messages)...

the advantage of Maildir over mbox I see: deleting a single message
in a really big mailbox is nothing more than simply deleting one
file with mbox it means writing the whole folder again leaving out
this one message .

Not to tell the locking which is obsolete with Maildir, which makes
it easier with Mailfolders over NFS as I also use it ;-)

-- 
Christian Ordig
Germany



Re: maildir over mbox?

2002-01-20 Thread Daniel Eisenbud

On Sun, Jan 20, 2002 at 12:08:44PM -0600, David Rock <[EMAIL PROTECTED]> wrote:
> I was just wondering what the real differences were between maildir and
> mbox formats? I know mbox is an appended file while maildir is a
> separate directory for each mail (each what, exactly)? 
> 
> What are the benefits of using one type over the other?
> 
> Thanks.

Maildir has the significant advantage that it requires no locking,
making it safer to use in environments without good locking primitives,
and that your whole mailbox can never get corrupted by one delivery
going awry, since each message lives in its own file.  Having had my
inbox mysteriously corrupted once years ago, I think that that is a
significant advantage.  Maildir has the disadvantages that it is
nonstandard (not really a problem -- you can very easily save everything
in a maildir mailbox to an mbox mailbox if you want to give it to
someone whose mailer doesn't understand maildir) and that it takes up
more space on disk, since even a tiny message will use up a whole disk
block.  Speed can go either way, depending on the filesystem.
Traditional unix filesystems are very slow at reading huge directories.
Perhaps mutt should support maildirs split up into subdirectories for
this reason?  But it doesn't now.

-Daniel

-- 
Daniel E. Eisenbud
[EMAIL PROTECTED]

"We should go forth on the shortest walk perchance, in the spirit of
undying adventure, never to return,--prepared to send back our embalmed
hearts only as relics to our desolate kingdoms."
--Henry David Thoreau, "Walking"



Re: maildir over mbox?

2002-01-20 Thread Benjamin Michotte

Hi,

On Sun, Jan 20, 2002 at 12:08:44PM, David Rock wrote:
> I was just wondering what the real differences were between maildir and
> mbox formats? I know mbox is an appended file while maildir is a
> separate directory for each mail (each what, exactly)? 
one folder for a box, each mail in a separate file.

> What are the benefits of using one type over the other?
opening a mbox with ± 7000 mails : less than 10 seconds.
opening the same in Maildir : 3 minutes...

> David Rock
---end quoted text---

cu,
binny

-- 

question = ( to ) ? be : ! be;
   -- Wm. Shakespeare

°v°  Benjamin Michotte<[EMAIL PROTECTED]>
_o_  web  : http://www.baby-linux.net



msg23372/pgp0.pgp
Description: PGP signature


maildir over mbox?

2002-01-20 Thread David Rock

I was just wondering what the real differences were between maildir and
mbox formats? I know mbox is an appended file while maildir is a
separate directory for each mail (each what, exactly)? 

What are the benefits of using one type over the other?

Thanks.

-- 
David Rock
[EMAIL PROTECTED]



msg23371/pgp0.pgp
Description: PGP signature