I wrote an mbox to Maildir script that I think is better than the other
alternatives out there, assuming that the mbox uses "Status: RO" and
"X-Status: ADF" headers to keep track of message flags (pine, mutt, and I
think UW-IMAP do this).
The main design goals of this script are Simplicity and Correctness.
Usage is simple:
[pmak@lina pmak]$ ls -l mbox
-rw------- 1 pmak pmak 152548699 Dec 25 04:43 mbox
[pmak@lina pmak]$ ./perfect_maildir Maildir < mbox
Inserted 10011 messages into maildir Maildir in 35 seconds
On an AMD Duron 1GHz, my script can process about 280 messages per second.
perfect_maildir does not attempt to find all the mbox files in a directory
and convert them; I think that's better handled by a shell script such as
this (untested):
su
cd /var/spool/mail
for x in '*'; do
maildirmake ~$x/Maildir
perfect_maildir ~$x/Maildir < $x
chown -R $x ~$x/Maildir
chgrp -R mail ~$x/Maildir
done
#!/usr/bin/perl
# "Simple but Perfect" mbox to Maildir converter v0.1
# by Philip Mak <[EMAIL PROTECTED]>
# Usage: perfect_maildir ~/Maildir < mbox
# Simple - only converts one mbox (can use script in one-liners)
# Perfect - message Flags/X-Flags are converted; "^>From ." line is unescaped
# I wrote this script after being unsatisfied with existing mbox to
# maildir converters. By making it "Simple", code complexity is kept
# low thus making it easy to program and debug. At the same time,
# since it only converts one mbox at a time, it is perfect for use in
# a shell ``for'' loop (for example).
# As for being "Perfect", to the best of my knowledge this script does
# the conversion correctly in all cases; it will translate "Status"
# and "X-Status" fields into maildir info, and it correctly detects
# where messages begin and end. (This is only version 0.1 so I may
# have messed something up though. Please send me feedback!)
# NOTE: The MUA ``mutt'' has a bug/feature where in the message index,
# it claims that all maildir messages have 0 lines unless they have a
# "Lines:" header set. perfect_maildir does not attempt to add the
# "Lines:" header; you may want to reconfigure ``mutt' to display byte
# size instead of lines instead by adding the following line to your
# ~/.muttrc file:
#
# set index_format="%4C %Z %{%b %d} %-15.15L (%4c) %s"
# check for valid arguments
my ($maildir) = @ARGV;
if (!$maildir) {
print STDERR "Usage: perfect_maildir ~/Maildir < mbox\n";
exit 1;
}
# check for writable maildir
unless (-w "$maildir/cur") {
print STDERR "Cannot write to $maildir/cur\n";
exit 1;
}
unless (-w "$maildir/new") {
print STDERR "Cannot write to $maildir/new\n";
exit 1;
}
my $num = 0;
my $time = time;
repeat:
# read header
my $headers = '';
my $flags = '';
my $subject = '';
while (my $line = <STDIN>) {
# detect end of headers
last if $line eq "\n";
# strip "From" line from header
$headers .= $line unless $line =~ /^From ./;
# detect flags
$flags .= $1 if $line =~ /^Status: ([A-Z]+)/;
$flags .= $1 if $line =~ /^X-Status: ([A-Z]+)/;
$subject = $1 if $line =~ /^Subject: (.*)$/;
}
$num++;
# open output file
my $file;
if ($flags =~ /O/) {
$file = "$maildir/cur/$time.$num.$ENV{HOSTNAME}";
my $extra = '';
$extra .= 'F' if $flags =~ /F/; # flagged
$extra .= 'R' if $flags =~ /A/; # replied
$extra .= 'S' if $flags =~ /R/; # seen
$extra .= 'T' if $flags =~ /D/; # trashed
$file .= ":2,$extra" if $extra;
} else {
$file = "$maildir/new/$time.$num.$ENV{HOSTNAME}";
}
# filter out the "DON'T DELETE THIS MESSAGE -- FOLDER INTERNAL DATA" message
$file = '/dev/null' if ($num == 1 and $subject eq "DON'T DELETE THIS MESSAGE -- FOLDER
INTERNAL DATA");
open(FILE, ">$file");
print FILE "$headers\n";
while (my $line = <STDIN>) {
# detect end of message
last if $line =~ /^From ./;
# unescape "From"
$line =~ s/^>From (.)/From $1/;
print FILE $line;
}
close(FILE);
goto repeat unless eof(STDIN);
my $elapsed = time - $time;
print "Inserted $num messages into maildir $maildir in $elapsed seconds\n";