On Tue, Jan 26, 2010 at 9:57 AM, Caio Chassot <[email protected]> wrote:
> I, for one, would love to get my hands on a useful corpus of craptastic mail.

The Python email package devs have spent quite a lot of time trying
to make sure that the package can take whatever is given to it and end
up with the best possible representation (plus a list of defects).  A
reasonable set of test data is in SVN:

http://svn.python.org/view/python/trunk/Lib/email/test/data/
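
As a rough illustration of the "best possible representation plus a
list of defects" behaviour, here is a minimal sketch using the standard
library's email.message_from_string() and the defects attribute of the
message it returns.  The sample message and its example.com addresses
are made up for this illustration; they are not taken from the test
data above.

```python
import email

# Hypothetical broken message: the Content-Type header claims a
# multipart body, but no boundary parameter is given.
raw = (
    "From: sender@example.com\n"
    "To: rcpt@example.com\n"
    "Subject: broken multipart\n"
    "MIME-Version: 1.0\n"
    "Content-Type: multipart/mixed\n"
    "\n"
    "This body has no boundary markers at all.\n"
)

# Parsing does not raise; the parser keeps what it can make sense of.
msg = email.message_from_string(raw)

# Headers are still available in the usual way.
print(msg["Subject"])

# Problems found while parsing are recorded on the message object.
defect_names = [type(d).__name__ for d in msg.defects]
for name in defect_names:
    print(name)
```

The point is that the caller still gets a usable message object and can
decide for itself how strict to be about whatever defects were recorded.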

I'm sure other language/library developers have made similar efforts.
Test data, just like code, should avoid NIH.

(If you want large corpora of mail, rather than mail specifically
designed to illustrate flaws, then there are many of those too.  Just
ask for links.)

Cheers,
Tony
_______________________________________________
[email protected] mailing list
List help: http://lists.ranchero.com/listinfo.cgi/email-init-ranchero.com
