On Sat, Dec 3, 2011 at 11:51 PM, Alex Teslik wrote: > OpenWebMail has HTML handling and HTML to text conversions specifically for > email. They are tested and could probably be integrated into mhonarc with > minimal effort. > > HTML handling/scrubbing: > http://openwebmail.acatysmoof.com/dev/svnweb/index.pl/openwebmail/view/trunk/src/cgi-bin/openwebmail/modules/htmlrender.pl > > HTML->Text: > http://openwebmail.acatysmoof.com/dev/svnweb/index.pl/openwebmail/view/trunk/src/cgi-bin/openwebmail/modules/htmltext.pl
It would be interesting to see what kind of test data has been used to verify how good it is at sanitizing data and how well it handles specially crafted large emails. A quick scan at some of the regexes indicate some things may still get through. If you are interested, you can examine the comments of mhonarc's mhtxthtml.pl filter to get an idea of the crap one has to deal with. --ewh