Ryan Steele wrote: >Mark Sapiro wrote: >> Ryan Steele wrote: >> >>> Mark Sapiro wrote: >>> >>>> Do you have convert_html_to_plaintext set to Yes? >>>> >>>> >>> Affirmative, I do. >>> >> >> >> Based on what I see, I don't think that is the problem.
It definitely IS the issue with this one. >I'm still working on getting the client to be able to consistently >reproduce the character misrepresentation (UTF-8) issue, so I won't >bother the list with that for now. However, here's a good example of >one with stripped carriage returns. Sorry for the delay. I didn't post >this to the list because I didn't want a search engine spider crawling >the information in the header... feel free to post a reply with those >snipped! > >Before making it to the list (carriage return omission example): > >###################################################################################### > <snip> >X-Mailer: QUALCOMM Windows Eudora Version 7.1.0.9 >Content-type: text/html; charset=us-ascii <snip> > ><html> ><body> >Hello, Ryan<br><br> >I am writing each line flush left<br><br> >And, I have inserted a CR between each line<br><br> >I am copying you on the original<br><br> >And sending a copy to test2<br><br> >I think you'll see that the text shifts 2 spaces to the right<br><br> >And that all the CRs are deleted<br><br> >And that urls get footnoted.<br><br> >For instance, > ><a href="http://www.odnetwork.org/" eudora="autourl">www.odnetwork.org<br> ><br> ></a>Plus the url in my sig block<br><br> >I am typing bold in <b>bold </b>and italics in <i>italics<br><br> ></i>I hope that helps.<br><br> >Bill you will also get a copy of this, which you can ignore.<br><br> >Matt<br> > <snip> </body> ></html> > >###################################################################################### > >After making it to the list (carriage return omission example): > >###################################################################################### > <snip> >X-Content-Filtered-By: Mailman/MimeDel 2.1.8 <snip> >Content-Type: text/plain; charset="us-ascii" >Content-Transfer-Encoding: 7bit <snip> > > > > Hello, Ryan > I am writing each line flush left > And, I have inserted a CR between each line > I am copying you on the original > And sending a copy to test2 > I think you'll see that the text shifts 2 spaces to the right > And that all the CRs are deleted > And that urls get footnoted. > For instance, [1]www.odnetwork.org > Plus the url in my sig block > I am typing bold in bold and italics in italics > I hope that helps. > Bill you will also get a copy of this, which you can ignore. > Matt > <snip> > >References > > 1. http://www.odnetwork.org/ <snip> The original post was html only, thus it was converted to plain text. The indentation and footnoting of hyperlinks is all done by your HTML_TO_PLAIN_TEXT_COMMAND (default = '/usr/bin/lynx -dump %(filename)s'). I'm not sure what 'carriage returns' are missing, but if you're referring to the html that renders as double spaces being renderd single spaced, that's lynx too. If you prefer a double spaced rendering, you can put HTML_TO_PLAIN_TEXT_COMMAND = '/usr/bin/links -dump %(filename)s' in mm_cfg.py to replace lynx with links (verify that you have links and that is the correct path). The rest of what links does with this is very similar to what lynx does. If in the other cases, we are converting utf-8 html to plain text, I think that explains why we 'lose' the character set. I think there are definitely problems in this area. It didn't look like that was the issue from a previous post, but as the messages were incomplete in that example, I may have misinterpreted what was happening. If your client would post plain text or even multipart/alternative, we wouldn't have to convert the html. See <http://www.expita.com/nomime.html#eudora5> (hopefully it also applies to Eudora 7.) -- Mark Sapiro <[EMAIL PROTECTED]> The highway is for gamblers, San Francisco Bay Area, California better use your sense - B. Dylan ------------------------------------------------------ Mailman-Users mailing list Mailman-Users@python.org http://mail.python.org/mailman/listinfo/mailman-users Mailman FAQ: http://www.python.org/cgi-bin/faqw-mm.py Searchable Archives: http://www.mail-archive.com/mailman-users%40python.org/ Unsubscribe: http://mail.python.org/mailman/options/mailman-users/archive%40jab.org Security Policy: http://www.python.org/cgi-bin/faqw-mm.py?req=show&file=faq01.027.htp