On Thu, 17 Apr 2008 20:57:21 -0700 (PDT) hdante <[EMAIL PROTECTED]> wrote:
> Don't use old 8-bit encodings. Use UTF-8.

Yes, I'll try. But it is a problem when I only want to read the content, not write or create it. I suppose Microsoft's commercial success is to blame: they won't adhere to standards unless doing so makes sense for their business.

I'll change my approach and try filtering the contents with htmllib, mapping the troublesome characters myself.

Anyway, this has been a very instructive dive into Unicode for me, and things are much clearer now. Thanks to everyone for the great help.
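
A minimal sketch of the kind of character mapping mentioned above, not the poster's actual code. It assumes the troublesome characters are Windows-1252 punctuation (curly quotes, dashes, the euro sign) that ended up as code points U+0080 to U+009F because the page was decoded as Latin-1. The name fix_cp1252 is hypothetical, and the sketch is written for Python 3 rather than the Python 2 of the original thread.

```python
def fix_cp1252(text):
    """Remap C1 control characters to their Windows-1252 meaning.

    Assumption: `text` was decoded as Latin-1 although the bytes were
    really Windows-1252, so 0x80-0x9F became invisible control codes.
    """
    fixed = []
    for ch in text:
        code = ord(ch)
        if 0x80 <= code <= 0x9F:
            try:
                # Reinterpret the single byte using the cp1252 codec.
                ch = bytes([code]).decode("cp1252")
            except UnicodeDecodeError:
                # A few bytes (e.g. 0x81) are undefined in cp1252;
                # leave those characters untouched.
                pass
        fixed.append(ch)
    return "".join(fixed)


if __name__ == "__main__":
    sample = "\x93quoted\x94 text"
    print(ascii(fix_cp1252(sample)))
```

Characters outside the U+0080 to U+009F range pass through unchanged, so ordinary accented Latin-1 letters are not affected.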