On Thu, 17 Apr 2008 20:57:21 -0700 (PDT)
hdante <[EMAIL PROTECTED]> wrote:

>  Don't use old 8-bit encodings. Use UTF-8.

Yes, I'll try. But is a problem when I only want to read, not that I'm trying 
to write or create the content.
To blame I suppose is Microsoft's commercial success. They won't adhere to 
standars if that doesn't make sense for their business.

I'll change the approach trying to filter the contents with htmllib and mapping 
on my own those troubling characters.
Anyway this has been a very instructive dive into unicode for me, I've got 
things cleared up now.

Thanks to everyone for the great help.
-- 
http://mail.python.org/mailman/listinfo/python-list

Reply via email to