New submission from Lovely Grace <lovelygrac...@hotmail.com>: The following code fails on the print command (see error message bellow the code). I use windows and run the code under command prompt env.
import urllib.request if __name__ == "__main__": sock = urllib.request.urlopen("http://docs.python.org/3.0/library/html.parser.html#html.parser.HTMLParser.handle_startendtag") htmlSource = sock.read() sock.close() htmlSource = htmlSource.decode("utf-8", "strict") print(htmlSource) Error message: Traceback (most recent call last): File "parsehomeless.py", line 68, in <module> print(htmlSource[0:281]) File "C:\Python30\lib\io.py", line 1494, in write b = encoder.encode(s) File "C:\Python30\lib\encodings\cp862.py", line 19, in encode return codecs.charmap_encode(input,self.errors,encoding_map)[0] UnicodeEncodeError: 'charmap' codec can't encode character '\u2014' in position 280: character maps to <undefined> ---------- components: Interpreter Core messages: 82120 nosy: grace severity: normal status: open title: Error printing UTF-8 decoded text type: crash versions: Python 3.1 _______________________________________ Python tracker <rep...@bugs.python.org> <http://bugs.python.org/issue5263> _______________________________________ _______________________________________________ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com