Ezio Melotti <ezio.melo...@gmail.com> added the comment:

I think the problem is in the default encoding used when you call unicode() without specifying any encoding.

>>> '\xc5\xa0'.decode('iso-8859-1').split()
[u'\xc5']
>>> '\xc5\xa0'.decode('utf-8').split()
[u'\u0160']
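
For anyone reproducing this on Python 3, the same comparison can be sketched roughly as below. This is only an illustration, not part of the original report: the variable names are made up here, and the byte string needs a b prefix because Python 3 str objects have no decode() method. It appears to show why the two results differ: decoded as ISO-8859-1, the second byte becomes U+00A0 (no-break space), which split() treats as whitespace.

```python
# Two bytes that form one character in UTF-8 but two characters in ISO-8859-1.
raw = b'\xc5\xa0'

latin1_text = raw.decode('iso-8859-1')  # U+00C5 followed by U+00A0 (no-break space)
utf8_text = raw.decode('utf-8')         # the single character U+0160

# split() with no arguments splits on Unicode whitespace, which includes U+00A0.
latin1_parts = latin1_text.split()
utf8_parts = utf8_text.split()

print(ascii(latin1_parts))  # ['\xc5']
print(ascii(utf8_parts))    # ['\u0160']
```

Passing the codec explicitly, as in both calls above, avoids depending on whatever default encoding is in effect.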
----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue8859>
_______________________________________