Serhiy Storchaka added the comment:

Seems as Tk stores pasted "\U000104a2" as surrogate pair "\ud801\udca2". Then 
it encoded in UTF-8 as "\xed\xa0\x81\xed\xb2\xa2" end passed to Python. Python 
converts char* to Unicode object with PyUnicode_FromString() which forbids 
invalid UTF-8 including encoded surrogates.

Please test proposed patch on Windows.

----------
Added file: http://bugs.python.org/file31184/tkinter_string_conv.patch

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue13153>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to