Adam Olsen <[email protected]> added the comment: I believe this is a duplicate of issue #3297. When given a high unicode scalar value directly in the source (rather than in escaped form) python will split it into surrogates, even on a UTF-32 build where those surrogates are nonsensical and ill-formed.
Patches for Issue #3672 probably made this more visible. ---------- nosy: +Rhamphoryncus _______________________________________ Python tracker <[email protected]> <http://bugs.python.org/issue7045> _______________________________________ _______________________________________________ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
