Matthew Barnett <[email protected]> added the comment:
On a narrow build, "\N{MATHEMATICAL SCRIPT CAPITAL A}" is stored as 2 code
units, and neither re nor regex recombine them when compiling a regex or
looking for a match.
regex supports \xNN, \uNNNN and \UNNNNNNNN and \N{XYZ} itself, so they can be
used in a raw string literal, but it doesn't recombine code units.
I could add recombination to regex at some point if time has passed and no
further progress has been made in the language's support for Unicode.
----------
_______________________________________
Python tracker <[email protected]>
<http://bugs.python.org/issue12749>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe:
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com