On Oct 26, 2019, at 21:33, Steven D'Aprano <st...@pearwood.info> wrote: > > IronPython and Jython use whatever .Net and Java use.
Which makes them sequences of UTF-16 code units, not code points. Which is allowed for the Python 2.x unicode type, but would violate the rules for 3.x str, but neither one has a 3.x. If you want to deal with code points, you have to handle surrogates manually. (Actually, IIRC, one of the two has a str type that, despite being 2.x, is unicode rather than bytes, but with some extra undocumented functionality to smuggle bytes around in a str and have it sometimes work.) _______________________________________________ Python-ideas mailing list -- python-ideas@python.org To unsubscribe send an email to python-ideas-le...@python.org https://mail.python.org/mailman3/lists/python-ideas.python.org/ Message archived at https://mail.python.org/archives/list/python-ideas@python.org/message/E4VWU42A5RXJQRJSQMQDEN4W3D2FJNZS/ Code of Conduct: http://python.org/psf/codeofconduct/