On Tue, Jul 15, 2014 at 1:47 PM, Michael Torrie <torr...@gmail.com> wrote:
> Python 3 uses UTF-4 encoding under the hood, with a
> compression optimization that removes leading zeros from binary
> representation of each character.

Sorry to nitpick, but in the interests of terminological accuracy I
have to point out that it's UTF-32 or UCS-4, not UTF-4 :)

But otherwise, yes, quite correct. And a system that few, but not no,
other languages use; I do wonder if other languages have considered
switching to this kind of system, but avoided it lest jmf start
haunting them too...

ChrisA
-- 
https://mail.python.org/mailman/listinfo/python-list

Reply via email to