On Tue, Jul 15, 2014 at 1:47 PM, Michael Torrie <torr...@gmail.com> wrote:
> Python 3 uses UTF-4 encoding under the hood, with a
> compression optimization that removes leading zeros from binary
> representation of each character.

Sorry to nitpick, but in the interests of terminological accuracy I
have to point out that it's UTF-32 or UCS-4, not UTF-4 :) But
otherwise, yes, quite correct.

And a system that few, but not no, other languages use; I do wonder if
other languages have considered switching to this kind of system, but
avoided it lest jmf start haunting them too...

ChrisA
-- 
https://mail.python.org/mailman/listinfo/python-list
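
For anyone who wants to see the "leading zeros removed" effect for themselves, here is a minimal sketch. It assumes CPython 3.3 or later, where PEP 393 (the flexible string representation) applies; other implementations may store strings differently. The helper name bytes_per_char is made up for illustration. It estimates the storage width per character by comparing sys.getsizeof for two lengths of the same repeated character, so the fixed object header cancels out.

```python
import sys


def bytes_per_char(ch, n=1000):
    """Estimate bytes stored per character for a string made only of `ch`.

    Hypothetical helper for illustration. Subtracting the sizes of two
    strings of different lengths cancels the fixed per-object header,
    leaving only the per-character cost.
    """
    short = ch * n
    longer = ch * (2 * n)
    return (sys.getsizeof(longer) - sys.getsizeof(short)) // n


# One sample character from each range the representation distinguishes
# (assumption: CPython 3.3+ behaviour as described in PEP 393).
samples = ("a", "\u00e9", "\u20ac", "\U0001F600")

for ch in samples:
    print("U+%04X" % ord(ch), bytes_per_char(ch))
```

On CPython 3.3+ I would expect this to report widths of 1, 1, 2 and 4 bytes: one byte per character while everything fits in Latin-1, two while everything fits in the Basic Multilingual Plane, and four otherwise. The width is chosen per string, from its widest character, rather than per character.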