On 05/31/2014 09:48 AM, jmf wrote:
> <falsehoods about python and unicode>
Absolutely FALSE. Python 3.3 and up can handle any and all Unicode characters you want to throw at it, without surprises such as what you get in JavaScript. Python 3.3+ behaves as if it uses UTF-32 (UCS-4) under the hood, with a compression optimization (PEP 393) that drops the leading zero bytes: each string is stored with 1, 2, or 4 bytes per character, depending on the widest character it contains.

Windows command consoles are not Unicode compliant, so running Python programs in a command prompt console will often lead to exceptions. Python must convert Unicode to the character set the console is using, and when it hits a character that cannot be encoded, Python defaults to being correct and raises an exception instead of failing silently.
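To make both points concrete, here is a small sketch. The helper names `bytes_per_char` and `try_encode` are made up for this example, and the storage measurement relies on `sys.getsizeof`, so it only says something about CPython 3.3 or later. The `cp437` codec stands in for a typical legacy console code page; your console may well use a different one.

```python
import sys


def bytes_per_char(sample, n=1000):
    """Estimate CPython's per-character storage for a one-character string.

    Hypothetical helper: compares the sizes of two repeated strings so the
    fixed object header cancels out. CPython-specific.
    """
    return (sys.getsizeof(sample * (2 * n)) - sys.getsizeof(sample * n)) // n


def try_encode(text, encoding):
    """Return the encoded bytes, or the UnicodeEncodeError if encoding fails."""
    try:
        return text.encode(encoding)
    except UnicodeEncodeError as exc:
        return exc


if __name__ == "__main__":
    # A character outside the BMP is still one character in Python 3.3+.
    print(len("\U0001F600"))

    # Storage width follows the widest character in the string.
    for ch in ("a", "\u20ac", "\U0001F600"):
        print(repr(ch), bytes_per_char(ch))

    # cp437 has no euro sign, so a strict encode raises instead of
    # silently writing garbage.
    print(repr(try_encode("\u20ac", "cp437")))

    # Opting in to lossy output has to be explicit.
    print("\u20ac".encode("cp437", errors="replace"))
```

If a program really has to print to such a console, passing `errors="replace"` (or setting the `PYTHONIOENCODING` environment variable, for example to `cp437:replace`) trades correctness for not crashing, which is a choice Python leaves to you rather than making silently.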