I wrote:
>>> msg = u'The ancient greeks used the letter "\U00010143" for the number 5.'
>>> msg[35:-18]
u'"\U00010143"'
>>> greek_five = msg[36:-19]
>>> len(greek_five)
2


After posting, I realized that it's worse than that. I suspect that if
I tried this on a CPython compiled with wide characters, then
len(greek_five) would be 1.

What should it be? 2? 1? Implementation-dependent?

-- Michael Chermside

_______________________________________________
Python-3000 mailing list
[email protected]
http://mail.python.org/mailman/listinfo/python-3000
Unsubscribe: 
http://mail.python.org/mailman/options/python-3000/archive%40mail-archive.com

Reply via email to