Antoine Pitrou added the comment:
Yes, I think 3.3 is correct here. I think it was Benjamin who fixed/improved
the behaviour of casing methods. Compare 3.3:
>>> "ß".upper()
'SS'
with 3.2:
>>> "ß".upper()
'ß'
Also, 3.2 loses information:
>>> "KİTAP".lower().upper()
'KITAP'
>>> ascii("KİTAP".lower().upper())
"'KITAP'"
while 3.3 retains it:
>>> "KİTAP".lower().upper()
'KİTAP'
>>> ascii("KİTAP".lower().upper())
"'KI\\u0307TAP'"
You can get the combined form again with unicodedata.normalize:
>>> unicodedata.normalize("NFC", "KİTAP".lower().upper())
'KİTAP'
----------
nosy: +benjamin.peterson, haypo, lemburg, pitrou
resolution: -> invalid
status: open -> closed
_______________________________________
Python tracker <[email protected]>
<http://bugs.python.org/issue17252>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe:
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com