New submission from Dogan <[email protected]>:
Hey there,
I believe I've come across a bug. It occurs when you try to lower() the Turkish
uppercase letter "İ". Gonna explain it with example code since it's easier:
>>> len("Ş")
1
>>> len("Ş".lower())
1
>>> len("Ğ")
1
>>> len("Ğ".lower())
1
>>> len("Ö")
1
>>> len("Ö".lower())
1
>>> len("Ç")
1
>>> len("Ç".lower())
1
>>> len("İ")
1
>>> len("İ".lower())
2
When you lower() the Turkish uppercase letter “İ”, it returns a 2 chars long
string with the first character being “i”, and the second being chr(775).
Should it not simply return “i”?
----------
components: Unicode
messages: 325646
nosy: ezio.melotti, vstinner, zamsalak
priority: normal
severity: normal
status: open
title: lower() on Turkish letter "İ" returns a 2-chars-long string
type: behavior
versions: Python 3.6
_______________________________________
Python tracker <[email protected]>
<https://bugs.python.org/issue34723>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe:
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com