[issue46264] 'I'.lower() should give non dotted i for LANG=tr_TR

2022-01-09 Thread Frank Feuerbacher
Frank Feuerbacher added the comment: Using casefold did not help. Tested on Ubuntu with LANG=en_US.UTF-8 ([GCC 9.3.0] on linux):

>>> folded_1: str = "Turkish I: İı".casefold()
>>> folded_2: str = "tUrkİsh i: iI".casefold()
>>> print(folded_1)
turkish i: i̇ı
>>> print(folded_2)
turki̇sh i: ii
>>>
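The output above is easier to read at the code point level. The following is a minimal sketch that only uses built-in str methods; the helper name show_codepoints is made up for illustration. It shows that casefold() maps the dotted capital İ (U+0130) to "i" followed by a combining dot above (U+0307), and leaves the dotless ı (U+0131) unchanged, regardless of locale.

```python
def show_codepoints(text: str) -> list[str]:
    """Return the code points of text as 'U+XXXX' strings (illustrative helper)."""
    return [f"U+{ord(ch):04X}" for ch in text]

# Dotted capital I folds to 'i' + COMBINING DOT ABOVE, not to a plain 'i'.
print(show_codepoints("\u0130".casefold()))

# Dotless small i is left as-is by casefold().
print(show_codepoints("\u0131".casefold()))

# ASCII capital I always folds to ASCII 'i'; str methods ignore the locale.
print(show_codepoints("I".casefold()))
```

So two strings that a Turkish reader would consider the same word in different case may still compare unequal after casefold().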

[issue46264] 'I'.lower() should give non dotted i for LANG=tr_TR

2022-01-07 Thread Éric Araujo
Éric Araujo added the comment: I suppose the casefold method does not help? https://docs.python.org/3.10/library/stdtypes.html#str.casefold

[issue46264] 'I'.lower() should give non dotted i for LANG=tr_TR

2022-01-06 Thread Frank Feuerbacher
Frank Feuerbacher added the comment: Oh joy. Kodi media server is having Unicode issues and this won't help. I'm trying to see how bad it is. The main use for case transformations is for internal keyword lookup/monocasing. Settings, filenames on monocased filesystems, etc. are caseless. On

[issue46264] 'I'.lower() should give non dotted i for LANG=tr_TR

2022-01-05 Thread Serhiy Storchaka
Serhiy Storchaka added the comment: If you are looking for case-insensitive string comparison, look at locale.strcoll() and locale.strxfrm(). They are locale-aware.
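A rough sketch of what using locale.strxfrm() could look like. The function name sort_with_locale and the default locale name "tr_TR.UTF-8" are assumptions for illustration; the locale may not be installed on a given system, in which case this sketch silently keeps the current collation setting.

```python
import locale

def sort_with_locale(words, locale_name="tr_TR.UTF-8"):
    """Sort words using the collation rules of locale_name, if it is available."""
    previous = locale.setlocale(locale.LC_COLLATE)  # query the current setting
    try:
        try:
            locale.setlocale(locale.LC_COLLATE, locale_name)
        except locale.Error:
            pass  # locale not installed: keep the current collation rules
        # strxfrm() turns each string into a key that sorts in locale order.
        return sorted(words, key=locale.strxfrm)
    finally:
        locale.setlocale(locale.LC_COLLATE, previous)  # restore the old setting

print(sort_with_locale(["b", "c", "a"]))
```

Note that these functions affect how strings are ordered and compared; they do not change what str.lower() or str.upper() return.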

[issue46264] 'I'.lower() should give non dotted i for LANG=tr_TR

2022-01-05 Thread Christian Heimes
Christian Heimes added the comment: Python's stdlib does not support locale-aware Unicode transformations. I recommend that you check out https://pypi.org/project/PyICU.

[issue46264] 'I'.lower() should give non dotted i for LANG=tr_TR

2022-01-04 Thread Frank Feuerbacher
New submission from Frank Feuerbacher: This blasted Turkish I will be the death of us all... https://www.unicode.org/versions/Unicode14.0.0/ch05.pdf has a lovely graphic on page 238 showing how upper/lower casing of the various I's behaves depending on whether the locale is Turkish. It seems that
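For reference, the Turkish casing rules the report describes can be approximated by hand. This is only a sketch under stated assumptions: turkish_lower and turkish_upper are hypothetical helpers, not part of Python's stdlib, and they cover only the four precomposed I characters (not the decomposed form "I" + U+0307 or other locale-specific rules).

```python
# Hypothetical helpers: hand-rolled Turkish mappings for the four I characters.
_TR_LOWER = str.maketrans({"I": "\u0131", "\u0130": "i"})  # I -> ı, İ -> i
_TR_UPPER = str.maketrans({"i": "\u0130", "\u0131": "I"})  # i -> İ, ı -> I

def turkish_lower(text: str) -> str:
    """Lowercase text, mapping I to dotless ı and İ to plain i first."""
    return text.translate(_TR_LOWER).lower()

def turkish_upper(text: str) -> str:
    """Uppercase text, mapping i to dotted İ and ı to plain I first."""
    return text.translate(_TR_UPPER).upper()

print(turkish_lower("ISTANBUL"))
print(turkish_upper("istanbul"))
```

By contrast, the built-in "I".lower() returns "i" whatever LANG is set to, which is the behaviour this issue is about.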