[issue12728] Python re lib fails case insensitive matches on Unicode data

2014-11-10 Thread Roundup Robot
Roundup Robot added the comment: New changeset 4caa695af94c by Serhiy Storchaka in branch '2.7': Issue #12728: Different Unicode characters having the same uppercase but https://hg.python.org/cpython/rev/4caa695af94c New changeset 47b3084dd6aa by Serhiy Storchaka in branch '3.4': Issue #12728:

[issue12728] Python re lib fails case insensitive matches on Unicode data

2014-11-10 Thread Serhiy Storchaka
Serhiy Storchaka added the comment: This solution (with a hardcoded table of equivalent lowercase forms) is temporary. In the future, the re engine will be changed to support correct caseless matching of different lowercase forms internally. -- resolution: -> fixed stage: patch review -> resolved
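As a rough illustration of what the hardcoded table buys, here is a minimal sketch, assuming a Python 3 interpreter that already contains the fix. The helper name `matches_caseless` is made up for this example and is not part of the re module.

```python
import re

def matches_caseless(pattern: str, text: str) -> bool:
    """Hypothetical helper: True if `text` fully matches `pattern` under re.IGNORECASE."""
    return re.fullmatch(pattern, text, re.IGNORECASE) is not None

# 's' (U+0073) and LATIN SMALL LETTER LONG S (U+017F) are both lowercase
# and share the uppercase 'S', so the table treats them as equivalent.
print(matches_caseless("s", "\u017f"))

# MICRO SIGN (U+00B5) and GREEK SMALL LETTER MU (U+03BC) share the
# uppercase U+039C.
print(matches_caseless("\u00b5", "\u03bc"))

# A one-to-many fold such as U+00DF (sharp s) to 'ss' cannot be expressed
# by a per-character table, so this pair is not matched.
print(matches_caseless("\u00df", "ss"))
```

The last call shows the limit of a one-to-one table, which is presumably part of why the comment calls the solution temporary.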

[issue12728] Python re lib fails case insensitive matches on Unicode data

2014-11-07 Thread Serhiy Storchaka
Changes by Serhiy Storchaka storch...@gmail.com: Removed file: http://bugs.python.org/file36681/re_ignore_case.patch ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue12728 ___

[issue12728] Python re lib fails case insensitive matches on Unicode data

2014-11-07 Thread Serhiy Storchaka
Changes by Serhiy Storchaka storch...@gmail.com: Removed file: http://bugs.python.org/file37087/re_cases.py

[issue12728] Python re lib fails case insensitive matches on Unicode data

2014-11-07 Thread Serhiy Storchaka
Serhiy Storchaka added the comment: Could anyone please review this? The script has been updated so that it is now compatible with 2.7. There are some differences in the equivalence table between 2.7 and 3.4 (e.g. 'ΐ' (U+0390) is not equivalent to 'ΐ' (U+1FD3) in 2.7). -- Added file:

[issue12728] Python re lib fails case insensitive matches on Unicode data

2014-10-31 Thread Serhiy Storchaka
Serhiy Storchaka added the comment: Here are the complete patch and the script used to generate the equivalence table. -- Added file: http://bugs.python.org/file37086/re_ignore_case_2.patch Added file: http://bugs.python.org/file37087/re_cases.py
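The attached re_cases.py is not reproduced in this listing. The following is only a sketch of how such an equivalence table could be derived, under the assumption that two characters are equivalent when each is its own lowercase form and both have the same uppercase mapping. The function name `build_equivalences` is hypothetical.

```python
import sys
from collections import defaultdict

def build_equivalences(limit: int = sys.maxunicode + 1) -> dict:
    """Hypothetical sketch: group lowercase characters by their uppercase mapping.

    Only characters that are their own lowercase and that change under
    upper() are considered; groups with a single member are dropped.
    """
    groups = defaultdict(list)
    for code_point in range(limit):
        ch = chr(code_point)
        if ch.lower() == ch and ch.upper() != ch:
            groups[ch.upper()].append(ch)
    return {upper: chars for upper, chars in groups.items() if len(chars) > 1}

equivalences = build_equivalences()
# Characters whose uppercase is 'S'; includes 's' and U+017F (long s).
print([hex(ord(ch)) for ch in equivalences["S"]])
```

The result depends on the Unicode database shipped with the interpreter, so tables built on different Python versions can differ.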

[issue12728] Python re lib fails case insensitive matches on Unicode data

2014-09-21 Thread Serhiy Storchaka
Serhiy Storchaka added the comment: Here is a preliminary patch that fixes case-insensitive regular expression matching of Unicode strings. It is incomplete: it requires the patches from issue17381, which fix other aspects of case-insensitive matching. One bug is left for Turkish letters.

[issue12728] Python re lib fails case insensitive matches on Unicode data

2013-07-10 Thread Terry J. Reedy
Changes by Terry J. Reedy tjre...@udel.edu: -- versions: +Python 3.4 -Python 3.2

[issue12728] Python re lib fails case insensitive matches on Unicode data

2011-08-26 Thread Guido van Rossum
Guido van Rossum gu...@python.org added the comment: This bug could do with a little less attitude. That said, I think it is a bug and should be fixed, at the very least for Python 3.3. As always, it is a matter of much debate to what extent bugs can be fixed in previous Python versions

[issue12728] Python re lib fails case insensitive matches on Unicode data

2011-08-12 Thread Arfrever Frehtes Taifersar Arahesis
Changes by Arfrever Frehtes Taifersar Arahesis arfrever@gmail.com: -- nosy: +Arfrever

[issue12728] Python re lib fails case insensitive matches on Unicode data

2011-08-12 Thread Terry J. Reedy
Terry J. Reedy tjre...@udel.edu added the comment: I am not sure that everyone will agree that this is a bug, rather than a feature request, or that if a bug, that it should be changed in existing releases and possibly break running code. The doc just says, somewhat vaguely, that IGNORECASE

[issue12728] Python re lib fails case insensitive matches on Unicode data

2011-08-12 Thread Tom Christiansen
Tom Christiansen tchr...@perl.com added the comment: Terry J. Reedy tjre...@udel.edu added the comment: I am not sure that everyone will agree that this is a bug, rather than a feature request, or that if a bug, that it should be changed in existing releases and possibly break running

[issue12728] Python re lib fails case insensitive matches on Unicode data

2011-08-12 Thread Matthew Barnett
Changes by Matthew Barnett pyt...@mrabarnett.plus.com: -- nosy: +mrabarnett

[issue12728] Python re lib fails case insensitive matches on Unicode data

2011-08-11 Thread Tom Christiansen
New submission from Tom Christiansen tchr...@perl.com: The Python re library is broken in its approach to case-insensitive matches. It erroneously attempts to compare lowercase mappings. This is wrong. You must compare the Unicode casefolds, not the Unicode casemaps. Otherwise you get wrong
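The distinction between lowercase mapping and case folding can be seen with plain string methods. This is a minimal sketch using `str.casefold()`, which exists in Python 3.3 and later. The helper `caseless_equal` is a made-up name for illustration and says nothing about how the re module is implemented.

```python
def caseless_equal(a: str, b: str) -> bool:
    """Hypothetical helper: compare two strings by their Unicode case folds."""
    return a.casefold() == b.casefold()

# Comparing lowercase mappings: U+00DF (sharp s) stays as it is under
# lower(), so the two spellings are not considered equal.
print("stra\u00dfe".lower() == "STRASSE".lower())

# Comparing case folds: U+00DF folds to 'ss', so the spellings are equal.
print(caseless_equal("stra\u00dfe", "STRASSE"))

# Final sigma (U+03C2) and sigma (U+03C3) are distinct lowercase letters
# that fold to the same character.
print("\u03c2".lower() == "\u03c3".lower())
print(caseless_equal("\u03c2", "\u03c3"))
```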

[issue12728] Python re lib fails case insensitive matches on Unicode data

2011-08-11 Thread Tom Christiansen
Changes by Tom Christiansen tchr...@perl.com: -- components: +Regular Expressions -Library (Lib) type: -> behavior

[issue12728] Python re lib fails case insensitive matches on Unicode data

2011-08-11 Thread Ezio Melotti
Changes by Ezio Melotti ezio.melo...@gmail.com: -- nosy: +ezio.melotti