Roundup Robot added the comment:
New changeset 4caa695af94c by Serhiy Storchaka in branch '2.7':
Issue #12728: Different Unicode characters having the same uppercase but
https://hg.python.org/cpython/rev/4caa695af94c
New changeset 47b3084dd6aa by Serhiy Storchaka in branch '3.4':
Issue #12728:
Serhiy Storchaka added the comment:
This solution (with a hardcoded table of equivalent lowercase forms) is temporary. In
the future the re engine will be changed to support correct caseless matching of
different lowercase forms internally.
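As a rough illustration of what a table of equivalent lowercase forms could contain, the sketch below groups lowercase characters that share the same uppercase. It is not the committed table and not the re_cases.py script: the function name is hypothetical, and it assumes Python 3.3 or later, where str.upper() applies full case mappings, so its output may differ from the table in the patch.

```python
import sys
from collections import defaultdict


def lowercase_equivalence_groups():
    """Group lowercase characters that share the same uppercase form.

    Hypothetical helper for illustration only; not the table used by re.
    """
    by_upper = defaultdict(set)
    for code_point in range(sys.maxunicode + 1):
        ch = chr(code_point)
        upper = ch.upper()
        # Keep characters that are their own lowercase and have a
        # distinct uppercase form.
        if ch.lower() == ch and upper != ch:
            by_upper[upper].add(ch)
    # Only groups with two or more members describe an equivalence.
    return [sorted(group) for group in by_upper.values() if len(group) > 1]


if __name__ == "__main__":
    for group in lowercase_equivalence_groups():
        print(" ".join("U+%04X" % ord(ch) for ch in group))
```

Each printed line is one group of code points that a case-insensitive match would presumably need to treat as interchangeable.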
--
resolution:  -> fixed
stage: patch review -> resolved
Changes by Serhiy Storchaka storch...@gmail.com:
Removed file: http://bugs.python.org/file36681/re_ignore_case.patch
___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue12728
___
Changes by Serhiy Storchaka storch...@gmail.com:
Removed file: http://bugs.python.org/file37087/re_cases.py
Serhiy Storchaka added the comment:
Could anyone please review this?
The script has been updated so that it is now compatible with 2.7. There are some
differences in the equivalence table between 2.7 and 3.4 (e.g. 'ΐ' (U+0390) is not
equivalent to 'ΐ' (U+1FD3) in 2.7).
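For reference, the pair mentioned above can be inspected with ordinary string methods. This is only a sketch, assuming Python 3.3 or later (full case mappings in str.upper() and the availability of str.casefold()); the variable and function names are illustrative, and it says nothing about how the 2.7 table is built.

```python
# The two code points named in the comment above.
IOTA_TONOS = "\u0390"  # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND TONOS
IOTA_OXIA = "\u1fd3"   # GREEK SMALL LETTER IOTA WITH DIALYTIKA AND OXIA


def describe(ch):
    """Return the lower, upper and casefold forms of ch as code point lists."""
    def to_code_points(s):
        return ["U+%04X" % ord(c) for c in s]

    return {
        "lower": to_code_points(ch.lower()),
        "upper": to_code_points(ch.upper()),
        "casefold": to_code_points(ch.casefold()),
    }


if __name__ == "__main__":
    for ch in (IOTA_TONOS, IOTA_OXIA):
        print("U+%04X" % ord(ch), describe(ch))
```

The two characters keep distinct lowercase forms but share the same uppercase and the same case fold, which is the kind of pair an equivalence table has to cover.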
--
Added file:
Serhiy Storchaka added the comment:
Here are the complete patch and the script used to generate the equivalence table.
--
Added file: http://bugs.python.org/file37086/re_ignore_case_2.patch
Added file: http://bugs.python.org/file37087/re_cases.py
Serhiy Storchaka added the comment:
Here is a preliminary patch which fixes case-insensitive regular expression
matching of unicode strings. It is incomplete: it requires the patches from
issue17381, which fix other aspects of case-insensitive matching.
One bug is left for Turkish letters.
Changes by Terry J. Reedy tjre...@udel.edu:
--
versions: +Python 3.4 -Python 3.2
Guido van Rossum gu...@python.org added the comment:
This bug could do with a little less attitude. That said, I think it is a bug
and should be fixed, at the very least for Python 3.3. As always, it is a
matter of much debate to what extent bugs can be fixed in previous Python
versions
Changes by Arfrever Frehtes Taifersar Arahesis arfrever@gmail.com:
--
nosy: +Arfrever
Terry J. Reedy tjre...@udel.edu added the comment:
I am not sure that everyone will agree that this is a bug, rather than a
feature request, or that if a bug, that it should be changed in existing
releases and possibly break running code. The doc just says, somewhat vaguely,
that IGNORECASE
Tom Christiansen tchr...@perl.com added the comment:
Terry J. Reedy tjre...@udel.edu added the comment:
I am not sure that everyone will agree that this is a bug, rather than a
feature request, or that if a bug, that it should be changed in existing
releases and possibly break running
Changes by Matthew Barnett pyt...@mrabarnett.plus.com:
--
nosy: +mrabarnett
New submission from Tom Christiansen tchr...@perl.com:
The Python re library is broken in its approach to case-insensitive matches. It
erroneously attempts to compare lowercase mappings. This is wrong. You must
compare the Unicode casefolds, not the Unicode casemaps. Otherwise you get
wrong
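The distinction drawn in this report between case mappings and case folds can be sketched with plain string methods. This is a minimal illustration, assuming Python 3.3 or later where str.casefold() is available; the helper names are hypothetical and this is not how the re module itself is implemented.

```python
def equal_by_lowercase(a, b):
    """Compare two strings by their lowercase mappings."""
    return a.lower() == b.lower()


def equal_by_casefold(a, b):
    """Compare two strings by their Unicode case folds."""
    return a.casefold() == b.casefold()


if __name__ == "__main__":
    # LATIN SMALL LETTER LONG S versus a plain "s": both uppercase to "S",
    # but each is already its own lowercase form.
    pairs = [("\u017f", "s"), ("stra\u00dfe", "STRASSE")]
    for a, b in pairs:
        print(repr(a), repr(b),
              "lowercase:", equal_by_lowercase(a, b),
              "casefold:", equal_by_casefold(a, b))
```

Both pairs compare unequal by lowercase mapping and equal by case fold.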
Changes by Tom Christiansen tchr...@perl.com:
--
components: +Regular Expressions -Library (Lib)
type:  -> behavior
Changes by Ezio Melotti ezio.melo...@gmail.com:
--
nosy: +ezio.melotti