[issue46410] TypeError when parsing regexp with unicode named character sequence escape

2022-01-17 Thread Jirka Marsik
New submission from Jirka Marsik : re.compile(r"\N{name of Unicode Named Character Sequence}"), e.g. re.compile(r"\N{KEYCAP NUMBER SIGN}"), throws a TypeError. The regular expression parser relies on 'unicodedata' to lookup character names. The 'unicodedata' module re

[issue45539] Negative lookaround assertions sometimes leak capture groups

2021-10-20 Thread Jirka Marsik
New submission from Jirka Marsik : When you have capture groups inside a negative lookaround assertion, the strings captured by those capture groups can sometimes survive the failure of the assertion and feature in the returned Match object. Here it is illustrated with lookbehinds