Amaury Forgeot d'Arc <[EMAIL PROTECTED]> added the comment: The escaped regexp is not utf-8 (why should it be?), but it still matches the same bytes in the searched text, which has to be utf-8 encoded anyway:
>>> text = u"été".encode('utf-8') >>> regexp = u"é".encode('utf-8') >>> re.findall(regexp, text) ['\xc3\xa9', '\xc3\xa9'] >>> escaped_regexp = re.escape(regexp) >>> re.findall(escaped_regexp, text) ['\xc3\xa9', '\xc3\xa9'] ---------- nosy: +amaury.forgeotdarc _______________________________________ Python tracker <[EMAIL PROTECTED]> <http://bugs.python.org/issue2650> _______________________________________ _______________________________________________ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com