Ramchandra Apte <[email protected]> added the comment: The problem is not in re, it is because you are passing '。' to re.split which in Python 2.x is actually passed as '\xe3\x80\x82'. You should pass u'。' to re.compile. Could we raise a SyntaxError when in a progam a unicode character is in a bytes string? Python 3 does so; it raises "SyntaxError: bytes can only contain ASCII literal characters." when you execute b'。'
---------- _______________________________________ Python tracker <[email protected]> <http://bugs.python.org/issue14068> _______________________________________ _______________________________________________ Python-bugs-list mailing list Unsubscribe: http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com
