Ramchandra Apte <maniandra...@gmail.com> added the comment:

The problem is not in re, it is because you are passing '。' to re.split which 
in Python 2.x is actually passed as '\xe3\x80\x82'.
You should pass u'。' to re.compile.
Could we raise a SyntaxError when in a progam a unicode character is in a bytes 
string?
Python 3 does so; it raises "SyntaxError: bytes can only contain ASCII literal 
characters." when you execute b'。'

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue14068>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to