Is python of two minds about what is white space. I notice that split, strip seem to regard u'\xa0' (NO-BREAK SPACE) as white, but that code is not matched by the \s pattern. If this difference is intended can we rely on it continuing?

>>> u'a b'.split()
[u'a', u'b']
>>> u'a\xa0b'.split()
[u'a', u'b']
>>> re.compile(r'\s').search(u'a b')
<_sre.SRE_Match object at 0x00DBB2C0>
>>> re.compile(r'\s').search(u'a\xa0b')
>>>


--
Robin Becker

--
http://mail.python.org/mailman/listinfo/python-list

Reply via email to