[issue8859] split() splits on non whitespace char when ther is no separator given.

Ezio Melotti Sun, 30 May 2010 12:17:48 -0700

Ezio Melotti <ezio.melo...@gmail.com> added the comment:

Both on Linux and Windows I get:
>>> '\xa0'.isspace()
False
>>> u'\xa0'.isspace()
True


The Unicode char u'\xa0' is U+00A0 NO-BREAK SPACE, so unicode.split correctly 
considers it a whitespace.
However '\xa0' is not a whitespace, so str.split ignores it.
The correct solution is to convert your string to Unicode and then split.
I'd close this as invalid but I'd like you to confirm that the example I posted 
and that 'split' return the same result on both Linux and Windows before doing 
so (the fact that on Linux works it's probably caused by something else -- e.g. 
the label is already Unicode).

----------
nosy: +ezio.melotti
resolution:  -> invalid
status: open -> pending

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue8859>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

[issue8859] split() splits on non whitespace char when ther is no separator given.

Reply via email to