Arian Sanusi wrote:
> concerning to unicode, "\n", "\r "and "\r\n" (0x000A, 0x000D and
0x000D+0x000A) should be threatened as newline character
the link says that your application should treat them line terminators,
not that they should all be equal to a new line character.
to split on Unicode
Hi,
concerning to unicode, "\n", "\r "and "\r\n" (0x000A, 0x000D and
0x000D+0x000A) should be threatened as newline character
at least this is how i understand it:
(http://en.wikipedia.org/wiki/Newline#Unicode)
obviously, the re module does not care, and on unix, only threatens \n
as newline c