On 01May2022 23:30, Stefan Ram <r...@zedat.fu-berlin.de> wrote: >Dan Stromberg <drsali...@gmail.com> writes: >>But what about Unicode? Are all 10 bytes newlines in Unicode encodings? > It seems in UTF-8, when a value is above U+007F, it will be > encoded with bytes that always have their high bit set.
Aye. Design festure enabling easy resync-to-char-boundary at an arbitrary point in the file. > But Unicode has NEL "Next Line" U+0085 and other values that > conforming applications should recognize as line terminators. I disagree. Maybe for printing things. But textual data records? I would hope to end them with NL, and only NL (code 10). Cheers, Cameron Simpson <c...@cskk.id.au> -- https://mail.python.org/mailman/listinfo/python-list