On 01May2022 23:30, Stefan Ram <[email protected]> wrote: >Dan Stromberg <[email protected]> writes: >>But what about Unicode? Are all 10 bytes newlines in Unicode encodings? > It seems in UTF-8, when a value is above U+007F, it will be > encoded with bytes that always have their high bit set.
Aye. Design festure enabling easy resync-to-char-boundary at an arbitrary point in the file. > But Unicode has NEL "Next Line" U+0085 and other values that > conforming applications should recognize as line terminators. I disagree. Maybe for printing things. But textual data records? I would hope to end them with NL, and only NL (code 10). Cheers, Cameron Simpson <[email protected]> -- https://mail.python.org/mailman/listinfo/python-list
