Nuno Lucas wrote:
Line endings don't have a direct relation to character encoding. It's
true that in theory you would need to know the character encoding to
know what a line ending is (like the mentioned line ending Unicode
character), but in practice there are only 3 "standard" line endings
(LF, CR-LF and CR) and if some file uses any other you would need to
use a special program for it, so it's better to treat the file as
binary.

An ASCII text can use any of the 3 line-endings. Some with an UTF-8
text, ISO-8859-1, or any other. No way to know the line ending by the
character encoding.

Yes, but UTF-8 is a _multi-byte_ encoding.
If you see an LF byte, you don't know whether this is a single-byte LF or part of a multi-byte sequence. (I'm not sure if this is a problem with UTF-8 in particular, but it certainly is with 16 or 32-bit encodings, such as UTF-16 and UTF-32.)

-- Ulf

Attachment: smime.p7s
Description: S/MIME Cryptographic Signature

_______________________________________________
Monotone-devel mailing list
Monotone-devel@nongnu.org
http://lists.nongnu.org/mailman/listinfo/monotone-devel

Reply via email to