Nuno Lucas wrote:
Line endings don't have a direct relation to character encoding. It's true that in theory you would need to know the character encoding to know what a line ending is (like the mentioned line ending Unicode character), but in practice there are only 3 "standard" line endings (LF, CR-LF and CR) and if some file uses any other you would need to use a special program for it, so it's better to treat the file as binary.An ASCII text can use any of the 3 line-endings. Some with an UTF-8 text, ISO-8859-1, or any other. No way to know the line ending by the character encoding.
Yes, but UTF-8 is a _multi-byte_ encoding.If you see an LF byte, you don't know whether this is a single-byte LF or part of a multi-byte sequence. (I'm not sure if this is a problem with UTF-8 in particular, but it certainly is with 16 or 32-bit encodings, such as UTF-16 and UTF-32.)
-- Ulf
smime.p7s
Description: S/MIME Cryptographic Signature
_______________________________________________ Monotone-devel mailing list Monotone-devel@nongnu.org http://lists.nongnu.org/mailman/listinfo/monotone-devel