Re: Problem with national characters in XHTML

Lennart Borgman Thu, 29 Sep 2005 07:11:48 -0700

Piet van Oostrum wrote:

Mathias Dahl <[EMAIL PROTECTED]> (MD) wrote:

MD> I might be wrong here, but doesn't UTF-8 encode all characters in
MD> Latin-1 (ISO 8859-1) exactly as they are *in* Latin-1 encoding?


No. Iso 8859-1 uses 1 byte for all characters, while UTF-8 uses two bytes
for those characters that are in iso-8859-1. What you probably mean is that
the Unicode value (code point) for each iso-8859-1 character is the same as
its encoding in iso-8859-1.

This is not easy. What you say make it even more interesting why C-q 3 44 RET is stored as 2276 (or what it was) in the XHTML files. How canthat be? (For the context see my earlier mails.)



_______________________________________________
Emacs-devel mailing list
Emacs-devel@gnu.org
http://lists.gnu.org/mailman/listinfo/emacs-devel

Re: Problem with national characters in XHTML

Reply via email to