On May 29, 2007, at 3:13 AM, Henri Sivonen wrote:

To avoid stepping on the toes of Charmod more than is necessary, I suggest making it non-conforming for a document to have bytes in the 0x80…0x9F range when the character encoding is declared to be one of the ISO-8859 family encodings.
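
A minimal sketch of what such a conformance check might look like, assuming the raw bytes and the declared encoding label are already available. The helper name `has_c1_bytes` is hypothetical and not taken from any spec or library:

```python
import re

# Bytes 0x80..0x9F: the C1 control range in the ISO-8859 family.
C1_BYTES = re.compile(rb"[\x80-\x9f]")

def has_c1_bytes(raw: bytes, declared_encoding: str) -> bool:
    """Return True if a document declared as ISO-8859-* contains 0x80..0x9F bytes.

    Hypothetical helper for illustration only; a real checker would also
    need to normalise encoding aliases such as "latin1".
    """
    name = declared_encoding.strip().lower()
    if not name.startswith("iso-8859-"):
        return False  # the proposed rule only covers the ISO-8859 family
    return C1_BYTES.search(raw) is not None
```

Under the suggestion above, a document for which this returns True would be non-conforming.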

(UA conformance requires in some cases these bytes to be decoded in a Charmod-violating way, but reality trumps Charmod for UA conformance. While I'm at it: Surely there are other ISO-8859 family encodings besides ISO-8859-1 that require decoding using the corresponding windows-* family decoder?)

I don't know of any other ISO-8859 encodings requiring this, but for all Unicode encodings and numeric entity references, compatibility requires interpreting this range of code points in the WinLatin1 way.
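
To illustrate the WinLatin1 interpretation, here is a rough sketch in Python. The function name `decode_latin1_as_windows1252` is made up for this example, and the fallback for the five bytes that Python's `cp1252` codec leaves undefined is an assumption about the desired behaviour, not something stated above:

```python
import html

def decode_latin1_as_windows1252(raw: bytes) -> str:
    """Decode bytes labelled ISO-8859-1 using the windows-1252 mapping.

    Assumption: bytes that Python's cp1252 codec leaves undefined
    (0x81, 0x8D, 0x8F, 0x90, 0x9D) fall back to the code point with
    the same number.
    """
    out = []
    for b in raw:
        try:
            out.append(bytes([b]).decode("cp1252"))
        except UnicodeDecodeError:
            out.append(chr(b))
    return "".join(out)

# Byte 0x80 becomes the euro sign rather than a C1 control character.
euro = decode_latin1_as_windows1252(b"\x80")

# Python's html.unescape applies the same remapping to numeric references.
also_euro = html.unescape("&#128;")
```

Both `euro` and `also_euro` end up as U+20AC, which is the behaviour that compatibility with existing content calls for.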

Regards,
Maciej
