Re: [WSG] HTML Numeric and Named Entities

Lachlan Hunt Wed, 11 Jan 2006 05:21:16 -0800

liorean wrote:

On 11/01/06, Lachlan Hunt <[EMAIL PROTECTED]> wrote:

As far as character references in HTML are concerned, they have always
referred to the Unicode code points since HTML 2.0.


Ah. I just saw

         BASESET  "ISO 646:1983//CHARSET
                   International Reference Version
                   (IRV)//ESC 2/5 4/0"
         BASESET  "ISO Registration Number 100//CHARSET
                   ECMA-94 Right Part of
                   Latin Alphabet Nr. 1//ESC 2/13 4/1"

in HTML3.2 and

          BASESET  "ISO Registration Number 177//CHARSET
                    ISO/IEC 10646-1:1993 UCS-4 with
                    implementation level 3//ESC 2/5 2/15 4/6"

Oh, you're absolutely right. My mistake: ISO-646 is US-ASCII, and I
forgot that it formally changed to ISO-10646 in HTML 3.2. However,
ISO-10646 is mentioned in the prose of RFC 1866 several times, and
implementations are advised that numeric character references (beyond
Latin 1) should reference those code points. HTML 2 does formally use
Latin 1 (ISO-8859-1) for char refs, but these code points are a subset
of ISO-10646 anyway.
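
To illustrate the subset point, here is a minimal sketch in Python. It
assumes the standard library's html.unescape as a stand-in for a browser's
character reference handling; that function follows modern HTML parsing
rules, not HTML 2.0, and it treats references in the 128-159 range
specially, so the sketch avoids them. The helper names are made up for
this example.

```python
import html


def latin1_matches_unicode() -> bool:
    """Check that each Latin 1 byte value decodes to the same-numbered code point."""
    return all(ord(bytes([b]).decode("latin-1")) == b for b in range(256))


def resolve_numeric_ref(ref: str) -> str:
    """Resolve a decimal numeric character reference such as '&#233;'."""
    # Assumption: html.unescape stands in for a user agent's behaviour here.
    return html.unescape(ref)


if __name__ == "__main__":
    print(latin1_matches_unicode())                  # True
    print(hex(ord(resolve_numeric_ref("&#233;"))))   # 0xe9, inside Latin 1
    print(hex(ord(resolve_numeric_ref("&#8364;"))))  # 0x20ac, beyond Latin 1
```

A reference such as &#233; gives the same character whether it is read
as a Latin 1 position or as an ISO-10646 code point, while &#8364; only
makes sense under ISO-10646.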


--
Lachlan Hunt
http://lachy.id.au/

******************************************************
The discussion list for  http://webstandardsgroup.org/

See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list & getting help
******************************************************