Markus Scherer wrote:

> some of the old ones seem to be pre-unicode 1.1. should they not be updated?

No, they are 2.0.
 
> > 1) Unicode code units are 16 bits long; deal with it.

C1 says "A process shall interpret Unicode code values as 16-bit quantities."
"Code unit" is defined in definition D5 as a synonym for "code value".
If this needs updating, it's the Unicode folks who need to update it, not me;
I think it's still all right.

> > 4) Loose surrogates don't mean jack.
> 
> this needs some explanation - they are illegal sequences, but should be passed 
>through for interoperability (i think that is what the book says).

I think that behavior is "MAY" rather than "SHOULD"; the actual verb used is
"does not preclude".  Anyway, this does not mean that loose surrogates
*mean* anything, only that error recovery of some sort is not forbidden.

-- 

Schlingt dreifach einen Kreis um dies! || John Cowan <[EMAIL PROTECTED]>
Schliesst euer Aug vor heiliger Schau,  || http://www.reutershealth.com
Denn er genoss vom Honig-Tau,           || http://www.ccil.org/~cowan
Und trank die Milch vom Paradies.            -- Coleridge (tr. Politzer)

Reply via email to