Marco Cimarosti scripsit:

> But, if there is no precomposed character for "q with tilde", then the
> combining tilde *must* be maintained in all normalization forms. 

Correct.

> Why? Isn't that what W3C asked?

No.  The W3C CharMod wants receivers to check normalization and
reject unnormalized documents, *not* to normalize input.  Silently
normalizing input can conceal the existence of a security-related
spoof that is NFC-equivalent to a genuine document.
It is essentially the same reason that broken HTML or broken UTF-8
should not be silently repaired.

> BTW, are you sure that it is NFKC? My understanding is that it was NFC +
> some extra passages. 

It is NFC, with the additional proviso that n11n must be done even
if characters appear as character references (&#xnnnn;) rather than
actual characters.

-- 
John Cowan           http://www.ccil.org/~cowan              [EMAIL PROTECTED]
To say that Bilbo's breath was taken away is no description at all.  There
are no words left to express his staggerment, since Men changed the language
that they learned of elves in the days when all the world was wonderful.
        --_The Hobbit_

Reply via email to