-----BEGIN PGP SIGNED MESSAGE-----

[EMAIL PROTECTED] wrote:
[snip]
> If "sorting" the diacritical marks in NFD results in rearranging the two
> diacritical marks -- in this case, U+0041 U+0301 U+0302 -- then in terms of
> Vietnamese orthography, the NFD form may not really be a legitimate way of
> representing the Vietnamese letter.
>
> For example, U+1EAC LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND DOT BELOW is,
> in Vietnamese, a circumflexed A to which a tone mark (dot below) has been
> added. It is not a dotted-below A to which a circumflex has been added.
They are the same thing: an A with a circumflex above and a tone mark below.
The abstract value that a combining sequence represents is an unordered set
of sequences of marks, each sequence containing the marks from a given
combining class.

So I don't see the problem - the ordering of marks from different combining
classes is just an encoding artefact, with no semantic significance, and that
is what NFD/NFC implement when considered as an equivalence relation.

- -- 
David Hopwood <[EMAIL PROTECTED]>

Home page & PGP public key: http://www.users.zetnet.co.uk/hopwood/
RSA 2048-bit; fingerprint 71 8E A6 23 0E D3 4C E5 0F 69 8C D4 FA 66 15 01
Nothing in this message is intended to be legally binding. If I revoke a
public key but refuse to specify why, it is because the private key has been
seized under the Regulation of Investigatory Powers Act; see www.fipr.org/rip

-----BEGIN PGP SIGNATURE-----
Version: 2.6.3i
Charset: noconv

iQEVAwUBPFZGzzkCAxeYt5gVAQHTEAf+NM3T6UFF3040DDcIiPq8Lki8mH/50hHH
nN2WeoWUGRgUHhiVI/fOG2jxqdkVIabWiqcRvhs/ZUzLeSl3DraDe9fHqS/Bw7Pq
StOAcNEMl2Pm8l0UdI0NFU9jH1TDeEXaBKOiDm6ndcDnenJcZPLye3DUU3zIs6i9
abc/77niF/MuG6SYYei6k01owH87yWJAlOIXtBYH+GuRgfxxLaTiljsE6ZYXeJoy
ZVUyK8HCks/dXL73/MymOZE9NSyUG4mp0RyS21twutXpajeO/v6nACusXd7E+WQj
TPdz2TKhTA9yVj1InCGXn+yBa/bFtfsJHLBzUNvUledW36YE69yvmg==
=ppFL
-----END PGP SIGNATURE-----
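The point about combining classes can be checked with a short sketch. This is
only an illustration, not part of the original message: it assumes Python's
standard unicodedata module, and the helper name canonically_equivalent is
made up for the example. It shows that the relative order of the circumflex
(combining class 230) and the dot below (combining class 220) does not
survive normalization, while the order of two marks in the same class does.

```python
import unicodedata

CIRCUMFLEX = "\u0302"  # COMBINING CIRCUMFLEX ACCENT, combining class 230
DOT_BELOW = "\u0323"   # COMBINING DOT BELOW, combining class 220
ACUTE = "\u0301"       # COMBINING ACUTE ACCENT, combining class 230


def nfd(s: str) -> str:
    """Canonical decomposition followed by canonical reordering."""
    return unicodedata.normalize("NFD", s)


def nfc(s: str) -> str:
    """NFD followed by canonical composition."""
    return unicodedata.normalize("NFC", s)


def canonically_equivalent(x: str, y: str) -> bool:
    # Hypothetical helper: two strings are treated as equivalent
    # when their NFD forms are identical.
    return nfd(x) == nfd(y)


# Marks from different combining classes: typing order is not significant.
circumflex_first = "A" + CIRCUMFLEX + DOT_BELOW
dot_first = "A" + DOT_BELOW + CIRCUMFLEX

# Marks from the same combining class: typing order is preserved.
acute_first = "A" + ACUTE + CIRCUMFLEX
circumflex_then_acute = "A" + CIRCUMFLEX + ACUTE

if __name__ == "__main__":
    print(canonically_equivalent(circumflex_first, dot_first))
    print([f"U+{ord(c):04X}" for c in nfd("\u1eac")])
    print(canonically_equivalent(acute_first, circumflex_then_acute))
```

Both orderings of circumflex and dot below normalize to the same string, and
both compose to U+1EAC under NFC, so neither is "the" Vietnamese letter more
than the other. The acute/circumflex pair, by contrast, stays distinct because
both marks share a combining class.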