Kent Karlsson <kentk at cs dot chalmers dot se> wrote: >> Believe it or not, the IJ and ij digraphs *were* included for >> compatibility with an 8-bit legacy character set (ISO 6937). > > 6937 is a multibyte encoding (one or two bytes per character). > There are no combining characters at all in 6937, even though > there is a common misunderstanding that there are, since the > lead bytes are (almost) systematically assigned.
It's still an 8-bit character set. Characters are defined in terms of 8-bit code units; some use one, others use two. This is just like the double-byte character sets used for CJK. -Doug Ewell Fullerton, California http://users.adelphia.net/~dewell/