Kent Karlsson <kentk at cs dot chalmers dot se> wrote:

>> Believe it or not, the IJ and ij digraphs *were* included for
>> compatibility with an 8-bit legacy character set (ISO 6937).
>
> 6937 is a multibyte encoding (one or two bytes per character).
> There are no combining characters at all in 6937, even though
> there is a common misunderstanding that there are, since the
> lead bytes are (almost) systematically assigned.

It's still an 8-bit character set.  Characters are defined in terms of
8-bit code units; some use one, others use two.  This is just like the
double-byte character sets used for CJK.

-Doug Ewell
 Fullerton, California
 http://users.adelphia.net/~dewell/


Reply via email to