well it was an approximation (due to the previously mentionned lack of
vocabulary)
ISO 2022 (as well as SHIFT-JIS and other japaneses encoding of the same
type) use indeed "artificial" 8bit characters.
The 0-127 range is always almost compatible with ASCII and there is 2
escaping character which work like double quotes. Inside quotes, character
are multibyte (indeed it's impossible to store so many kanjis into only 128
slots)

But this option raises more issues than it brings solutions... even if it is
still widely used in japan (ISO 2022 is still their default encoding for
e-mailings)

Yota


On 1/25/07, Werner LEMBERG <[EMAIL PROTECTED]> wrote:


> After having defined a 128 character table (0 -127 on 8 bits, well
> one zero + 7bits) covering only the English characters and some
> signs, called the ASCII table; has been defined many extended tables
> using the 128 - 255 range to store some "regional" character. There
> are around 10 extended tables to fit with either the french, the
> danish, the Greek specificities (but not all at the same time) :
> http://en.wikipedia.org/wiki/Category:ISO_8859

Actually, there are much more.  If I'm not mistaken, the ISO 2022
registry has designed IDs to over 100 different 8bit character sets!

> Japaneses and Chineses had some strange way to store their 36 000
> ideograms and there it started to be a mess.

This isn't true in general.  For handling those languages separately,
16bit character sets work just fine.  Emacs demonstrates that, within
the ISO 2022 framework, multilingual work can be done quite
efficiently.

BTW, the value `36000' is not correct: The CNS 11643 character set
defines around 55000 Chinese characters, and the defunct CCCII
character set covered around 70000 characters (I have the books at
home -- it's almost a half meter on my bookshelf).


    Werner

_______________________________________________
lilypond-user mailing list
lilypond-user@gnu.org
http://lists.gnu.org/mailman/listinfo/lilypond-user

Reply via email to