Was: How can I avoid unicode and use Latin1?
On 5 Sep 2005, at 08:58, [EMAIL PROTECTED] wrote:
Thank you. I didn't know unicode was broader than UTF-8.
Formally, one assigns to each abstract character a non-negative
integer, called in Unicode lingo a "code point". In order to get this
stuff into a computer, one needs an integer-to-binary translation
function, and that is what UTF-8 is. Different translation functions
provide different encodings of the same code points.
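A minimal sketch of that separation in Python: the same code point,
encoded by two different translation functions into different bytes.

```python
# A code point is just a non-negative integer; an encoding such as
# UTF-8 maps that integer to a concrete byte sequence.
cp = 0x00E9                  # code point U+00E9, LATIN SMALL LETTER E WITH ACUTE
ch = chr(cp)                 # the abstract character "é"

print(ch.encode("utf-8"))    # b'\xc3\xa9'  -- two bytes in UTF-8
print(ch.encode("latin-1"))  # b'\xe9'      -- one byte in Latin-1
```

Same code point, two encodings; which bytes you get depends entirely
on the translation function you pick.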
The 3-byte value
10FFFF (rather than FFFFFF) seems like a rather strange upper limit,
When UTF-16 was designed, this separation between code points and
encodings was not thought through clearly, so the upper limit seemed
necessary: 10FFFF is simply the highest value a UTF-16 surrogate pair
can express. The limitation is, though, imposed by Unicode Inc.; the
original ISO UTF-8 does not have it (so there are two differing
versions of UTF-8 in play). Also, the number of available code points
in the fundamental Unicode Inc. character range is large enough that
it will not fill up for hundreds of years at the current rate of
character addition. Only if people are allowed to massively register
private characters might it break.
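The 10FFFF limit can be derived directly from the surrogate mechanism;
a quick check of the arithmetic:

```python
# UTF-16 reaches beyond the 16-bit range U+0000..U+FFFF with surrogate
# pairs: a high surrogate (0xD800-0xDBFF) and a low surrogate
# (0xDC00-0xDFFF) each carry 10 bits, giving 2**20 extra code points
# on top of the base offset 0x10000.
highest = 0x10000 + (2**10 * 2**10) - 1
print(hex(highest))  # 0x10ffff
```

So the "strange" upper limit is not arbitrary at all: it is exactly
what UTF-16 can address, and Unicode caps the code space there so that
every code point remains representable in UTF-16.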
but
that only points up the fact that I'm going to have to learn about
unicode
once I get through my current arranging binge.
You can read about UTF-8 at
http://www.cl.cam.ac.uk/~mgk25/unicode.html
Today, Windows uses Unicode exclusively -- even in North America. You
won't have big success with latin1 files.
I routinely switch files between Latin1 text and MS-Word docs with no
problem whatsoever. ... Microsoft's unicode claims are a marketing
ploy; Latin1 still
rules.
Editors often have a preference setting where the default encoding
can be chosen, and the output encoding can also be chosen
automatically. For example, the mailer I use scans through the email
and chooses a suitable encoding: ASCII, ISO-Latin-1, or UTF-8, for
example.
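The scan-and-choose behaviour can be sketched like this; the function
name `pick_encoding` is hypothetical, not any real mailer's API, but
the logic (try the simplest encoding first, fall back to UTF-8) is the
same idea.

```python
def pick_encoding(text: str) -> str:
    """Return the simplest encoding that can represent `text`.

    Hypothetical helper illustrating how a mailer might pick an
    output encoding; not taken from any actual mail program.
    """
    for enc in ("ascii", "latin-1"):
        try:
            text.encode(enc)   # succeeds only if every char fits
            return enc
        except UnicodeEncodeError:
            pass
    return "utf-8"             # UTF-8 can encode any code point

print(pick_encoding("hello"))        # ascii
print(pick_encoding("h\u00e9llo"))   # latin-1
print(pick_encoding("\u2211 h\u00e9llo"))  # utf-8
```

Plain English text thus goes out as ASCII, Western European text as
ISO-Latin-1, and anything beyond that as UTF-8.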
Hans Aberg
_______________________________________________
lilypond-user mailing list
lilypond-user@gnu.org
http://lists.gnu.org/mailman/listinfo/lilypond-user