David Starner <prosfil...@gmail.com> wrote:
> Apparently a tweet before that point is a string of 32-bit integers,
> including all those wonderful characters above U+10FFFF.
What is the position regarding the 32-bit code point space above U+10FFFF
please?
Does the Unicode Consortium and/or ISO or indeed anyone else make any claims
upon it?
Reading this thread, it occurs to me that if two localizable text items were
defined in the plane 0 Private Use Area, each with a visible glyph and with the
intention that they could be localized as a string of ordinary text characters,
either in the mind of the reader or by software in receiving browsers, then
that could lead to widespread use and maybe lead to the localizable text items
and maybe some others becoming encoded, at a later date in regular Unicode,
maybe in plane 7 for Unicode 7.0.
LOCALIZABLE TEXT "Good day."
LOCALIZABLE TEXT "Best regards,"
This would have the effect of greeting a person in his or her own native
language, which I feel is a pleasant thing to do.
If there is enthusiasm, with the Internationalization and Unicode 34 Conference
due next week and no official literary event this year, then maybe the Unicode
Consortium, with the assistance of members of this mailing list, could assemble
files of "Good day." and "Best regards," in many languages of the world, using
Unicode.
There could also be a spontaneous art event if people would like to suggest
designs for visible glyphs.
One of the biggest problems for all of this is just which Private Use Area code
points to suggest. Although the Unicode Standard rules for using the Private
Use Area allow great choice, for practical purpose to try to get something
going I feel that they need to be distinct from at least the following groups
of Private Use Area code point assignments.
U+F000 to U+F0FF due to clashing with Microsoft Symbol fonts and legacy
ligature glyphs for fi and fl.
Any Private Use Area code point used by telephone companies for emoji.
Any Private Use Area code point used in the Code2000 font.
As many as possible of the code points used by MUFI, the Medieval Unicode Font
Initiative, particularly those most commonly being used in fonts generally.
Any other Private Use Area code points that people posting to this thread might
feel could cause a clash that could cause practical problems over font usage.
So, choosing Private Use Area code points just to have a go at this is a puzzle
in itself, yet a good result would be useful so that hopefully many fonts would
gain the glyphs.
William Overington
14 October 2010