What should or should not be encoded in Unicode? (from Re: Egyptian Hieroglyph Man with a Laptop)

[email protected] via Unicode Thu, 13 Feb 2020 07:45:13 -0800

Hans Åberg >>> From the point of view of Unicode, it is simpler: If thecharacter is in use or have had use, it should be included somehow.

Shawn Steele >> That bar, to me, seems too low. Many things are onlyused briefly or in a private context that doesn';t really requireencoding.


Hans Åberg > That is a private use area for more special use.

I have used the Private Use Area, quite a lot over many years.

I have a licence for a fontmaking program, FontCreator. A good featureof the Windows operating system is that all installed fonts can be usedin most installed programs. Private Use Area code points are officialUnicode code points. These three factors together allow me to design andproduce TrueType fonts for new symbols each encoded at a Private UseArea code point (a different code point for each such novel symbol),install the fonts, and use them in various programs, including a desktoppublishing program and thereby make PDF (Portable Document Format)documents that include both ordinary text and the novel symbols. ThesePDF documents are then suitable for placing on the web and for LegalDeposit with The British Library.

Yet a Private Use Area encoding at a particular code point is notunique. Thus, except with care amongst people who are aware of theparticular encoding, there is no interoperability, such as with regularUnicode encoded characters.

However faced with a need for interoperability for my research project,I have found a solution making use of the Glyph Substitution capabilityof an OpenType font.

The solution is to invent my own encoding space. This sits on top ofUnicode, could be (perhaps?) called markup, but it works!

I am hoping that at some future time the results of my research willbecome encoded as an International Standard, and that my encoding spacewill then after that become integrated into Unicode, thus achievingfully standardized unique interoperable encoding as part of Unicode.Quite a dream, but the way to achieve such a fully standardized uniqueinteroperable encoding as part of Unicode is from a technological pointof view, quite straightforward. There are details of this in theAccumulated Feedback on Public Review Issue #408.


https://www.unicode.org/review/pri408/

Yet having my encoding space in this manner is just something that Ihave done on my own initiative. Anybody can have his or her own encodingspace if he or she so chooses. With a little care and consideration forothers these encodings need not clash one with another and all couldeven coexist in one document.

Having my own encoding space has enabled me to make progress with myresearch project.


William Overington

Thursday 13 February 2020

What should or should not be encoded in Unicode? (from Re: Egyptian Hieroglyph Man with a Laptop)

Reply via email to