Asmus Freytag wrote,

> Nevertheless, I think the use of devices like combining underlines
> and superscript letters in plain text are best avoided.

That's probably true according to the spirit of the underlying encoding principles.  But hasn't that genie already left the bottle?

People write their names as they please.  With the entire repertoire of Unicode from which to choose, people are coming up with some amazingly unorthodox ways to "spell" their screen names.  Here's six screen names copy/pasted from an atypical Twitter account's comments sections:

Joё дмёгicди‏

I⃟MAGI⃟NER‏

Եօʍ‏

IXOYE444 (←This one included character U+200F, I removed it.)

Qęy ✪ eT ✪ Dog ► VOTES‏

 🌊 ❄️ 🌵Kἶოἶղმz 🌵 ❄️  🌊‏ (←Note the decorative emoji.)

People are mixing scripts and so forth in order to create distinctive screen names.  Those screen names are out there in the wild and are part of our stored data which future historians are welcome to scratch their heads over.

IIRC, around the time that the math alphanumerics were added to Plane One, Michael Everson noted that once characters are encoded people will use them as they see fit.  In this present thread, Michael Everson wrote:

> And I would not encode it as Mr͇, firstly because it
> would never render properly and you might as well
> encode it as Mr. or M:r, and second because in the
> IPA at least that character indicates an alveolar
> realization in disordered speech. (Of course it
> could be used for anything.)

Yes, it could be used for anything requiring combining-two-lines-below.  At some point, if enough people were doing it, it would morph from a kludge of hacking alveolar whatevers into an accepted convention.  (Not that I am pushing this approach, it's only one suggestion out of many possibilities.  I'm in favor of direct encoding.)  I would not encode the abbreviation as either "Mr." or "M:r" because neither of those text strings appear in the original manuscript.

FAICT, "Եօʍ‏" is pronounced just like "Tom", but it ain't spelled the same.  Likewise for "McCoy" and "M=ͨCoy".

It strikes me as perverse if "Եօʍ‏" can spell his name as he pleases using the UCS but "M=ͨCoy" mustn't.  Especially since names like "M=ͨCoy" and abbreviations such as "M=ͬ" could be typed on old-style mechanical typewriters.  Quintessential plain-text, that.

Reply via email to