Re: Unicode 11 Georgian uppercase vs. fonts

Asmus Freytag (c) via Unicode Fri, 27 Jul 2018 21:52:27 -0700

If that's the case then we shouldn't have this discussion.

Which got started by the ICU folks observing that if they implementedthe changes in property for library functions it would lead tounintelligible text (at least in the short run -- because there was noplan to get font support ready in time and deployed) and text that wasformatted in perhaps unintended ways (most likely permanently -- becausethere was no plan to solve the issues around usage differences).


What you describe as "plan" seems to have been just empty words.

A real plan would have consisted of documentation suggesting how to rollout library update, whether to change/augment CSS styling keywords, whattypes of locale adaptations of case transforms should be implemented,how to get OSs to deliver fonts to people, etc., etc..

If such a plan had existed, and been implemented, then we would not havean e-mail thread started with "OMG we have a crisis".


A./

On 7/27/2018 6:45 PM, Peter Constable wrote:

Just an observation on these issues: When the Mtavruli proposal wasfirst presented to UTC, several UTC members voiced strong reservationbecause of the kind of issues mentioned for case mapping, and inparticular on database indexing and querying. Several months later,various UTC members participated in a teleconference withrepresentation from Georgian institutions, including IT people fromBank of Georgia and TBC Bank. During that meeting, the representativesof the Georgian enterprises (i) demonstrated an understanding of thoseissues and the implications, (ii) gave an indication of support fromthose enterprises and a commitment to update their applications as maybe required, and (iii) gave indication of intent to develop a plan ofaction for preparing their institutions for this change as well ascommunicating that within Georgian industry and society. It was onlyafter that did UTC feel it was viable to proceed with encodingMtavruli characters.
Peter
*From:*Unicode <[email protected]> *On Behalf Of *AsmusFreytag via Unicode
*Sent:* Friday, July 27, 2018 7:01 AM
*To:* [email protected]
*Subject:* Re: Unicode 11 Georgian uppercase vs. fonts

On 7/27/2018 3:42 AM, Michael Everson via Unicode wrote:

    Yes and it explains clearly that “effectively caseless Georgian” is 
incorrect. Georgian has case. Georgian uses case differently from other 
scripts. This is an orthographic distinction, not a structural one. In fact as 
it is also stated in the proposal, there are 19th-century texts which do 
titlecase. It’s just that that orthography is no longer in use and that 
behaviour no longer desirable.

"Georgian uses case differently from other scripts"
That's one of the key issues here for developers (and users) oflibraries. Because it means that any implicit assumptions about theapplicability of a certain case-transform is now broken.
This goes beyond whether fonts are actually installed now or at theend of some transition period, or ever: if functions like ToUpper,which used to have no effect on Georgian before, suddenly do - in waysthat the users of the script do not expect, then your application isbroken, from one day to the next.
The current situation prior to the change is perhaps bestcharacterized by saying that there was support for some localedifferences in the way certain characters were mapped, but not inwhether or not to do a given mapping at all.
If, as has been suggested, the use of case in Georgian is more similarto that of smallcaps in other scripts, then, instead of ToUpper doinga case transformation for Georgian, what would be need is somethinglike a "ToSmallCaps" function (better name here, because the Georgianletters aren't actually "small caps").
That way, the existing "ToUpper" could retain its implicit semantic of"uppercase transformation in those scripts where such transformationsare used in a common way".
This would solve 1/2 of the problem, which is to prevent uppercasingwhere users of Georgian do not expect it. However, it does not work inplain text for the other scripts, because there, small caps are notencoded, so there's no plain-text solution.
To get back to Markus' original question on how to handle this forICU: it seems more and more that Georgian should be exempted fromstandard library functions and that a new function needs to be addedthat just transforms Georgian and leaves all other scripts alone (orone that takes a language/local parameter).
A./

Re: Unicode 11 Georgian uppercase vs. fonts

Reply via email to