On Sun Jul 05 17:56:45 2015, raiph wrote:
> What I did:
> 
> > say 'ffl'.uc; # say the uppercased version of an ffl ligature
> 
> What I got with camelia (rakudo-moar 01edd3):
> 
> ffl
> 
> "What I expected":
> 
> FFL
> 
> ----
> 
> "What I expected" is based on
> http://unicode.org/Public/UNIDATA/SpecialCasing.txt which defines a
> bunch of special casing rules:
> 
> "The data in this file, combined with the simple case mappings in
> UnicodeData.txt, defines the full case mappings Lowercase_Mapping
> (lc), Titlecase_Mapping (tc), and Uppercase_Mapping (uc)."
> 
> The entry for ffl approximates to:
> 
> <code>;  <lower>;  <title>;         <upper>;         # <comment>
> FB04;    FB04;     0046 0066 006C;  0046 0046 004C;  # LATIN SMALL
> LIGATURE FFL
> 
> (Note difference between title case and upper case.)
> 
> ----
> 
> A quick search of MoarVM's source code for SpecialCasing reveals this
> comment:
> 
> # XXX SpecialCasing.txt # haven't decided how to do it
> 
> (in the ucd2c.pl tool)
> 
> I'm surmising that Rakudo (MoarVM) does none of this special casing
> yet.
> 
> ----
> 
We handle SpecialCasing in MoarVM now. I've added and unfudged various 
spectests covering that. The Greek final sigma is also properly handled, the 
various cases well tested.

The Turkish i is not something a generic Unicode implementation should do; it's 
marked with a regional condition in SpecialCasing.txt. Handling of those will 
be left to module space for the time being.

Reply via email to