On Sun Jul 05 17:56:45 2015, raiph wrote:
> What I did:
>
> say 'ﬄ'.uc; # say the uppercased version of an ffl ligature
>
> What I got with camelia (rakudo-moar 01edd3):
>
> ﬄ
>
> "What I expected":
>
> FFL
>
> ----
>
> "What I expected" is based on
> http://unicode.org/Public/UNIDATA/SpecialCasing.txt which defines a
> bunch of special casing rules:
>
> "The data in this file, combined with the simple case mappings in
> UnicodeData.txt, defines the full case mappings Lowercase_Mapping
> (lc), Titlecase_Mapping (tc), and Uppercase_Mapping (uc)."
>
> The entry for ffl approximates to:
>
> <code>; <lower>; <title>; <upper>; # <comment>
> FB04; FB04; 0046 0066 006C; 0046 0046 004C; # LATIN SMALL LIGATURE FFL
>
> (Note difference between title case and upper case.)
>
> ----
>
> A quick search of MoarVM's source code for SpecialCasing reveals this
> comment:
>
> # XXX SpecialCasing.txt # haven't decided how to do it
>
> (in the ucd2c.pl tool)
>
> I'm surmising that Rakudo (MoarVM) does none of this special casing
> yet.
>
> ----
>

We handle SpecialCasing in MoarVM now. I've added and unfudged various spectests covering that. The Greek final sigma is also handled properly, and its various cases are well tested.
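As a quick way to check the behaviour described above, the following is a minimal sketch, assuming a Rakudo built on a MoarVM that includes the SpecialCasing support. The sample strings are illustrative and are not taken from the spectests.

```raku
# U+FB04 LATIN SMALL LIGATURE FFL, written as an escape so the
# character survives copy and paste.
my $ffl = "\x[FB04]";

say $ffl.uc;    # full uppercase mapping 0046 0046 004C, i.e. FFL
say $ffl.tc;    # full titlecase mapping 0046 0066 006C, i.e. Ffl
say $ffl.lc;    # the lowercase mapping is the ligature itself

# Greek final sigma: a capital sigma that ends a word lowercases to
# U+03C2 (final sigma) rather than U+03C3.
say "ΣΑΣ".lc;   # σας
```

The titlecase and uppercase results differ for the ligature, which matches the note in the original report about the FB04 entry.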
The Turkish i mapping is not something a generic Unicode implementation should apply by default; it's marked with a language-specific condition in SpecialCasing.txt. Handling of such conditional mappings will be left to module space for the time being.
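To illustrate what module-space handling could look like, here is a rough sketch. The subs `uc-tr` and `lc-tr` are hypothetical names invented for this example; they are not part of Rakudo or of any published module, and they cover only the dotted/dotless i pairs, not the combining-dot contexts that SpecialCasing.txt also lists.

```raku
# Hypothetical helpers, not part of Rakudo or any existing module.
# They rewrite the Turkish/Azeri i letters first, then let the
# generic, language-neutral mapping do the rest.

sub uc-tr(Str $s --> Str) {
    # i -> U+0130 (capital I with dot above); dotless U+0131 already
    # uppercases to a plain I through the generic mapping.
    $s.subst('i', 'İ', :g).uc
}

sub lc-tr(Str $s --> Str) {
    # I -> U+0131 (dotless i), and U+0130 -> plain i so the generic
    # mapping does not produce i followed by a combining dot above.
    $s.subst('I', 'ı', :g).subst('İ', 'i', :g).lc
}

say uc-tr('istanbul');   # İSTANBUL
say 'istanbul'.uc;       # ISTANBUL (generic mapping)
say lc-tr('ISPARTA');    # ısparta
```

Keeping this outside the core means the default `uc` and `lc` stay independent of any language setting, which is the behaviour the reply argues for.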