On 01/05/2022 13:10, Jonathan Kew wrote:
Hi Zdeněk,
Checking the Unicode character database[1], U+0587 is listed as having a
*compatibility* decomposition to <0565,0582> (not 0587):
0587;ARMENIAN SMALL LIGATURE ECH YIWN;Ll;0;L;<compat> 0565 0582;;;;N;;;;;
Likewise, the SpecialCasing.txt file[2] that defines case mappings other
than simple 1:1 substitutions shows the same decomposition for the
uppercase form:
0587; 0587; 0535 0582; 0535 0552; # ARMENIAN SMALL LIGATURE ECH YIWN
So if I understand correctly, what \text_uppercase:n is doing is simply
implementing what the Unicode standard defines.
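For anyone who wants to check the two database entries quoted above without opening the files, here is a minimal sketch in Python. It relies only on the standard `unicodedata` module and on `str.upper()`, which applies the full (SpecialCasing) mappings. The helper `code_points` is a name made up for this illustration, and the results depend on the Unicode version bundled with your Python, so treat it as a cross-check and not as a statement about how \text_uppercase:n is implemented.

```python
import unicodedata

LIGATURE = "\u0587"  # ARMENIAN SMALL LIGATURE ECH YIWN


def code_points(text):
    """Hypothetical helper: format a string as hex code points, e.g. '0535 0552'."""
    return " ".join(f"{ord(ch):04X}" for ch in text)


# Decomposition field as recorded in UnicodeData.txt (tag plus code points).
decomposition = unicodedata.decomposition(LIGATURE)

# Compatibility decomposition applied to the character itself.
nfkd = code_points(unicodedata.normalize("NFKD", LIGATURE))

# Full uppercase mapping; str.upper() may return more than one character.
upper = code_points(LIGATURE.upper())

print("decomposition:", decomposition)
print("NFKD:         ", nfkd)
print("uppercase:    ", upper)
```

With a reasonably recent Python this should report `<compat> 0565 0582` for the decomposition and `0535 0552` for the uppercase form, matching the UnicodeData.txt and SpecialCasing.txt lines quoted above.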
If this isn't the appropriate behavior, at least for some locales, I
believe that will need custom programming at some level, but I don't
know enough about it to get into any details.
Indeed: we will add support for alternative casing for Armenian to
\text_uppercase:nn shortly.
Joseph