On 01/05/2022 13:10, Jonathan Kew wrote:
Hi Zdeněk,

Checking the Unicode character database[1], U+0587 is listed as having a *compatibility* decomposition to <0565,0582> (not 0587):

0587;ARMENIAN SMALL LIGATURE ECH YIWN;Ll;0;L;<compat> 0565 0582;;;;N;;;;;

Likewise, the SpecialCasing.txt file[2] that defines case mappings other than simple 1:1 substitutions shows the same decomposition for the uppercase form:

0587; 0587; 0535 0582; 0535 0552; # ARMENIAN SMALL LIGATURE ECH YIWN

So if I understand correctly, what \text_uppercase:n is doing is simply implementing what the Unicode standard defines.

If this isn't the appropriate behavior, at least for some locales, I believe that will need custom programming at some level, but I don't know enough about it to get into any details.

Indeed: we will add support for alternative casing for Arminian to \text_uppercase:nn shortly.

Joseph

Reply via email to