On 12/12/23 1:39 PM, Jeff Davis wrote: > On Sun, 2023-12-10 at 10:39 +1300, Thomas Munro wrote: >> Unless you also >> implement built-in case mapping, you'd still have to call libc or ICU >> for that, right? > > We can do built-in case mapping, see: > > https://postgr.es/m/ff4c2f2f9c8fc7ca27c1c24ae37ecaeaeaff6b53.ca...@j-davis.com > >> It seems a bit strange to use different systems for >> classification and mapping. If you do implement mapping too, you >> have >> to decide if you believe it is language-dependent or not, I think? > > A complete solution would need to do the language-dependent case > mapping. But that seems to only be 3 locales ("az", "lt", and "tr"), > and only a handful of mapping changes, so we can handle that with the > builtin provider as well.
This thread has me second-guessing the reply I just sent on the other thread. Is someone able to test out upper & lower functions on U+A7BA ... U+A7BF across a few libs/versions? Theoretically the upper/lower behavior should change in ICU between Ubuntu 18.04 LTS and Ubuntu 20.04 LTS (specifically in ICU 64 / Unicode 12). And I have no idea if or when glibc might have picked up the new unicode characters. -Jeremy -- http://about.me/jeremy_schneider