On Tue, 5 Dec 2023 10:35:05 GMT, Magnus Ihse Bursie <i...@openjdk.org> wrote:
> We're currently setting LC_ALL=C. Not all tools will default to utf-8 as > their encoding of choice when they see this locale, but use an arbitrarily > encoding, which might not properly handle all UTF-8 characters. Since in > practice, all our encoding is utf8, we should tell our tools this as well. > > This will at least have effect on how Java treats path names including > unicode characters. I'll make such a run. But I discovered now that C.UTF-8 is not available on (some?) macs, so I guess I'll need to add a configure check to determine if we should use C or C.UTF-8. The latter is the norm on Linux nowadays (or, to be precise, locales should in general be specified as `<locale>.UTF-8`). ------------- PR Comment: https://git.openjdk.org/jdk/pull/16971#issuecomment-1842772271