On Tue, 5 Dec 2023 10:35:05 GMT, Magnus Ihse Bursie <i...@openjdk.org> wrote:

> We're currently setting LC_ALL=C. Not all tools will default to utf-8 as 
> their encoding of choice when they see this locale, but use an arbitrarily 
> encoding, which might not properly handle all UTF-8 characters. Since in 
> practice, all our encoding is utf8, we should tell our tools this as well.
> 
> This will at least have effect on how Java treats path names including 
> unicode characters.

I'll make such a run. But I discovered now that C.UTF-8 is not available on 
(some?) macs, so I guess I'll need to add a configure check to determine if we 
should use C or C.UTF-8. The latter is the norm on Linux nowadays (or, to be 
precise, locales should in general be specified as `<locale>.UTF-8`).

-------------

PR Comment: https://git.openjdk.org/jdk/pull/16971#issuecomment-1842772271

Reply via email to