On Mon, 5 Feb 2024 10:38:26 GMT, Magnus Ihse Bursie <i...@openjdk.org> wrote:
>> We're currently setting LC_ALL=C. Not all tools will default to utf-8 as >> their encoding of choice when they see this locale, but use an arbitrarily >> encoding, which might not properly handle all UTF-8 characters. Since in >> practice, all our encoding is utf8, we should tell our tools this as well. >> >> This will at least have effect on how Java treats path names including >> unicode characters. > > Magnus Ihse Bursie has updated the pull request incrementally with two > additional commits since the last revision: > > - Update copyright year > - check for utf-8 first Explicitly adding java/nio/charset/StandardCharsets and slightly growing the default set seems quite reasonable. IME using this explicitly seems very common in real applications. ------------- PR Comment: https://git.openjdk.org/jdk/pull/16971#issuecomment-1926913952