Feel free to provide a PR on GitHub where the unit test must fail if main changes are not applied. You can also provide a PR that only contains a unit test.
Gary On Tue, Oct 3, 2023, 4:13 AM Laurence Gonsalves <laure...@gonsalv.es> wrote: > Hello, > > It looks like XmlStreamReader is not correctly handling several encodings > in Commons IO 2.14.0 that previously worked in version 2.13.0. > > Here's a self-contained snippet (Kotlin) that demonstrates the problem: > > val xml = "<?xml version='1.0' encoding='437'?><root>Ç</root>" > > val stream = xml.byteInputStream(Charset.forName("437")) > > val reader = XmlStreamReader.builder() > .setInputStream(stream) > .setLenient(false) > .get() > > reader.readText() shouldBe xml > > With 2.13.0 this code works fine, but in 2.14.0 the "Ç" (C-cedilla) becomes > a "�" (Unicode replacement character). > > We're seeing similar issues with all of the other code page encodings we've > tried (850, 852, 855, 857, 860, 861, 862, 863, 865, and 866). >