On Thu, 2015-01-29 at 08:02 +0000, tomm...@apache.org wrote: > + String modelString = IOUtils.toString(nGramModelStream); > + String outputString = > out.toString(Charset.defaultCharset().name());
The XML serialization writes it in UTF-8. Shouldn't you use UTF-8 for this test too instead of the default encoding? Jörn