On 25/09/2018 13:19, Nick Wellnhofer wrote:
libxml2 serializes documents without an encoding declaration differently than
documents with an explicit UTF-8 encoding:
It seems that this was partially changed in 2005 with the following commit:
https://gitlab.gnome.org/GNOME/libxml2/commit/64354ea7d6b8e0d95f3f9bcfdc98bddd065b65fc
But this change only applies to text nodes, not attribute content. It also
only applies when serializing with xmlNodeDumpOutput or xmlNodeDump, not when
using the xmlSave API (which xmllint uses).
The whole situation is a mess. I'd love to change the code so that non-ASCII
chars are always encoded as UTF-8, but I'm scared to break things.
Nick
_______________________________________________
xml mailing list, project page http://xmlsoft.org/
xml@gnome.org
https://mail.gnome.org/mailman/listinfo/xml