I'm not 100% sure, since this module is a parser and not a serializer, but it appears HTML::HTML5::Parser is just building a DOM, and the serialization is then done by XML::LibXML. Therefore, it seems likely the bug is indeed in HTML::HTML5::Parser.
The upstream bug tracker is at https://rt.cpan.org/Public/Dist/Display.html?Name=HTML-HTML5-Parser but unfortunately, it doesn't see a lot of attention these days. Nevertheless, please submit upstream. ** Changed in: libhtml-html5-parser-perl (Ubuntu) Status: New => Confirmed -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1706274 Title: html2xhtml produces invald XML for MS Office HTML output To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/libhtml-html5-parser-perl/+bug/1706274/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs