https://bugs.documentfoundation.org/show_bug.cgi?id=119944

--- Comment #2 from Jens Troeger <jens.troe...@light-speed.de> ---
I’ve updated the HTML file: the new one is generated from the w3 reference
webpage and should include all HTML entities in their text, hex, and decimal
encodings.

The Python script I used to generate that file is included as a comment in that
same file. Note, however, that Python’s html5 entity lookup is also incomplete,
so some entities come out as a "???" string rather than the proper text.
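
For illustration, a script along these lines could produce the three encodings
per entity. This is only a minimal sketch and not the script from the
attachment: the function name entity_rows and the sample names are
hypothetical, and it assumes the lookup goes through the html5 dict in Python's
html.entities module, with "???" as the placeholder for names that dict does
not know.

```python
from html.entities import html5  # maps names such as "amp;" to their text


def entity_rows(names):
    """Return (name, text form, decimal form, hex form) for each entity name.

    Names missing from Python's html5 table get "???" as the text form
    and empty numeric forms.
    """
    rows = []
    for name in names:
        text = html5.get(name + ";")
        if text is None:
            # Assumption: unknown names are flagged with a "???" placeholder.
            rows.append((name, "???", "", ""))
            continue
        # Some entities expand to more than one code point, so encode each one.
        dec = "".join(f"&#{ord(ch)};" for ch in text)
        hexa = "".join(f"&#x{ord(ch):X};" for ch in text)
        rows.append((name, f"&{name};", dec, hexa))
    return rows


if __name__ == "__main__":
    for row in entity_rows(["amp", "nbsp", "nosuchentity"]):
        print("\t".join(row))
```

Feeding it the entity names scraped from the reference page would give one row
per entity, which can then be wrapped in whatever HTML the test file needs.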

I poked around a bit here:

https://github.com/LibreOffice/core/blob/master/svtools/source/svhtml/parhtml.cxx#L394-L622

but it seems that the entity-aware string object is what messes things up. The
entity parser itself looks OK to me.
