Attribute on html tag not represented in XHTML
-----------------------------------------------
Key: TIKA-379
URL: https://issues.apache.org/jira/browse/TIKA-379
Project: Tika
Issue Type: Bug
Components: parser
Reporter: Julien Nioche
The following HTML document :
<html lang="fi"><head>document 1 title</head><body>jotain suomeksi</body></html>
is rendered as the following xhtml by Tika :
<?xml version="1.0" encoding="UTF-8"?><html
xmlns="http://www.w3.org/1999/xhtml"><head><title/></head><body>document 1
titlejotain suomeksi</body></html>
with the lang attribute getting lost.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.