[
https://issues.apache.org/jira/browse/TIKA-772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joseph Vychtrle updated TIKA-772:
-
Attachment: html.zip
> media type detection fails for html documents, results in text/plain ins
[
https://issues.apache.org/jira/browse/TIKA-772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joseph Vychtrle updated TIKA-772:
-
Attachment: tika.png
I don't know then. Take a look at my results with tika v 0.10
[
https://issues.apache.org/jira/browse/TIKA-772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joseph Vychtrle updated TIKA-772:
-
Attachment: it.html
> media type detection fails for html documents, results in text/plain inst