HtmlParser - use encoding hint only if charset is supported
-----------------------------------------------------------
Key: TIKA-319
URL: https://issues.apache.org/jira/browse/TIKA-319
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 0.4
Reporter: Piotr B.
Encoding hint should be considered only if that encoding is supported.
Diff of my fix:
--- HtmlParser.java (wersja 835302)
+++ HtmlParser.java (kopia robocza)
@@ -46,7 +46,7 @@
// Prepare the input source using the encoding hint if available
InputSource source = new InputSource(stream);
String encoding = metadata.get(Metadata.CONTENT_ENCODING);
- if (encoding != null) {
+ if (encoding != null && Charset.isSupported(encoding)) {
source.setEncoding(encoding);
}
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.