Calls to Charset.isSupported() will throw exceptions for invalid charset names
------------------------------------------------------------------------------
Key: TIKA-359
URL: https://issues.apache.org/jira/browse/TIKA-359
Project: Tika
Issue Type: Bug
Affects Versions: 0.5
Reporter: Ken Krugler
Assignee: Ken Krugler
Fix For: 0.6
The HtmlParser and TXTParser code currently calls Charset.isSupported() to
determine whether charset hint info (from meta tags or incoming metadata)
names a supported encoding.
But this method throws IllegalCharsetNameException for syntactically illegal
(versus merely unsupported) encoding names, which kills the parsing process.
What's needed is a wrapper that catches this exception and returns false.
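
A minimal sketch of such a wrapper follows. The class name SafeCharset is hypothetical and is not taken from the Tika code base. Treating a null name as unsupported is also an assumption, made because Charset.isSupported() throws IllegalArgumentException for null.

```java
import java.nio.charset.Charset;
import java.nio.charset.IllegalCharsetNameException;

// Hypothetical helper, not the actual Tika fix.
final class SafeCharset {
    private SafeCharset() {}

    /**
     * Returns true only if the name is legal and the JVM supports it.
     * Never throws for bad input.
     */
    static boolean isSupported(String name) {
        if (name == null) {
            // Assumption: a missing hint is treated as "not supported".
            return false;
        }
        try {
            return Charset.isSupported(name);
        } catch (IllegalCharsetNameException e) {
            // The name contains characters that are not allowed in a
            // charset name (for example a space), so it cannot be supported.
            return false;
        }
    }
}
```

With this in place, a parser could call SafeCharset.isSupported(hint) and fall back to its default charset detection when the result is false, instead of aborting the parse.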