[ https://issues.apache.org/jira/browse/TIKA-352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jukka Zitting resolved TIKA-352.
--------------------------------
Resolution: Fixed
Fix Version/s: 0.6
Assignee: Jukka Zitting
Patch applied in revision 891082, along with an additional minor fix for the
case when CONTENT_TYPE is *not* set in the input metadata for TXTParser.
> Use MediaType.parse when extracting charset from content-type metadata in
> parsers
> ---------------------------------------------------------------------------------
>
> Key: TIKA-352
> URL: https://issues.apache.org/jira/browse/TIKA-352
> Project: Tika
> Issue Type: Improvement
> Affects Versions: 0.6
> Reporter: Ken Krugler
> Assignee: Jukka Zitting
> Priority: Minor
> Fix For: 0.6
>
> Attachments: TIKA-352.patch
>
>
> Both HtmlParser and TXTParser have their own code to extract the charset from
> a content-type metadata string. They should be using MediaType.parse to do
> this.
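
To illustrate the idea in the description above, here is a minimal, stdlib-only sketch of a single shared helper that pulls the charset parameter out of a content-type string, so that individual parsers do not each carry their own copy of that logic. The class name ContentTypeCharset and the method charsetOf are hypothetical stand-ins invented for this example; this is not Tika's code and not the contents of TIKA-352.patch. In Tika itself the shared entry point would be MediaType.parse, presumably followed by a lookup of the "charset" parameter on the result. The sketch also covers the case mentioned in the resolution comment, where no content type is set at all.

```java
import java.util.Locale;

/**
 * Hypothetical stand-in for a shared content-type parser.
 * Illustrative only; not Tika's MediaType implementation.
 */
final class ContentTypeCharset {

    private ContentTypeCharset() {}

    /**
     * Returns the value of the charset parameter in a Content-Type string,
     * or null if the string is null or carries no charset parameter.
     */
    static String charsetOf(String contentType) {
        if (contentType == null) {
            return null; // no content type in the metadata at all
        }
        // parts[0] is "type/subtype"; later parts are name=value parameters.
        // Simplification: a ';' inside a quoted value is not handled.
        String[] parts = contentType.split(";");
        for (int i = 1; i < parts.length; i++) {
            String part = parts[i];
            int eq = part.indexOf('=');
            if (eq < 0) {
                continue; // not a name=value pair, skip it
            }
            String name = part.substring(0, eq).trim().toLowerCase(Locale.ROOT);
            if (name.equals("charset")) {
                String value = part.substring(eq + 1).trim();
                // Strip optional surrounding double quotes.
                if (value.length() >= 2 && value.startsWith("\"") && value.endsWith("\"")) {
                    value = value.substring(1, value.length() - 1);
                }
                return value.isEmpty() ? null : value;
            }
        }
        return null;
    }

    public static void main(String[] args) {
        System.out.println(charsetOf("text/html; charset=ISO-8859-1"));
        System.out.println(charsetOf("text/plain"));
        System.out.println(charsetOf(null));
    }
}
```

With one helper like this, a parser only has to decide what to do when the result is null (for example, fall back to charset detection) instead of re-implementing the string handling itself.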