[ https://issues.apache.org/jira/browse/TIKA-822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13173769#comment-13173769 ]
peter royal commented on TIKA-822:
----------------------------------

The RFC for MIME isn't clear on whether single quotes make a valid quoted string. Overall, the parser needs a bit more work to be fully RFC-compliant (quoted strings can contain equals signs, for instance). I was just trying to fix the simple case I came across: the JavaMail API generates quoted charset fields for text attachments, which is how I found this.

> MediaType fails to parse charset that has quoted value
> ------------------------------------------------------
>
>                 Key: TIKA-822
>                 URL: https://issues.apache.org/jira/browse/TIKA-822
>             Project: Tika
>          Issue Type: Bug
>          Components: mime
>    Affects Versions: 1.0
>            Reporter: peter royal
>
> If a mime type is
>   text/html; charset="UTF-8"
> the parsed charset value is incorrectly "UTF-8" (quotes included), not UTF-8.
> Patch available at
> https://github.com/osi/tika/commit/b77814874ebff8f412ebb2f2adc52c6465d603c4
> I have a CLA on file.
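
For illustration, here is a minimal sketch of how quoted parameter values such as charset="UTF-8" could be handled, including equals signs and semicolons inside the quotes. It is not the linked patch and not Tika's actual MediaType code; the class QuotedParamSketch and the method parseParameters are hypothetical names, and the sketch treats only double quotes as quoting characters, which is an assumption.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper for illustration only; not Tika's MediaType implementation.
final class QuotedParamSketch {

    // Returns the parameters that follow the first ';' in a media type string.
    // Double-quoted values are unquoted; backslash escapes inside quotes are resolved.
    static Map<String, String> parseParameters(String s) {
        Map<String, String> params = new LinkedHashMap<>();
        int n = s.length();
        int i = s.indexOf(';');
        if (i < 0) {
            return params; // no parameters present
        }
        i++; // step past the first ';'
        while (i < n) {
            int eq = s.indexOf('=', i);
            int semi = s.indexOf(';', i);
            if (eq < 0) {
                break; // no further name=value pairs
            }
            if (semi >= 0 && semi < eq) {
                i = semi + 1; // segment without '=': skip it
                continue;
            }
            String name = s.substring(i, eq).trim().toLowerCase(Locale.ROOT);
            i = eq + 1;
            while (i < n && Character.isWhitespace(s.charAt(i))) {
                i++;
            }
            StringBuilder value = new StringBuilder();
            if (i < n && s.charAt(i) == '"') {
                i++; // opening quote
                while (i < n && s.charAt(i) != '"') {
                    char c = s.charAt(i);
                    if (c == '\\' && i + 1 < n) {
                        c = s.charAt(++i); // escaped character is taken literally
                    }
                    value.append(c);
                    i++;
                }
                i++; // closing quote (or end of input if unterminated)
                int next = s.indexOf(';', i);
                i = next < 0 ? n : next + 1;
            } else {
                // Unquoted value: runs up to the next ';' or end of input.
                int next = s.indexOf(';', i);
                int end = next < 0 ? n : next;
                value.append(s.substring(i, end).trim());
                i = next < 0 ? n : next + 1;
            }
            if (!name.isEmpty()) {
                params.put(name, value.toString());
            }
        }
        return params;
    }
}
```

With this sketch, parseParameters("text/html; charset=\"UTF-8\"") maps charset to UTF-8 without the surrounding quotes, and a quoted value like "a=b;c" is kept intact rather than being split on the inner = or ;.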