[ 
https://issues.apache.org/jira/browse/TIKA-1110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jukka Zitting resolved TIKA-1110.
---------------------------------

    Resolution: Fixed
      Assignee: Jukka Zitting

Thanks, Vadim! I committed your patch (with spaces instead of tabs) in revision 
1553617.

I also made a few follow-up improvements (and added a test case) in revisions 
1553618 and 1553619 to make the integration with AutoDetectParser work more 
smoothly.

> Incorrectly declared SUPPORTED_TYPES in ChmParser.
> --------------------------------------------------
>
>                 Key: TIKA-1110
>                 URL: https://issues.apache.org/jira/browse/TIKA-1110
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.3, 1.4
>            Reporter: Andrzej Bialecki 
>            Assignee: Jukka Zitting
>             Fix For: 1.5
>
>         Attachments: TIKA-1110.patch
>
>
> [This 
> link|http://www.iana.org/assignments/media-types/application/vnd.ms-htmlhelp] 
> assigns the official mime type for these files to 
> "application/vnd.ms-htmlhelp". In the wild there are also two other types 
> used:
> * application/chm
> * application/x-chm
> tika-mimetypes.xml uses the correct official MIME type, but ChmParser 
> declares that it supports only "application/chm". As a result, content 
> labeled with the official MIME type (e.g. identified by a Detector, parsed 
> via AutoDetectParser, or simply declared in metadata) fails to parse 
> because no parser is registered for that type.
> The fix seems simple: ChmParser should also declare all of the above types 
> in its SUPPORTED_TYPES.
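
For illustration only, the mismatch described above can be sketched in plain 
Java. This is not the committed patch: the class name ChmTypesSketch and the 
isSupported helper are hypothetical, and plain strings stand in for Tika's 
MediaType objects so that the example runs without Tika on the classpath.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

// Hypothetical stand-in for the parser's type declaration; not Tika code.
public class ChmTypesSketch {

    // Declaration as reported in the issue: only the unofficial type.
    static final Set<String> OLD_SUPPORTED_TYPES =
            Collections.singleton("application/chm");

    // Declaration as proposed in the issue: the official IANA type
    // plus the two variants seen in the wild.
    static final Set<String> SUPPORTED_TYPES =
            Collections.unmodifiableSet(new HashSet<>(Arrays.asList(
                    "application/vnd.ms-htmlhelp",
                    "application/chm",
                    "application/x-chm")));

    // Simplified model of parser selection: a parser is only chosen
    // when the detected type is in its declared set.
    static boolean isSupported(Set<String> declared, String detectedType) {
        return declared.contains(detectedType);
    }

    public static void main(String[] args) {
        String detected = "application/vnd.ms-htmlhelp";
        System.out.println(isSupported(OLD_SUPPORTED_TYPES, detected));
        System.out.println(isSupported(SUPPORTED_TYPES, detected));
    }
}
```

With the old declaration the official type is not matched, so a lookup keyed 
on the detected type finds no CHM parser; with all three types declared, the 
same lookup succeeds.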



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
