[ 
https://issues.apache.org/jira/browse/TIKA-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15770558#comment-15770558
 ] 

Tim Allison commented on TIKA-1946:
-----------------------------------

As mentioned, I went with more specific identification of file version via mime 
types...the overall mime is still {{application/vnd.wordperfect}}.  This leads 
only the right version of files to be directed to the new parsers.  They will 
throw exceptions if they are somehow given the wrong version.  

This behavior seems to be inline for how we're handling file types for which we 
don't have parsers, which, in effect, is what we have here.

I'd like to leave {{UnsupportedFormatException}} for those cases where we can't 
determine the mime before sending it to the general parser that should 
generally handle it.

I'm happy to change this behavior if desired.  Thank you, all!

> Add mime detection and parser for WordPerfect
> ---------------------------------------------
>
>                 Key: TIKA-1946
>                 URL: https://issues.apache.org/jira/browse/TIKA-1946
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime, parser
>            Reporter: Nick C
>             Fix For: 2.0, 1.15
>
>         Attachments: TIKA-1946-pascal.essiembre-01.patch, 
> wordperfect_mimes_fuller.zip, wordperfect_signatures_by_versions.xlsx
>
>
> I noticed some code on github for parsing WordPerfect files 
> (https://github.com/Norconex/importer) Also looks like the author 
> [~pascal.essiembre] has contributed to Tika before



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to