[ 
https://issues.apache.org/jira/browse/TIKA-1946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15768100#comment-15768100
 ] 

Pascal Essiembre commented on TIKA-1946:
----------------------------------------

I also checked, and it looks like a version issue. The files that pass are 
.wpd files, and when I debug, the "major version" number obtained by parsing 
the file is 2. The .wp files have a major version of 0, which is wrong. .wpd 
files are more recent; .wp and wp[7654] are for older WordPerfect formats. 
Shall we throw a TikaException when an unsupported format is encountered? So 
far, the files that are invalid (the older ones) do appear to open fine in 
Word, as you mention. I am not sure whether the format is compatible with the 
MSOffice parser. It may be worth a try?
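
To make the proposal concrete, here is a minimal sketch of such a version 
check. It is only an illustration: the class and method names are 
hypothetical, the nested exception is a stand-in for TikaException so the 
snippet compiles without Tika on the classpath, and treating 2 as the only 
supported major version is an assumption based on the files tested so far.

```java
// Hypothetical helper, not part of Tika or the Norconex importer.
class WordPerfectVersionCheck {

    // Assumption: only files reporting major version 2 (the .wpd files
    // that passed) are handled by the parser.
    static final int SUPPORTED_MAJOR_VERSION = 2;

    // Stand-in for org.apache.tika.exception.TikaException.
    static class UnsupportedVersionException extends Exception {
        UnsupportedVersionException(String message) {
            super(message);
        }
    }

    static boolean isSupported(int majorVersion) {
        return majorVersion == SUPPORTED_MAJOR_VERSION;
    }

    // Fails fast instead of attempting to parse an older format.
    static void requireSupported(int majorVersion)
            throws UnsupportedVersionException {
        if (!isSupported(majorVersion)) {
            throw new UnsupportedVersionException(
                    "Unsupported WordPerfect major version: " + majorVersion);
        }
    }

    public static void main(String[] args) {
        // 2 is what the .wpd files reported, 0 what the .wp files reported.
        for (int majorVersion : new int[] {2, 0}) {
            try {
                requireSupported(majorVersion);
                System.out.println("major version " + majorVersion + ": ok");
            } catch (UnsupportedVersionException e) {
                System.out.println(e.getMessage());
            }
        }
    }
}
```

In the real parser the check would run right after the major version is read 
from the file, and it would throw TikaException directly.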

> Add mime detection and parser for WordPerfect
> ---------------------------------------------
>
>                 Key: TIKA-1946
>                 URL: https://issues.apache.org/jira/browse/TIKA-1946
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime, parser
>            Reporter: Nick C
>             Fix For: 2.0, 1.15
>
>
> I noticed some code on GitHub for parsing WordPerfect files 
> (https://github.com/Norconex/importer). It also looks like the author 
> [~pascal.essiembre] has contributed to Tika before.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
