[ 
https://issues.apache.org/jira/browse/TIKA-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14738638#comment-14738638
 ] 

Tim Allison commented on TIKA-1731:
-----------------------------------

Thank you for looking into this.  

bq. can Tika+POI as they are handle it

I should have been more specific -- this question was about OOXML.  If I 
understand correctly, HWP 5.0 uses OLE objects, not OOXML.
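
A minimal sketch of how one might check which container a given file uses 
before handing it to Tika. It only compares the leading bytes against the 
well-known OLE2 compound-file and ZIP signatures. The class and method names 
(ContainerSniffer, sniff) are hypothetical, and this is not how Tika itself 
does detection. A ZIP signature only shows that the file is a ZIP container; 
OOXML is ZIP-based, but not every ZIP is OOXML.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper, not part of Tika or POI.
final class ContainerSniffer {
    // Signature at the start of an OLE2 compound file.
    private static final byte[] OLE2_MAGIC = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };
    // ZIP local file header signature ("PK\3\4"); OOXML packages are ZIP files.
    private static final byte[] ZIP_MAGIC = {0x50, 0x4B, 0x03, 0x04};

    // Returns "OLE2", "ZIP", or "UNKNOWN" based on the leading bytes only.
    static String sniff(byte[] header) {
        if (startsWith(header, OLE2_MAGIC)) {
            return "OLE2";
        }
        if (startsWith(header, ZIP_MAGIC)) {
            return "ZIP";
        }
        return "UNKNOWN";
    }

    private static boolean startsWith(byte[] data, byte[] prefix) {
        if (data.length < prefix.length) {
            return false;
        }
        for (int i = 0; i < prefix.length; i++) {
            if (data[i] != prefix[i]) {
                return false;
            }
        }
        return true;
    }

    // Usage: java ContainerSniffer <file>
    public static void main(String[] args) throws IOException {
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            byte[] buf = new byte[8];
            int n = in.readNBytes(buf, 0, buf.length); // requires Java 9+
            System.out.println(sniff(Arrays.copyOf(buf, n)));
        }
    }
}
```

A file that reports ZIP here would still need its package contents inspected 
to confirm that it is really OOXML.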

If you generate an OOXML file via HWP and then run it through Tika, do you get 
what you expect, or are there some non-standard (or not-yet-handled-by-Tika) 
features that we need to fix?

> Try to integrate java-hwp into Tika
> -----------------------------------
>
>                 Key: TIKA-1731
>                 URL: https://issues.apache.org/jira/browse/TIKA-1731
>             Project: Tika
>          Issue Type: New Feature
>            Reporter: Tim Allison
>            Priority: Minor
>
> Now that we have detection working for hwp files, it would be great to add a 
> parser.
> [java-hwp|https://github.com/ddoleye/java-hwp] looks like a promising 
> candidate.  We'd need to ask ddoleye about a potential change in license and 
> then interest in maintenance + pushing to maven.
> Any other candidates?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
