[ 
https://issues.apache.org/jira/browse/TIKA-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14747292#comment-14747292
 ] 

Tim Allison commented on TIKA-1731:
-----------------------------------

Please don't stop watching.  We can use your help!  Many thanks for your 
contributions so far.

Once we do the integration, it would be helpful to have a document that tests 
various components (headers, footers, footnotes, tables, text boxes, embedded 
documents, table of contents...to name a few) for each of the document formats.

Or, at the least, if you could run the integration (once it is completed) 
against a batch of docs and let us know what you find, that would be helpful.

Thank you, again!

> Try to integrate java-hwp into Tika
> -----------------------------------
>
>                 Key: TIKA-1731
>                 URL: https://issues.apache.org/jira/browse/TIKA-1731
>             Project: Tika
>          Issue Type: New Feature
>            Reporter: Tim Allison
>            Priority: Minor
>
> Now that we have detection working for hwp files, it would be great to add a 
> parser.
> [java-hwp|https://github.com/ddoleye/java-hwp] looks like a promising 
> candidate.  We'd need to ask ddoleye about a potential change in license and 
> then interest in maintenance + pushing to maven.
> Any other candidates?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to