[
https://issues.apache.org/jira/browse/TIKA-402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12866621#action_12866621
]
Jukka Zitting commented on TIKA-402:
------------------------------------
Nice work, thanks!
I took the liberty of changing the indentation to the four spaces we use
elsewhere in Tika and of adding the Apache license headers based on your
indication to license the code to Apache. I also made some adjustments to the
code, mostly to avoid inner classes and inheritance from XMLParser. the
resulting code was committed in revision 943569.
I'm leaving this issue open for further improvements like the Numbers support
you mentioned.
> Support for Keynote and Pages documents
> ---------------------------------------
>
> Key: TIKA-402
> URL: https://issues.apache.org/jira/browse/TIKA-402
> Project: Tika
> Issue Type: New Feature
> Components: parser
> Reporter: Jukka Zitting
> Attachments: iwork.patch, iwork.patch, iwork.patch, testKeynote.key,
> testPages.pages
>
>
> It would be nice to have support for documents created by Apple's Keynote and
> Pages applications. Both file formats are described in
> http://developer.apple.com/mac/library/documentation/AppleApplications/Conceptual/iWork2-0_XML/Chapter01/Introduction.html.
> I'm not sure if there already are open source parser libraries for these
> formats or if we'd need to directly process the XML content.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.