[ https://issues.apache.org/jira/browse/TIKA-149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12619460#action_12619460 ]
Jukka Zitting commented on TIKA-149: ------------------------------------ Looks good, though I don't think we need to copy the zip entries to temporary files before parsing. Also, instead of using ParseUtils.getParser, how about using an instance variable and a setter method for the delegate parser? > Parser for zip files > -------------------- > > Key: TIKA-149 > URL: https://issues.apache.org/jira/browse/TIKA-149 > Project: Tika > Issue Type: New Feature > Components: parser > Reporter: Jukka Zitting > Attachments: TIKA-149.patch > > > Tika should be able to parse zip files. The resulting XHTML document should > be something like this: > <xhtml> > <head>...</head> > <body> > <div class="file"> > <h1>path/to/file/inside/the/zip</h1> > ... (parsed contents of the file) > </div> > ... > </body> > </xhtml> -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.