[ https://issues.apache.org/jira/browse/TIKA-815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13418425#comment-13418425 ]
Chris A. Mattmann commented on TIKA-815: ---------------------------------------- Hi Jerome: what do you think about contributing the Tika hardener? I'm +1 to Jukka's suggestion on that and we'd love to have you helping out and appreciate your contributions so far! > Tika parsers should handle failures more gracefully > --------------------------------------------------- > > Key: TIKA-815 > URL: https://issues.apache.org/jira/browse/TIKA-815 > Project: Tika > Issue Type: Test > Components: parser > Affects Versions: 1.0 > Reporter: Jerome Lacoste > > We encountered an OOM while parsing a Word document. We will report the > failure to POI. > This raises the question about the general robustness of the parsers. > We've written a little test tool that reproduces the aforementionned OOM and > other potential issues that will be reported to the individual parsers. It's > the responsibility of the parsers to handle those failures gracefully. > Yet it's easy to write generic tools at the Tika level to make these kind of > tests. > So we also submit this issue here to start a discussion on what role should > Tika have when it comes to validate its parsers. > Code here: https://github.com/lacostej/tika-hardener -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira