[
https://issues.apache.org/jira/browse/TIKA-741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15137097#comment-15137097
]
Tim Allison commented on TIKA-741:
----------------------------------
How are you replicating this with 1.11? I'm not able to replicate this with
the tika-app gui or commandline. I'm also not able to replicate this as a unit
test in trunk.
In the stack trace, it looks like there are modified Tika classes:
EnhancedPDFParser and EnhancedPDF2XHTML. If these modified classes are
forgetting to close an entity, you'll get this exception...as I found when
working with Acroforms :/
In short, if you can help me replicate this with pure Tika, I'll be happy to
take a look.
> "Zip bomb" (XML nesting) detection is too strict
> ------------------------------------------------
>
> Key: TIKA-741
> URL: https://issues.apache.org/jira/browse/TIKA-741
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 0.10
> Reporter: Erik Hetzner
> Assignee: Jukka Zitting
> Priority: Minor
> Fix For: 1.0
>
>
> I get "zip bomb" errors from many HTML documents, e.g.
> http://www.akhbaar.org/wesima_articles/index-20100101-82736.html
> Is there a way that the element nesting level could be made configurable? 30
> elements just doesn't seem to be enough.
> Thanks!
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)