[ 
https://issues.apache.org/jira/browse/TIKA-741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15137097#comment-15137097
 ] 

Tim Allison commented on TIKA-741:
----------------------------------

How are you replicating this with 1.11?  I can't reproduce it with the 
tika-app GUI or the command line, and I can't reproduce it as a unit test in 
trunk either.

In the stack trace, it looks like there are modified Tika classes: 
EnhancedPDFParser and EnhancedPDF2XHTML.  If these modified classes forget to 
close an element, the nesting depth keeps growing and you'll get this 
exception...as I found when working with AcroForms :/
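
To illustrate why a missing close call would trip a nesting check, here is a 
minimal sketch in plain Java/SAX.  It is not Tika's actual code: 
DepthGuardHandler and UnclosedElementDemo are hypothetical names, and the 
limit of 30 is just the figure mentioned in the issue description.  The only 
assumption is that the check counts currently open elements and fails once 
that count passes a limit.

```java
import org.xml.sax.Attributes;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.AttributesImpl;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical stand-in for a nesting-depth guard; NOT Tika's real class.
class DepthGuardHandler extends DefaultHandler {
    private final int maxDepth;
    private int depth = 0;

    DepthGuardHandler(int maxDepth) {
        this.maxDepth = maxDepth;
    }

    @Override
    public void startElement(String uri, String localName, String qName,
                             Attributes atts) throws SAXException {
        depth++;
        if (depth > maxDepth) {
            throw new SAXException(
                "Suspected zip bomb: " + depth + " levels of element nesting");
        }
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        depth--;
    }
}

// Hypothetical driver that plays the role of a parser emitting SAX events.
class UnclosedElementDemo {
    // Emits `count` sibling <p> elements. When closeEach is false, it
    // simulates a parser that forgets to call endElement.
    // Returns true if the guard threw.
    static boolean tripsGuard(int count, boolean closeEach, int maxDepth) {
        DepthGuardHandler handler = new DepthGuardHandler(maxDepth);
        Attributes noAtts = new AttributesImpl();
        try {
            for (int i = 0; i < count; i++) {
                handler.startElement("", "p", "p", noAtts);
                if (closeEach) {
                    handler.endElement("", "p", "p");
                }
            }
            return false;
        } catch (SAXException e) {
            return true;
        }
    }
}
```

With a limit of 30, tripsGuard(100, true, 30) stays at depth 1 and never 
throws, while tripsGuard(100, false, 30) throws on the 31st start call even 
though the document being described is completely flat.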

In short, if you can help me replicate this with pure Tika, I'll be happy to 
take a look.

> "Zip bomb" (XML nesting) detection is too strict
> ------------------------------------------------
>
>                 Key: TIKA-741
>                 URL: https://issues.apache.org/jira/browse/TIKA-741
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Erik Hetzner
>            Assignee: Jukka Zitting
>            Priority: Minor
>             Fix For: 1.0
>
>
> I get "zip bomb" errors from many HTML documents, e.g. 
> http://www.akhbaar.org/wesima_articles/index-20100101-82736.html
> Is there a way that the element nesting level could be made configurable? 30 
> elements just doesn't seem to be enough.
> Thanks!



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
