[
https://issues.apache.org/jira/browse/TIKA-4424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17956998#comment-17956998
]
Tim Allison commented on TIKA-4424:
-----------------------------------
And, y, [~tilman] , you're right – the change in markLimit broke this. I set
that to -1 thinking only of the TikaInputStream as the use case, but of course,
that's very wrong for any other type of stream. :( Mea culpa.
> Regression in zip-based detection with an InputStream in 3.2.0
> --------------------------------------------------------------
>
> Key: TIKA-4424
> URL: https://issues.apache.org/jira/browse/TIKA-4424
> Project: Tika
> Issue Type: Task
> Components: detector
> Affects Versions: 3.2.0
> Reporter: Tim Allison
> Priority: Major
> Labels: regression
> Fix For: 4.0.0, 3.2.1
>
> Attachments: tika-4424.zip
>
>
> On the user list, Craig Muchinsky and Pontus Amberg noted new problems with
> detection of zip based files.
> Craig noted that this affects InputStream detection, and Pontus noted that
> even if he switched to a TikaInputStream, his kmz file was getting detected
> as a zip.
> This is Pontus' code:
> {noformat}
> Tike.detect(InputStream stream, String name)
> {noformat}
> {noformat}
> pp//org.apache.tika.io.BoundedInputStream.reset(BoundedInputStream.java:115)
> app//org.apache.tika.detect.zip.DefaultZipContainerDetector.detectStreaming(DefaultZipContainerDetector.java:279)
> app//org.apache.tika.detect.zip.DefaultZipContainerDetector.detect(DefaultZipContainerDetector.java:192)
> app//org.apache.tika.detect.CompositeDetector.detect(CompositeDetector.java:84)
> {noformat}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)