[ 
https://issues.apache.org/jira/browse/TIKA-3196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17324384#comment-17324384
 ] 

ASF GitHub Bot commented on TIKA-3196:
--------------------------------------

lfcnassif edited a comment on pull request #364:
URL: https://github.com/apache/tika/pull/364#issuecomment-821914480


   This was merged (in 1.26 at least) without fixing the thread-safety issue 
that Tim noticed.




> PackageParser should attempt to parse entries from zip files with STORED 
> entries with data descriptor
> -----------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-3196
>                 URL: https://issues.apache.org/jira/browse/TIKA-3196
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>            Reporter: Trevor Bentley
>            Priority: Major
>             Fix For: 2.0.0, 1.25
>
>         Attachments: OOO-107047-0.oxt-145.zip
>
>
> We currently use Tika for text extraction. Some sites return zip files 
> containing STORED entries with data descriptors, which fail to extract 
> because ZipArchiveInputStream (in commons-compress) defaults 
> 'allowStoredEntriesWithDataDescriptor' to false.
> Since ZipArchiveInputStream supports reading such entries, we should retry 
> reading the zip with that option enabled when we get a data descriptor 
> UnsupportedZipFeatureException.
> Pull Request: 
> [https://github.com/apache/tika/pull/356|https://github.com/apache/tika/pull/355]
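
The retry described in the issue can be sketched roughly as below. This is
only an illustration, not the code from the pull request: so that it compiles
with the JDK alone, EntryReader and DataDescriptorException are hypothetical
stand-ins for commons-compress's ZipArchiveInputStream (with its
'allowStoredEntriesWithDataDescriptor' option) and for the data descriptor
UnsupportedZipFeatureException. It also assumes the zip bytes are buffered in
memory, so the second attempt can start again from the first byte.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.Collections;
import java.util.List;

public class StoredEntryRetrySketch {

    /** Hypothetical stand-in for the data descriptor UnsupportedZipFeatureException. */
    public static class DataDescriptorException extends IOException {
        public DataDescriptorException(String message) {
            super(message);
        }
    }

    /** Hypothetical stand-in for a zip reader that takes the permissive flag. */
    public interface EntryReader {
        List<String> readEntryNames(InputStream in,
                                    boolean allowStoredEntriesWithDataDescriptor)
                throws IOException;
    }

    /**
     * Reads with the strict default first; only if the data descriptor
     * failure occurs, reads again from the start with the flag enabled.
     */
    public static List<String> readWithRetry(byte[] zipBytes, EntryReader reader)
            throws IOException {
        try {
            return reader.readEntryNames(new ByteArrayInputStream(zipBytes), false);
        } catch (DataDescriptorException e) {
            // A fresh stream over the same bytes: the first one is partly consumed.
            return reader.readEntryNames(new ByteArrayInputStream(zipBytes), true);
        }
    }

    public static void main(String[] args) throws IOException {
        // Fake reader that fails unless the permissive flag is set.
        EntryReader fake = (in, allow) -> {
            if (!allow) {
                throw new DataDescriptorException("STORED entry uses a data descriptor");
            }
            return Collections.singletonList("content.xml");
        };
        System.out.println(readWithRetry(new byte[0], fake));
    }
}
```

Any other IOException propagates unchanged, so only the specific data
descriptor failure triggers the second attempt. The helper keeps all retry
state in local variables; whether that addresses the thread-safety concern
raised on the pull request is not something this sketch can settle.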


