tballison edited a comment on pull request #356: URL: https://github.com/apache/tika/pull/356#issuecomment-698525643
If we do go with the option of streaming the full file first to see whether there are data descriptors, we should let users configure the PKGParser to do that, because of the hit in parsing time. Yes, ideally we'd be able to handle this all behind the scenes, but it is sounding complicated, and there are tradeoffs we'll need to make.
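For context on what the check involves, here is a minimal sketch in plain Java, not Tika's or commons-compress's actual code. It relies on the ZIP format fact that bit 3 of the general purpose flag in a local file header marks an entry that uses a data descriptor. The class and method names are hypothetical, and it inspects only the first entry; covering every entry means reading through the whole stream, which is the parsing-time cost mentioned above.

```java
import java.io.DataInputStream;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical helper for illustration only; not part of Tika.
final class DataDescriptorCheck {
    // Local file header signature "PK\3\4", read as a little-endian int.
    private static final int LOCAL_FILE_HEADER_SIG = 0x04034b50;
    // Bit 3 of the general purpose flag: sizes and CRC follow the entry data.
    private static final int DATA_DESCRIPTOR_FLAG = 0x0008;

    /** Returns true if the first local file header has the data descriptor bit set. */
    static boolean firstEntryUsesDataDescriptor(InputStream in) throws IOException {
        // Signature (4 bytes), version needed (2 bytes), general purpose flag (2 bytes).
        byte[] header = new byte[8];
        new DataInputStream(in).readFully(header);

        int sig = (header[0] & 0xFF)
                | (header[1] & 0xFF) << 8
                | (header[2] & 0xFF) << 16
                | (header[3] & 0xFF) << 24;
        if (sig != LOCAL_FILE_HEADER_SIG) {
            throw new IOException("Stream does not start with a ZIP local file header");
        }

        int flags = (header[6] & 0xFF) | (header[7] & 0xFF) << 8;
        return (flags & DATA_DESCRIPTOR_FLAG) != 0;
    }
}
```

Because the method consumes the first eight bytes, a caller would need a resettable or re-openable stream before handing the data on to the real parser.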