[ https://issues.apache.org/jira/browse/PDFBOX-4836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17199850#comment-17199850 ]
Andreas Lehmkühler commented on PDFBOX-4836: -------------------------------------------- ScratchFile isn't used for the reading of COSStreams any more. Furthermore I've removed the usage of ScratchFile as a cache for PDFs which are provided as InputStream. Now, those are always copied to a memory based buffer before processing. I know the latter is something some users might miss, but IMHO we shouldn't mix up the external caching of the input data with the internal caching of COSStreams. If there is any need for that in the feature we should extend the RandomAccessRead classes to support different caching strategies for streams as parser input. > Reduce the usage of ScatchFileBuffer when parsing a pdf > ------------------------------------------------------- > > Key: PDFBOX-4836 > URL: https://issues.apache.org/jira/browse/PDFBOX-4836 > Project: PDFBox > Issue Type: Improvement > Components: Parsing > Affects Versions: 3.0.0 PDFBox > Reporter: Andreas Lehmkühler > Assignee: Andreas Lehmkühler > Priority: Major > Fix For: 3.0.0 PDFBox > > Attachments: EDGE11896203.pdf, image-2020-05-17-16-40-28-712.png, > raw_image_demo.pdf > > > Instead of using a scatchfile buffer to read a COSStream the parser should > use the source directly to reduce the memory footprint -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org