[ 
https://issues.apache.org/jira/browse/PDFBOX-4836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17199850#comment-17199850
 ] 

Andreas Lehmkühler commented on PDFBOX-4836:
--------------------------------------------

ScratchFile isn't used for the reading of COSStreams any more. 

Furthermore, I've removed the use of ScratchFile as a cache for PDFs that are 
provided as an InputStream. Those are now always copied to a memory-based buffer 
before processing. I know the ScratchFile-backed cache is something some users 
might miss, but IMHO we shouldn't mix up the external caching of the input data 
with the internal caching of COSStreams. If there is any need for that in the 
future, we should extend the RandomAccessRead classes to support different 
caching strategies for streams used as parser input.
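
To illustrate what "copied to a memory-based buffer" means for InputStream 
input, here is a minimal sketch using only JDK classes. It is not the actual 
PDFBox code; the class and method names (InMemoryInputSketch, copyToMemory) are 
made up for this example, and the 8 KiB chunk size is an arbitrary choice.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, for illustration only; not part of PDFBox.
class InMemoryInputSketch {

    // Drains the whole stream into a byte array so that the data can
    // afterwards be read at arbitrary offsets without a temp file.
    static byte[] copyToMemory(InputStream in) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        byte[] chunk = new byte[8192]; // arbitrary chunk size
        int n;
        while ((n = in.read(chunk)) != -1) {
            buffer.write(chunk, 0, n);
        }
        return buffer.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        // Placeholder bytes standing in for a real PDF input stream.
        byte[] source = "%PDF-1.7 placeholder".getBytes(StandardCharsets.US_ASCII);
        try (InputStream in = new ByteArrayInputStream(source)) {
            byte[] inMemory = copyToMemory(in);
            System.out.println(inMemory.length + " bytes buffered in memory");
        }
    }
}
```

The trade-off is the one mentioned above: the whole input has to fit into the 
heap, so callers with very large PDFs may prefer to pass a file instead of an 
InputStream.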


> Reduce the usage of ScratchFileBuffer when parsing a PDF
> --------------------------------------------------------
>
>                 Key: PDFBOX-4836
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4836
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Parsing
>    Affects Versions: 3.0.0 PDFBox
>            Reporter: Andreas Lehmkühler
>            Assignee: Andreas Lehmkühler
>            Priority: Major
>             Fix For: 3.0.0 PDFBox
>
>         Attachments: EDGE11896203.pdf, image-2020-05-17-16-40-28-712.png, 
> raw_image_demo.pdf
>
>
> Instead of using a scratch file buffer to read a COSStream, the parser should 
> use the source directly to reduce the memory footprint.



