Andreas Lehmkühler created PDFBOX-5483:
------------------------------------------

             Summary: Replace methods using an InputStream from Loader.loadPDF
                 Key: PDFBOX-5483
                 URL: https://issues.apache.org/jira/browse/PDFBOX-5483
             Project: PDFBox
          Issue Type: Improvement
          Components: Parsing
    Affects Versions: 3.0.0 PDFBox
            Reporter: Andreas Lehmkühler
            Assignee: Andreas Lehmkühler
             Fix For: 3.0.0 PDFBox


As discussed on dev@pdfbox
{quote}
We have to remove the loadPDF variants using InputStream and replace them with 
RandomAccessRead.

If it comes to InputStreams users have to decide how to procide:
* copy the InputStream to memory by using RandomAccessReadBuffer
* copy the InputStream to a file and use RandomAccessReadBufferedFile or 
RandomAccessReadMemoryMappedFile

This would make it more transparent what happens under the hood when using the 
different kinds of loadPDF methods:

* a byte array as source is already in memory and the obvious choice is to use 
RandomAccessReadBuffer as a wrapper
* a file as source targets a local file and the most obvious choice is to use 
RandomAccessReadBufferedFile as a wrapper. We should document that as the other 
alternative RandomAccessReadMemoryMappedFile is offered in this case
* RandomAccessRead as source is the most obvious one and the user decides how 
to create it. Additionally is ist possible to implement some own caching 
loading and/or mechanism
{quote}

see PDFBOX-5462 and [High memory usage with pdfbox 
3|https://lists.apache.org/thread/6mmgp23v8b2yztj4hghkgkd14s1gzs8g] as well




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org

Reply via email to