Re: memory requirements when merging PDF files?

2022-01-08 Thread Gilad Denneboom
If you're using the PDFMergerUtility class you can specify a MemoryUsageSetting parameter when calling mergedDocuments. Use its setupTempFileOnly method to create a temporary file instead of doing it in-memory. On Thu, Jan 6, 2022 at 6:27 PM John Lussmyer wrote: > I have a need to merge a couple

Re: memory requirements when merging PDF files?

2022-01-07 Thread John Lussmyer
On Fri Jan 07 08:55:38 PST 2022 ke...@trumpetinc.com said: >If you use the temporary file memory storage, it should be possible to work >with very large files. Thanks, I was hoping there was some way to deal with this case. I just ran a quick test, generating a 2000 page PDF by placing a 1 page P

Re: memory requirements when merging PDF files?

2022-01-07 Thread Tilman Hausherr
Yes that's a good idea. However it will be much slower, and the objects will still be in memory, only the stream contents (e.g. images, fonts, content streams) will be on disk. Tilman Am 07.01.2022 um 17:55 schrieb Kevin Day: If you use the temporary file memory storage, it should be possible

Re: memory requirements when merging PDF files?

2022-01-07 Thread Kevin Day
If you use the temporary file memory storage, it should be possible to work with very large files. https://stackoverflow.com/questions/11301818/pdfbox-working-with-very-large-pdfs/38859566 This isn't streaming (pdf is not really amenable to streaming like you are asking), but the disk based scrat

Re: memory requirements when merging PDF files?

2022-01-06 Thread Tilman Hausherr
Am 06.01.2022 um 18:26 schrieb John Lussmyer: I have a need to merge a couple thousand PDF's into one humongous PDF. The old tool we use for PDF manipulation runs out of memory as it builds the result PDF in memory, and only writes it out when done. Can PDFBox do something more like streaming t

memory requirements when merging PDF files?

2022-01-06 Thread John Lussmyer
I have a need to merge a couple thousand PDF's into one humongous PDF. The old tool we use for PDF manipulation runs out of memory as it builds the result PDF in memory, and only writes it out when done. Can PDFBox do something more like streaming the output as it's built? or even not load all