[iText-questions] Problem parsing huge PDF files

Massimiliano Ziccardi Thu, 19 Jun 2008 03:51:25 -0700

Hi all.

I have a big problem. I need to parse very large files (> 800 MB) with a huge
number of pages (> 400,000).


When I open such a file with Adobe Acrobat Reader, I have no problem: the file
opens in less than 1 second, and the reader shows me the page count almost
immediately.

If I try to instantiate a PDF reader on the file, I get an OutOfMemoryError.
Raising the JVM heap size is not an option, since I can't know the size of my
PDFs in advance.

Is there a way to parse the PDF without loading so much data into memory
(as, I think, Adobe Reader does)?

To solve the problem I'm facing right now, a simple way to count the pages
would be enough.
However, I think a more general solution would be a nice-to-have feature for
a PDF library.
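
To illustrate the kind of access meant here: a PDF file records the byte
offset of its cross-reference table after a "startxref" keyword near the end
of the file, so a reader can begin from the tail instead of loading
everything. Below is a minimal sketch of that first step only, using plain
java.io rather than iText. The class name PdfTail, the method findStartXref,
and the 1024-byte tail size are assumptions made for illustration; it does not
follow the cross-reference table or count pages.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration; not part of iText.
class PdfTail {
    // Number of trailing bytes to inspect. 1024 is an assumption, not a rule.
    private static final int TAIL_SIZE = 1024;

    /**
     * Returns the offset written after the last "startxref" keyword,
     * or -1 if the keyword or the number is not found in the file tail.
     */
    static long findStartXref(String path) throws IOException {
        try (RandomAccessFile raf = new RandomAccessFile(path, "r")) {
            long length = raf.length();
            int toRead = (int) Math.min(TAIL_SIZE, length);
            byte[] tail = new byte[toRead];
            raf.seek(length - toRead); // jump to the tail; earlier bytes are never read
            raf.readFully(tail);

            // ISO-8859-1 maps one byte to one char, so indices match byte positions.
            String text = new String(tail, StandardCharsets.ISO_8859_1);
            int keyword = text.lastIndexOf("startxref");
            if (keyword < 0) {
                return -1;
            }

            // Skip whitespace after the keyword, then collect the decimal digits.
            int i = keyword + "startxref".length();
            while (i < text.length() && Character.isWhitespace(text.charAt(i))) {
                i++;
            }
            int start = i;
            while (i < text.length() && text.charAt(i) >= '0' && text.charAt(i) <= '9') {
                i++;
            }
            if (i == start) {
                return -1;
            }
            return Long.parseLong(text.substring(start, i));
        }
    }
}
```

Memory use here is bounded by the tail buffer regardless of file size, which
is the property I am looking for in a PDF library.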

Thanks in advance,
Massimiliano Ziccardi

