[iText-questions] pdfReader and very large pdf documents

Steve Rossi Fri, 15 Aug 2008 13:06:05 -0700

I have a theory that what I'm trying to do isn't supported or 
recommended but I thought I'd give it a shot.


I have several PDF documents that have 250,000 pages each.  The pdf file 
size averages 400-500MB.  All I need is to extract 1-2 pages 
(sequential) from somewhere inside that document and present a new pdf 
document with a watermark.  The pdfs were created using iText.  The 
latter half of this I have no problems with, it's the reader that I'm 
struggling with, specifically:

reader = New text.pdf.PdfReader(New 
text.pdf.RandomAccessFileOrArray(src.FullName), Nothing)

takes anywhere from 25 minutes to an hour.  Now you're probably asking 
why would anyone create a PDF that large?  We've been doing it for about 
6 years now and the PDF software we used before iText could extract a 
page in about 2-3 seconds.

I'm assuming that pdfReader is opening the entire file and 
pre-processing it.  Is there a method/property/class I missed that can 
stop this preprocessing and allow me to go straight to the using 
selection?  Again, this may not be what the reader was designed for and 
feel free to tell me that.

Development environment is iTextSharp 4.1.2 for vb.net / Framework 3.5.

Appreciate any suggestions,

-Steve


-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions

Buy the iText book: http://www.1t3xt.com/docs/book.php

[iText-questions] pdfReader and very large pdf documents

Reply via email to