If you set the system property "org.apache.pdfbox.pdfparser.nonSequentialPDFParser.parseMinimal" to true, page content is parsed only when you request the page. In addition, the default behavior also creates fewer objects. Although this is not the behavior you asked for, i.e. streaming like StAX, it is an improvement over the standard parser.
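A minimal sketch of setting that property from Java before any parsing starts. Only the property name comes from this message; the class name `ParseMinimalSetup` and its helper methods are hypothetical, and the sketch deliberately does not call PDFBox so that it runs without the library on the classpath.

```java
public class ParseMinimalSetup {
    // Property name as quoted in this message.
    public static final String PARSE_MINIMAL =
        "org.apache.pdfbox.pdfparser.nonSequentialPDFParser.parseMinimal";

    // Sets the property for the current JVM. It has to be set before
    // the parser is created, otherwise the parser cannot see it.
    public static void enableParseMinimal() {
        System.setProperty(PARSE_MINIMAL, "true");
    }

    // Boolean.getBoolean returns true only if the system property
    // exists and equals "true" (case-insensitive).
    public static boolean isParseMinimalEnabled() {
        return Boolean.getBoolean(PARSE_MINIMAL);
    }

    public static void main(String[] args) {
        enableParseMinimal();
        System.out.println(PARSE_MINIMAL + " = " + isParseMinimalEnabled());
        // Assumption: the document is then loaded through the
        // non-sequential parser; that call is omitted here to keep
        // the sketch free of external dependencies.
    }
}
```

The same effect should be achievable without code changes by passing the property on the command line with the JVM's standard `-D<name>=true` option.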
As Andreas suggested, you should give it a try.

Kind regards
Maruan

On 05.02.2013 at 19:38, Andreas Lehmkuehler <[email protected]> wrote:

> Hi,
>
> On 05.02.2013 at 15:20, VIGNESH S wrote:
>> I think the non-sequential PDF parser also loads every object into the object pool.
>>
>> The difference, I think, is that the non-sequential parser reads the xref table
>> in the trailer to learn the PDF structure instead of linearly traversing
>> the document.
> Yes, it works differently from the old one (it follows the spec).
> Did you try it?
>
>> Correct me if I am wrong.
>
> BR
> Andreas Lehmkühler

