That's not how PDF works, Scott. There is only a single %PDF in the file, starting at byte 0. There may be multiple %%EOFs, but they represent "update sections" and have nothing to do with any "original formats".
And a content stream represents the page content, which is just one of the MANY objects found in the "body" of the PDF. Leonard On 11/22/11 1:13 PM, "Scott Harris" <sharris...@comcast.net> wrote: >I'm not certain, but since I'm not (yet) familiar with the page dictionary >or annotations, I'm inclined to guess that it would be the latter - >parsing >a content stream identifying a single document as being delimited by %PDF >and %%EOF representing "start doc" and "end doc" respectively. > >Thanks again. >-smh > >-----Original Message----- >From: 1T3XT BVBA [mailto:i...@1t3xt.info] >Sent: Tuesday, November 22, 2011 12:00 PM >To: Post all your questions about iText here >Subject: Re: [iText-questions] - Extracting multiple PDF docs from a >consolidated PDF "container" doc... > >On 22/11/2011 18:35, Scott Harris wrote: >> I need to burst documents not by page, but at arbitrary points in the >> content marked by particular delimiters denoting the start and end of >> each original PDF document. >What do these delimiters look like? Are they extra entries in the page >dictionary? Are they annotations added to the page? Or do you mean you >have to parse the content stream of all pages because the delimiters are >part of the text? > >-------------------------------------------------------------------------- >-- >-- >All the data continuously generated in your IT infrastructure >contains a definitive record of customers, application performance, >security threats, fraudulent activity, and more. Splunk takes this >data and makes sense of it. IT sense. And common sense. >http://p.sf.net/sfu/splunk-novd2d >_______________________________________________ >iText-questions mailing list >iText-questions@lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/itext-questions > >iText(R) is a registered trademark of 1T3XT BVBA. >Many questions posted to this list can (and will) be answered with a >reference to the iText book: http://www.itextpdf.com/book/ >Please check the keywords list before you ask for examples: >http://itextpdf.com/themes/keywords.php > > >-------------------------------------------------------------------------- >---- >All the data continuously generated in your IT infrastructure >contains a definitive record of customers, application performance, >security threats, fraudulent activity, and more. Splunk takes this >data and makes sense of it. IT sense. And common sense. >http://p.sf.net/sfu/splunk-novd2d >_______________________________________________ >iText-questions mailing list >iText-questions@lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/itext-questions > >iText(R) is a registered trademark of 1T3XT BVBA. >Many questions posted to this list can (and will) be answered with a >reference to the iText book: http://www.itextpdf.com/book/ >Please check the keywords list before you ask for examples: >http://itextpdf.com/themes/keywords.php ------------------------------------------------------------------------------ All the data continuously generated in your IT infrastructure contains a definitive record of customers, application performance, security threats, fraudulent activity, and more. Splunk takes this data and makes sense of it. IT sense. And common sense. http://p.sf.net/sfu/splunk-novd2d _______________________________________________ iText-questions mailing list iText-questions@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/itext-questions iText(R) is a registered trademark of 1T3XT BVBA. Many questions posted to this list can (and will) be answered with a reference to the iText book: http://www.itextpdf.com/book/ Please check the keywords list before you ask for examples: http://itextpdf.com/themes/keywords.php