That's not how PDF works, Scott.

There is only a single %PDF in the file, starting at byte 0.  There may be
multiple %%EOFs, but they represent "update sections" and have nothing to
do with any "original formats".

And a content stream represents the page content, which is just one of the
MANY objects found in the "body" of the PDF.

Leonard


On 11/22/11 1:13 PM, "Scott Harris" <sharris...@comcast.net> wrote:

>I'm not certain, but since I'm not (yet) familiar with the page dictionary
>or annotations, I'm inclined to guess that it would be the latter -
>parsing
>a content stream identifying a single document as being delimited by %PDF
>and %%EOF representing "start doc" and "end doc" respectively.
>
>Thanks again.
>-smh
>
>-----Original Message-----
>From: 1T3XT BVBA [mailto:i...@1t3xt.info]
>Sent: Tuesday, November 22, 2011 12:00 PM
>To: Post all your questions about iText here
>Subject: Re: [iText-questions] - Extracting multiple PDF docs from a
>consolidated PDF "container" doc...
>
>On 22/11/2011 18:35, Scott Harris wrote:
>> I need to burst documents not by page, but at arbitrary points in the
>> content marked by particular delimiters denoting the start and end of
>> each original PDF document.
>What do these delimiters look like? Are they extra entries in the page
>dictionary? Are they annotations added to the page? Or do you mean you
>have to parse the content stream of all pages because the delimiters are
>part of the text?
>
>--------------------------------------------------------------------------
>--
>--
>All the data continuously generated in your IT infrastructure
>contains a definitive record of customers, application performance,
>security threats, fraudulent activity, and more. Splunk takes this
>data and makes sense of it. IT sense. And common sense.
>http://p.sf.net/sfu/splunk-novd2d
>_______________________________________________
>iText-questions mailing list
>iText-questions@lists.sourceforge.net
>https://lists.sourceforge.net/lists/listinfo/itext-questions
>
>iText(R) is a registered trademark of 1T3XT BVBA.
>Many questions posted to this list can (and will) be answered with a
>reference to the iText book: http://www.itextpdf.com/book/
>Please check the keywords list before you ask for examples:
>http://itextpdf.com/themes/keywords.php
>
>
>--------------------------------------------------------------------------
>----
>All the data continuously generated in your IT infrastructure
>contains a definitive record of customers, application performance,
>security threats, fraudulent activity, and more. Splunk takes this
>data and makes sense of it. IT sense. And common sense.
>http://p.sf.net/sfu/splunk-novd2d
>_______________________________________________
>iText-questions mailing list
>iText-questions@lists.sourceforge.net
>https://lists.sourceforge.net/lists/listinfo/itext-questions
>
>iText(R) is a registered trademark of 1T3XT BVBA.
>Many questions posted to this list can (and will) be answered with a
>reference to the iText book: http://www.itextpdf.com/book/
>Please check the keywords list before you ask for examples:
>http://itextpdf.com/themes/keywords.php


------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure 
contains a definitive record of customers, application performance, 
security threats, fraudulent activity, and more. Splunk takes this 
data and makes sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-novd2d
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions

iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference 
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: 
http://itextpdf.com/themes/keywords.php

Reply via email to