Thanks, I have the book and looked through chapter 15. However, in section 15.3.2, which covers PdfContentStreamProcessor, the code samples only extract raw text and then ordered raw text. The chapter mentions extracting fonts but doesn't say which class will help me do that. The closest I see is listContentStream, but its output is verbose and complex, and I'm not sure how to tell what is text and what is font. If listContentStream is what I should be using, so be it, and I will keep trying to understand the structure. But I am hoping there is a better way.
Thanks for your help,
Michael

-----Original Message-----
From: 1T3XT BVBA
Sent: Monday, June 27, 2011 7:59 AM
To: Post all your questions about iText here
Subject: Re: [iText-questions] How to extract title / heading from document contents

On 26/06/2011 20:29, Michael O'Donovan wrote:
> Any ideas how I go about doing that?

With the functionality that can be found in the com.itextpdf.text.pdf.parser package. If you need the documentation, it's in chapter 15 of the book.

_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions

iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: http://itextpdf.com/themes/keywords.php
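
One possible way to approach the question above, offered as a sketch and not as the list's recommended answer: the parser package lets you register a RenderListener whose renderText(TextRenderInfo) callback is invoked for each run of text, which avoids reading through listContentStream output. Assuming you can capture a font name and an approximate font size for each run in that callback (how to obtain them depends on your iText version and is not shown here), picking heading candidates can be as simple as keeping the runs set in the largest size. The Chunk class and the pickHeadings method are hypothetical names used only for illustration; they are not part of iText.

```java
import java.util.ArrayList;
import java.util.List;

public class HeadingPicker {

    /** Hypothetical holder for one run of text plus the font data captured with it. */
    public static final class Chunk {
        final String text;
        final String fontName;
        final float fontSize;

        public Chunk(String text, String fontName, float fontSize) {
            this.text = text;
            this.fontName = fontName;
            this.fontSize = fontSize;
        }
    }

    /** Returns the text of every chunk set in the largest font size seen. */
    public static List<String> pickHeadings(List<Chunk> chunks) {
        // First pass: find the largest font size among all chunks.
        float max = 0f;
        for (Chunk c : chunks) {
            max = Math.max(max, c.fontSize);
        }
        // Second pass: keep the chunks that use that size, in document order.
        List<String> headings = new ArrayList<>();
        for (Chunk c : chunks) {
            if (c.fontSize == max) {
                headings.add(c.text);
            }
        }
        return headings;
    }

    public static void main(String[] args) {
        // Made-up sample data; in practice a render listener would fill this list.
        List<Chunk> chunks = new ArrayList<>();
        chunks.add(new Chunk("Annual Report", "Helvetica-Bold", 24f));
        chunks.add(new Chunk("This year we saw steady growth.", "Helvetica", 11f));
        chunks.add(new Chunk("Page 1", "Helvetica", 8f));
        System.out.println(pickHeadings(chunks));
    }
}
```

Largest-size-wins is only a heuristic. Documents that set headings in bold at body size, or that use a large font for decorative text, would need extra rules such as checking the font name or the position on the page.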
