identifying paragraphs in PDF documents

Michael Smolyak Tue, 06 Oct 2009 13:02:01 -0700

Hello,

I am new to PDFBox, so I apologize ahead of time if this is not an appropriate 
forum for this sort of question.


I have a requirement to extract text from PDF documents, breaking it into 
paragraphs. The examples of text extraction I saw did not make it clear whether 
this is possible. HTML extraction identifies lines and pages but not paragraphs.

Is it possible to extract text from PDF documents one paragraph at a time? If 
so, could you supply a code sample?

Thank you,

Michael
