Hello, I am new to PDFBox, so I apologize ahead of time if this is not an appropriate forum for this sort of questions.
I have a requirement to extract text from PDF documents breaking it into
paragraphs. The examples if text extraction I saw did not make it clear whether
this is possible. HTML extraction identifies lines and pages but not paragraphs.
Is it possible to extract text from PDF documents one paragraph ata a time? If
so could you supply a code sample?
Thank you,
Michael
