have a look at the class PDFText2HTML
The description in the comments is:
" * Wrap stripped text in simple HTML, trying to form HTML paragraphs.
Paragraphs
 * broken by pages, columns, or figures are not mended."

As far as PDF -> fully formatted HTML ... I don't think so.

Daniel Wilson

2009/4/24 César Fernando Henriques <[email protected]>

> Hi guys, anyone knows if there are some Java code that use pdfbox that
> convert pdf files to html?
>
>
> thanks!
>

Reply via email to