Have a look at the class PDFText2HTML. The description in the comments is:

"Wrap stripped text in simple HTML, trying to form HTML paragraphs.
Paragraphs broken by pages, columns, or figures are not mended."
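To give a rough idea of what "wrapping stripped text in simple HTML" means, here is a small sketch in plain Java. It is not PDFBox's code and does not call PDFBox at all: the class name ParagraphWrapper and the method toHtml are made up for illustration, and it assumes that a blank line in the extracted text marks a paragraph break, which may not match how PDFText2HTML actually decides.

```java
// Hypothetical illustration only; not part of PDFBox.
class ParagraphWrapper {

    // Escape the characters that are special in HTML text content.
    static String escape(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }

    // Wrap plain text in a minimal HTML document.
    // Assumption: blocks separated by a blank line are paragraphs.
    static String toHtml(String text) {
        StringBuilder html = new StringBuilder("<html><body>\n");
        for (String block : text.split("\\R\\s*\\R")) {
            // Join the lines of one block into a single run of text.
            String para = block.trim().replaceAll("\\s+", " ");
            if (!para.isEmpty()) {
                html.append("<p>").append(escape(para)).append("</p>\n");
            }
        }
        return html.append("</body></html>\n").toString();
    }

    public static void main(String[] args) {
        System.out.print(toHtml("First line\nsame paragraph.\n\nSecond <paragraph> & more."));
    }
}
```

A paragraph that got split by a page or column break would come out as two separate <p> elements here, which is the same limitation the class comment mentions.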
As far as PDF -> fully formatted HTML goes ... I don't think so.

Daniel Wilson

2009/4/24 César Fernando Henriques <[email protected]>

> Hi guys, anyone knows if there are some Java code that use pdfbox that
> convert pdf files to html?
>
> thanks!
