I've heard of that software, but haven't tried it. I use PDFBox for rendering PDFs as images, so I've done nothing with the HTML-related functions.
Daniel Wilson
http://BlackLocustSoftware.com

2009/4/24 César Fernando Henriques <[email protected]>

> Daniel,
>
> I have been playing with PDFText2HTML. I will have to add a lot of code to
> meet my requirements, but I don't think there is any other option at this
> time. I have been searching the internet for a while; the only thing I found
> that works well was a C/C++ program called pdf2html.
>
> thanks!
>
> On Fri, Apr 24, 2009 at 1:37 PM, Daniel Wilson <[email protected]> wrote:
>
>> have a look at the class PDFText2HTML
>> The description in the comments is:
>> " * Wrap stripped text in simple HTML, trying to form HTML paragraphs.
>> Paragraphs
>> * broken by pages, columns, or figures are not mended."
>>
>> As far as PDF -> fully formatted HTML ... I don't think so.
>>
>> Daniel Wilson
>>
>> 2009/4/24 César Fernando Henriques <[email protected]>
>>
>>> Hi guys, does anyone know if there is some Java code that uses pdfbox to
>>> convert pdf files to html?
>>>
>>> thanks!
