Daniel,

I have been playing with PDFText2HTML I will have to add a lot of code to
meet my requirements, but I don't think there is any other option at this
time. I have been searching in internet for a while, the only think I found
that works well was a C/C++ program called pdf2html.


thanks!



On Fri, Apr 24, 2009 at 1:37 PM, Daniel Wilson <
[email protected]> wrote:

> have a look at the class PDFText2HTML
> The description in the comments is:
> " * Wrap stripped text in simple HTML, trying to form HTML paragraphs.
> Paragraphs
>  * broken by pages, columns, or figures are not mended."
>
> As far as PDF -> fully formatted HTML ... I don't think so.
>
> Daniel Wilson
>
> 2009/4/24 César Fernando Henriques <[email protected]>
>
>> Hi guys, anyone knows if there are some Java code that use pdfbox that
>> convert pdf files to html?
>>
>>
>> thanks!
>>
>
>

Reply via email to