I've heard of that software, but haven't tried it.

My use of PDFBox is in rendering PDF's as images.  So I've done nothing with
the HTML-related functions.

Daniel Wilson
http://BlackLocustSoftware.com

2009/4/24 César Fernando Henriques <[email protected]>

> Daniel,
>
> I have been playing with PDFText2HTML I will have to add a lot of code to
> meet my requirements, but I don't think there is any other option at this
> time. I have been searching in internet for a while, the only think I found
> that works well was a C/C++ program called pdf2html.
>
>
> thanks!
>
>
>
>
> On Fri, Apr 24, 2009 at 1:37 PM, Daniel Wilson <
> [email protected]> wrote:
>
>> have a look at the class PDFText2HTML
>> The description in the comments is:
>> " * Wrap stripped text in simple HTML, trying to form HTML paragraphs.
>> Paragraphs
>>  * broken by pages, columns, or figures are not mended."
>>
>> As far as PDF -> fully formatted HTML ... I don't think so.
>>
>> Daniel Wilson
>>
>> 2009/4/24 César Fernando Henriques <[email protected]>
>>
>>> Hi guys, anyone knows if there are some Java code that use pdfbox that
>>> convert pdf files to html?
>>>
>>>
>>> thanks!
>>>
>>
>>
>

Reply via email to