On Tue, 20 Sep 2005 02:34 pm, Jeremias Maerki wrote:
<snip/>
>
> But the underlying PDF library looks quite interesting (PDFBox). Has
> anyone had any experiences with it? If yes, we should add it to our
> PDF post-processors list on the website, if just because it has a
> better license than iText.

I am using it in a project which provides on-line searchable PDF files 
(Government Acts and Regulations) indexed using Lucene and in that 
context it works fine, that is as a backend activity PDFBox extracts 
the text components from the PDF for Lucene to index and as an on-line 
activity once matching documents are found the PDF is searched again 
using PDFBox to find the search terms in the PDF file so an Acrobat 
highlight XML file can be constructed.

<snip/>
>
> Jeremias Maerki

Manuel

Reply via email to