subject:"Indexing PDF documents with structure information"

Re: Indexing PDF documents with structure information

2007-08-14 Thread Mathieu Lecarme

Thomas Arni a écrit : > Hello Luceners > > I have started a new project and need to index pdf documents. > There are several projects around, which allow to extract the content, > like pdfbox, xpdf and pjclassic. > > As far as I studied the FAQ's and examples, all these > tools allow simple text ex

Indexing PDF documents with structure information

2007-08-13 Thread Thomas Arni

Hello Luceners I have started a new project and need to index pdf documents. There are several projects around, which allow to extract the content, like pdfbox, xpdf and pjclassic. As far as I studied the FAQ's and examples, all these tools allow simple text extraction. Which of these open sour