Check out:

http://www.foolabs.com/xpdf (yes, this is a real website)

and 

http://www.opengroup.org/inforsrv/PDF/xpdf

These tools even decrypt encrypted PDFs! Way too cool. I am working on
integrating them into my company's website, which already uses the Lucene
search engine.

My approach will be: in the IndexFiles class, when a file has a PDF
extension, run the converter on it first, then index the resulting text
file, but store it in the index under the original PDF file name.
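
A rough sketch of the conversion step, in case it helps anyone trying the
same thing. This is not the actual IndexFiles code: the class and method
names below are made up for illustration, and it assumes the xpdf
command-line converter "pdftotext" is on the PATH and accepts "-enc UTF-8".
The Lucene side is left as a comment because the exact calls depend on the
Lucene version in use.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of Lucene or xpdf.
final class PdfToText {

    // True when the file name ends in ".pdf", ignoring case.
    static boolean isPdf(String fileName) {
        return fileName.toLowerCase(Locale.ROOT).endsWith(".pdf");
    }

    // Command line for the converter. Assumes "pdftotext" is on the PATH.
    static List<String> buildCommand(Path pdf, Path txt) {
        return Arrays.asList("pdftotext", "-enc", "UTF-8",
                pdf.toString(), txt.toString());
    }

    // Runs the converter into a temporary text file and returns its contents.
    static String extractText(Path pdf) throws IOException, InterruptedException {
        Path txt = Files.createTempFile("pdf-index-", ".txt");
        try {
            Process process = new ProcessBuilder(buildCommand(pdf, txt))
                    .inheritIO() // converter messages go to our own stdout/stderr
                    .start();
            int exitCode = process.waitFor();
            if (exitCode != 0) {
                throw new IOException("pdftotext exited with code " + exitCode
                        + " for " + pdf);
            }
            return new String(Files.readAllBytes(txt), StandardCharsets.UTF_8);
        } finally {
            Files.deleteIfExists(txt); // the temporary text file is not kept
        }
    }
}
```

The indexing loop would then check isPdf(file name), call extractText on the
PDF, and hand the returned text to Lucene as the document contents while
recording the PDF's own path as the document name, so search results point
at the PDF rather than at the temporary text file.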



_______________________________________________
Lucene-dev mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/lucene-dev