Re: Question

Ben Martz Wed, 06 Jan 2010 11:51:22 -0800

Todd,

I would definitely take Michael's advice to learn more about theoverall issue before you get too far.

A quick answer that may help is Windows does not ship with an iFilterfor PDF built-in. Installing Adobe Reader 8 or higher will install adecent PDF iFilter.

I am a little surprised by your question though - I assume that youhave access to your own source code and could examine the result fromthe iFilter that's being fed to the IndexWriter and compare thebehavior in the TXT case with the behavior in the PDF case?


Cheers,
Ben

Sent from my iPhone

On Jan 6, 2010, at 10:13, Michael Garski <[email protected]>wrote:

Todd,
You'll need some way to extract the text from the PDF prior toindexing. I'm not familiar with any packages that can do that but Ihave heard of them. You may want to try searching the mailing listto see if there has been mention of one previously. LucidImagination hosts a great mailing list search tool at http://www.lucidimagination.com/search/
Michael

-----Original Message-----
From: Todd McIndoo [mailto:[email protected]]
Sent: Wednesday, January 06, 2010 10:11 AM
To: [email protected]
Subject: Question

Sorry if this is duplicate
We are using Lucene.net of version 2.0.0.4. I am trying to search adocumentwhich contains lots of PDFs. I want to search a document, whichcontains aspecific word, using Lucene.net. We are yielding results in textdocumentsbut not in PDF. Is there something we have to do to be able tosearch in PDF
Documents. All ifilters have been installed on the computer so I donot
think that is the issue.



Regards,

SPEEDY SOLUTIONS



Todd McIndoo

Re: Question

Reply via email to