Hi Shukla, Lucene indexes just "text" files. Therefore conversion of a pdf document(or word,excel,image etc.) to text is not related with Lucene. Before indexing, you should convert them to text.
IFilter provides just a standard approach for this kind of conversions. Below link may be helpful for you http://www.codeproject.com/csharp/IFilter.asp DIGY -----Original Message----- From: shukla dhaval v (JIRA) [mailto:[EMAIL PROTECTED] Sent: Monday, June 25, 2007 3:49 PM To: [email protected] Subject: [jira] Created: (LUCENENET-44) Indexing of some pdf files doesnt give desired result in ver 1.9.0.5 but works fine in ver 1.3.3.1 Indexing of some pdf files doesnt give desired result in ver 1.9.0.5 but works fine in ver 1.3.3.1 -------------------------------------------------------------------------------------------------- Key: LUCENENET-44 URL: https://issues.apache.org/jira/browse/LUCENENET-44 Project: Lucene.Net Issue Type: Bug Environment: .NET, Windows XP,lucene.net ver1.9.0.5 Reporter: shukla dhaval v Dear Sir, We are using lucene.net ver. 1.9.0.5 for content searching. The problem we are facing is with indexing of .pdf files. We have installed the ifilters for pdf files. There are certain pdf files which give result with the older version of lucene.net 1.3.3.1 but not with the current one. Please advise how to solve this issue. Thank you Dhaval Shukla Programmer Sansun Software Pvt Ltd Product Development Division of: Easy Data Access 5988 Mid Rivers Mall Drive St. Charles, MO 63304 www.edausa.com -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
