Hi Andi, Thank you for ur timely help. following are the libraries available in java.
Word & Excel Documents - Jakarta POI http://jakarta.apache.org/poi/ PDF Documents - http://pdfbox.org/ With Regards, Chandrashekhar ----- Original Message ----- From: "Andi Vajda" <[EMAIL PROTECTED]> To: <[email protected]> Sent: Monday, January 24, 2005 11:53 PM Subject: [MAY BE SPAM] - Re: [pylucene-dev] 3rd party lib - Email found in subject > > > I am new to py-lucene but have worked on java lucene 1.4.3. > > How I can index following types of files by using py-lucene? > > [word files, pdf , excel, xsl, xml, open office files] > > is there any support of 3rd party lib in py-lucene also? > > If there are python equivalent plain-text filters for these types of files, > use them. If not, you can try to integrate the java version into your build of > PyLucene. Take a look at how the sandbox Highlighter package was integrated as > an example. If the licensing of these third party tools allows it, and if > there is enough interest, I could integrate them into PyLucene too. > > Can you send me URLs to these Java packages ? > > Andi.. > _______________________________________________ > pylucene-dev mailing list > [email protected] > http://lists.osafoundation.org/mailman/listinfo/pylucene-dev _______________________________________________ pylucene-dev mailing list [email protected] http://lists.osafoundation.org/mailman/listinfo/pylucene-dev
