Hi Andi,
Thank you for ur timely help.
following are the libraries available in java.

Word & Excel Documents - Jakarta POI
http://jakarta.apache.org/poi/





PDF Documents -

http://pdfbox.org/







With Regards,
Chandrashekhar
----- Original Message ----- 
From: "Andi Vajda" <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Monday, January 24, 2005 11:53 PM
Subject: [MAY BE SPAM] - Re: [pylucene-dev] 3rd party lib - Email found in
subject


>
> > I am new to py-lucene but have worked on java lucene 1.4.3.
> > How I can index following types of files by using py-lucene?
> > [word files, pdf , excel, xsl, xml, open office files]
> > is there any support of 3rd party lib in py-lucene also?
>
> If there are python equivalent plain-text filters for these types of
files,
> use them. If not, you can try to integrate the java version into your
build of
> PyLucene. Take a look at how the sandbox Highlighter package was
integrated as
> an example. If the licensing of these third party tools allows it, and if
> there is enough interest, I could integrate them into PyLucene too.
>
> Can you send me URLs to these Java packages ?
>
> Andi..
> _______________________________________________
> pylucene-dev mailing list
> [email protected]
> http://lists.osafoundation.org/mailman/listinfo/pylucene-dev

_______________________________________________
pylucene-dev mailing list
[email protected]
http://lists.osafoundation.org/mailman/listinfo/pylucene-dev

Reply via email to