Did you check POI javadocs? Look for org.apache.poi.hslf.extractor.PowerPointExtractor. It's one of the most straightforward classes from POI as far extracting text for indexing is concerned.
-Gopi On 9/7/06, Venkateshprasanna <[EMAIL PROTECTED]> wrote:
Is there any filter available for extracting text from MS Powerpoint files and indexing them? The lucene website suggests the POI project, which, it seems does not support PPT files as of now. Regards, Venkateshprasanna -- View this message in context: http://www.nabble.com/which-way-to-index-pdf%2Cword%2Cexcel-tf2224468.html#a6185039 Sent from the Lucene - Java Users forum at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]