John Bachir wrote: > On Apr 1, 2007, at 12:47 PM, Florian Gilcher wrote: >> Catdoc and Antiword for example. Simple shell commands to extract text >> from files. >> >> http://www.45.free.net/~vitus/software/catdoc/ >> http://www.winfield.demon.nl/ > >> If you use them, I would be interested in >> feedback on how well it works. > > I am now using catdoc, catppt, and xls2csv to index all of my > documents, and it is working well. > > The content out of catppt seems to be rather incomplete, but is Good > Enough for our purposes.
If you were going to be happy with the plain contents being indexed, I'd suggest just running the powerpoint document through strings before indexing it. I don't know if catppt does more or less than that, but it'd be useful to compare. -- Alex _______________________________________________ Ferret-talk mailing list [email protected] http://rubyforge.org/mailman/listinfo/ferret-talk

