On Apr 1, 2007, at 5:37 AM, Andreas Korth wrote:

> Are you serious? You're adding raw, unprocessed PPT files to your
> index?
>
> Now this is just wrong. PPT files may contain all sorts of binary
> data, such as images and videos. I just had a look at the sample
> presentation that came with my Office installation. This file is
> 3.5MB in size with a (plain text) payload of less than 1KB.

As I stated in my previous email, I am conjecturing that indexing these documents will not affect search performance. Do you disagree?

> I'm sure there's some tool available which converts PPT to plain text
> and I strongly recommend you go out and find it.

I've searched far and wide and have found none.

john
_______________________________________________
Ferret-talk mailing list
[email protected]
http://rubyforge.org/mailman/listinfo/ferret-talk

