John Bachir wrote:
> On Apr 1, 2007, at 12:47 PM, Florian Gilcher wrote:
>> Catdoc and Antiword for example. Simple shell commands to extract text
>> from files.
>>
>> http://www.45.free.net/~vitus/software/catdoc/
>> http://www.winfield.demon.nl/
> 
>>  If you use them, I would be interested in
>> feedback on how well it works.
> 
> I am now using catdoc, catppt, and xls2csv to index all of my  
> documents, and it is working well.
> 
> The content out of catppt seems to be rather incomplete, but is Good  
> Enough for our purposes.

If you are happy with only the plain text being indexed, I'd suggest
simply running the PowerPoint document through strings(1) before
indexing it.  I don't know whether catppt does more or less than that,
but it would be useful to compare the two outputs.
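
For what it's worth, here is a rough sketch of how such a comparison could
be done. It assumes catppt and GNU strings are on the PATH, and that much of
the slide text is stored as 16-bit little-endian characters (hence the
"-e l" pass in addition to the default one). The helper names run_tool,
words, compare and main are made up for illustration, and the word-set
comparison is only a crude measure of coverage, not how Ferret or catppt
work internally.

```python
import re
import shutil
import subprocess
import sys


def run_tool(argv):
    """Run an extraction command; return its stdout as text, or None if the tool is not installed."""
    if shutil.which(argv[0]) is None:
        return None
    result = subprocess.run(argv, capture_output=True, check=False)
    return result.stdout.decode("utf-8", errors="replace")


def words(text):
    """Lower-cased set of alphabetic tokens of three or more letters."""
    return set(re.findall(r"[a-z]{3,}", text.lower()))


def compare(text_a, text_b):
    """Return (only_in_a, only_in_b, shared) word sets for two extractions."""
    a, b = words(text_a), words(text_b)
    return a - b, b - a, a & b


def main(path):
    # Assumption: catppt writes the extracted text to stdout.
    ppt_text = run_tool(["catppt", path])
    # Assumption: GNU strings; the default pass finds 8-bit runs,
    # "-e l" finds 16-bit little-endian runs.
    narrow = run_tool(["strings", path])
    wide = run_tool(["strings", "-e", "l", path])
    if ppt_text is None or narrow is None or wide is None:
        print("catppt or strings not found on PATH")
        return
    only_catppt, only_strings, shared = compare(ppt_text, narrow + "\n" + wide)
    print("shared words:      ", len(shared))
    print("only from catppt:  ", len(only_catppt))
    print("only from strings: ", len(only_strings))


if __name__ == "__main__" and len(sys.argv) == 2:
    main(sys.argv[1])
```

Saved under a name of your choosing and run with a single .ppt path as its
argument, it prints three counts. A large "only from strings" count would
suggest catppt is dropping text, although strings will also pick up a fair
amount of binary noise that you may not want in the index.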

-- 
Alex
_______________________________________________
Ferret-talk mailing list
[email protected]
http://rubyforge.org/mailman/listinfo/ferret-talk
