Hi,

I have a directory with some 350 Microsoft PowerPoint documents. I have been tasked with extracting the raw text from these documents into a single plain-text file. Each document consists of 3-5 slides, each containing up to three text boxes. Some documents have bitmapped background images.

Having tried various combinations of 'strings' and 'sed', I have concluded that the text cannot be reliably extracted without more intelligent parsing of the PPT format. OpenOffice.org evidently performs this parsing, since all the PPT files open flawlessly in OpenOffice.org Impress.

Is there any way I can, using OpenOffice.org, create a macro to extract the text from all of these files? There must be something better than 1500 copy/paste operations!
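To make the goal concrete, here is a rough, untested sketch of the kind of batch job I have in mind, written against the Python-UNO bridge rather than as a Basic macro. Everything in it is an assumption on my part: that an office instance is already running and listening on a socket (port 2002 is an arbitrary choice), that the text boxes show up as shapes supporting the com.sun.star.drawing.Text service, and the function names extract_with_uno and collect_text are just names I made up.

```python
import glob
import os


def extract_with_uno(path, port=2002):
    """Return the text of every text shape in one presentation.

    Assumes an office instance is already listening on the given port,
    e.g. started with something like
        soffice "-accept=socket,host=localhost,port=2002;urp;"
    (the exact switch syntax may differ between versions).
    """
    # Imported lazily: the uno module normally exists only in the
    # Python interpreter that ships with the office suite.
    import uno
    from com.sun.star.beans import PropertyValue

    local_ctx = uno.getComponentContext()
    resolver = local_ctx.ServiceManager.createInstanceWithContext(
        "com.sun.star.bridge.UnoUrlResolver", local_ctx)
    ctx = resolver.resolve(
        "uno:socket,host=localhost,port=%d;urp;StarOffice.ComponentContext"
        % port)
    desktop = ctx.ServiceManager.createInstanceWithContext(
        "com.sun.star.frame.Desktop", ctx)

    # Load the document without opening a window.
    hidden = PropertyValue()
    hidden.Name = "Hidden"
    hidden.Value = True
    url = uno.systemPathToFileUrl(os.path.abspath(path))
    doc = desktop.loadComponentFromURL(url, "_blank", 0, (hidden,))
    try:
        texts = []
        pages = doc.getDrawPages()
        for i in range(pages.getCount()):
            page = pages.getByIndex(i)
            for j in range(page.getCount()):
                shape = page.getByIndex(j)
                # Skip pictures and other shapes that carry no text.
                if shape.supportsService("com.sun.star.drawing.Text"):
                    texts.append(shape.getString())
        return texts
    finally:
        doc.close(True)


def collect_text(directory, out_path, extract=extract_with_uno):
    """Write the text of every *.ppt file in directory to out_path.

    Returns the number of presentations processed. The extract argument
    is any callable mapping a file path to a list of strings.
    """
    paths = sorted(glob.glob(os.path.join(directory, "*.ppt")))
    with open(out_path, "w", encoding="utf-8") as out:
        for path in paths:
            out.write("=== %s ===\n" % os.path.basename(path))
            for text in extract(path):
                if text.strip():
                    out.write(text.strip() + "\n")
            out.write("\n")
    return len(paths)
```

The idea would be to call something like collect_text("slides", "all_text.txt") once, with "slides" standing in for my directory. I kept the per-file extraction separate from the loop so that a different extractor could be swapped in if the UNO approach turns out not to work.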

Thanks,
Greg
