Hi,
I have a directory with some 350 Microsoft Powerpoint documents. I have
been charged with the task of extracting the raw text from these
documents into a single raw text file. Each document consists of 3-5
slides, each containing up to three text boxes. Some documents have
bitmapped background images.
Having tried various combinations of 'strings' and 'sed', I have
concluded that the text cannot be reliably extracted without some more
intelligent parsing of the PPT format. OO obviously performs this
parsing since all the PPT files open flawlessly in OpenOffice.org Impress.
Is there any way I can, using OpenOffice.org, create a macro to extract
the text from all of these files? There must be something better than
1500 copy/paste operations!
thanks,
Greg
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]