> -----Original Message-----
> From: Bastien Koert [mailto:[EMAIL PROTECTED]
> Sent: Monday, September 22, 2008 4:00 PM
> To: [EMAIL PROTECTED]
> Cc: php-general
> Subject: Re: [PHP] Reading a Word document from PHP
> 
> On Mon, Sep 22, 2008 at 4:58 PM, Ashley Sheridan
> <[EMAIL PROTECTED]>wrote:
> 
> > On Mon, 2008-09-22 at 16:47 -0400, Bastien Koert wrote:
> > >
> > >
> > > On Mon, Sep 22, 2008 at 3:56 PM, Ashley Sheridan
> > > <[EMAIL PROTECTED]> wrote:
> > >         Hi All,
> > >
> > >         I recently asked a question regarding reading a PDF with
> PHP.
> > >         I've tried
> > >         Zend_pdf, but all this is able to give me is the number of
> > >         pages in a
> > >         PDF, and cannot extract the text from the PDF files I
have.
> I
> > >         thought
> > >         I'd try a different method, and try to extract the text
> > >         straight from
> > >         the Word document which is used to generate the PDF. Does
> > >         anyone have
> > >         any experience with this sort of thing, or enough to
> suggest a
> > >         library
> > >         which is capable of this?
> > >
> > >
> > >         Ash
> > >         www.ashleysheridan.co.uk
> > >
> > >
> > >         --
> > >         PHP General Mailing List (http://www.php.net/)
> > >         To unsubscribe, visit: http://www.php.net/unsub.php
> > >
> > >
> > >
> > > This may help
> > >
> > > http://drewd.com/2007/01/25/reading-from-a-word-document-with-com-
> in-php
> > >
> > > http://www.phpclasses.org/browse/package/388.html
> > >
> > >
> >
http://www.developertutorials.com/blog/php/extracting-text-from-word-
> documents-via-php-and-com-81/
> > >
> > >
> > > --
> > >
> > > Bastien
> > >
> > > Cat, the other other white meat
> > >
> > Unfortunately all of those use COM, which is only available on
> > Windows... I'm guessing this isn't possible on a proper OS?
> >
> >
> > Ash
> > www.ashleysheridan.co.uk
> >
> >
> Sadly, no...the closest you could come to, would be to try and see if
> you
> can manipulate OpenOffice via exec or something like that to read the
> document and do what you need to

There's a Python script described in this article:

http://www.gsdesign.ro/blog/php-convert-microsoft-word-doc-to-pdf/

...that sounds like it will do what you want. Sucks that it's using
Python, but at least it's a technology that isn't hard to put on a Linux
machine. Other than that, I would recommend Mono, perhaps, and use a
.NET DLL (or similar construct).

HTH,


Todd Boyd
Web Programmer




--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php

Reply via email to