Curtis Leach schrieb am 19.08.2010 um 12:00 (-0700): > Hi I'm going to be working with PDF files on a new project, and I'm > wondering if anyone can recommend a Perl module for working with PDF > documents. I'm using Perl 5.8.8 on a Windows platform and will > eventually be porting it to AIX Unix. I currently don't know much > about PDF. > > I was looking at the PDF module on CPAN and I see it hasn't been > updated since 2000. And one of the open bugs seems to imply that it > stopped working with Acrobat version 5.x and I'll probably be > accessing recently created PDF documents from various sources. So it > looks like I'll have to choose from one of the other PDF modules. > > I just need read only access to PDF documents to extract information > from them. Such as verifying it's actually a PDF document, # of pages > and searching for particular string values so that it can be indexed > before archived. Some of them may even be secured documents. > > Any recommendation would be appreciated.
I've used XSL-FO and Prince XML, but they're not readily available from Perl. Can't tell from proper experience, but I've seen a couple of people recommend CAM::PDF, which seems to be Pure Perl, so it might cover your cross-platform needs. I'd take a look at PDF::API2 as well. http://cpanratings.perl.org/dist/CAM-PDF http://cpanratings.perl.org/dist/PDF-API2 How can I do a full-text search of PDF files from Perl? http://stackoverflow.com/questions/139015/ -- Michael Ludwig _______________________________________________ ActivePerl mailing list [email protected] To unsubscribe: http://listserv.ActiveState.com/mailman/mysubs
