Doug Farrell wrote: > Hi all, > > Does anyone know how to extract images from a PDF file? What I'm looking > to do is use pdflib_py to open large PDF files on our Linux servers, > then use PIL to verify image data. I want to do this in order > to find corrupt images in the PDF files. If anyone could help > me out, or point me in the right direction, it would be most > appreciated! >
If you are ok shelling out to a binary: pdfimages - Portable Document Format (PDF) image extractor (version 3.00) http://packages.ubuntu.com/gutsy/text/xpdf-utils I am trying to convert the pdf to a png, but without having to run external commands. so I will understand if you arn't happy with pdfimages. Carl K -- http://mail.python.org/mailman/listinfo/python-list