Re: Extracting images from a PDF file

Carl K Wed, 26 Dec 2007 22:16:43 -0800

Doug Farrell wrote:
> Hi all,
> 
> Does anyone know how to extract images from a PDF file? What I'm looking
> to do is use pdflib_py to open large PDF files on our Linux servers,
> then use PIL to verify image data. I want to do this in order
> to find corrupt images in the PDF files. If anyone could help
> me out, or point me in the right direction, it would be most
> appreciated!
>


If you are ok shelling out to a binary:

pdfimages  -  Portable  Document  Format (PDF) image extractor (version
        3.00)
http://packages.ubuntu.com/gutsy/text/xpdf-utils

I am trying to convert the pdf to a png, but without having to run external 
commands.  so I will understand if you arn't happy with pdfimages.

Carl K
-- 
http://mail.python.org/mailman/listinfo/python-list

Re: Extracting images from a PDF file

Reply via email to