Hello everybody,
I'm trying to extract images from a PDF file, but it doesn't work... :-(
I tried ExtractImages.exe, which results in:
>ExtractImages.exe "C:\path\to\pdf_file"
Exception in thread "main" java.lang.NullPointerException
at org.pdfbox.ExtractImages.extractImages(ExtractImages.java:138)
at org.pdfbox.ExtractImages.main(ExtractImages.java:72)
Then I tried to extract the images using code I copied from the
ExtractImages class. Here's a snippet:
PDXObjectImage image = (PDXObjectImage) images.get(key);
String name = getUniqueFileName(key, image.getSuffix());
image.write2file(name);
The execution of the last line results in:
java.util.zip.ZipException: unknown compression method
at java.util.zip.InflaterInputStream.read(InflaterInputStream.java:140)
at org.pdfbox.filter.FlateFilter.decode(FlateFilter.java:110)
at org.pdfbox.cos.COSStream.doDecode(COSStream.java:290)
at org.pdfbox.cos.COSStream.doDecode(COSStream.java:235)
at org.pdfbox.cos.COSStream.getUnfilteredStream(COSStream.java:170)
at org.pdfbox.pdmodel.common.PDStream.createInputStream(PDStream.java:226)
at org.pdfbox.pdmodel.common.PDStream.getByteArray(PDStream.java:481)
at org.pdfbox.pdmodel.graphics.xobject.PDPixelMap.getRGBImage(PDPixelMap.java:138)
at org.pdfbox.pdmodel.graphics.xobject.PDPixelMap.write2OutputStream(PDPixelMap.java:166)
at org.pdfbox.pdmodel.graphics.xobject.PDXObjectImage.write2file(PDXObjectImage.java:118)
at de.thecode.pdf.pdfbox.ExtractImages.extractImages(ExtractImages.java:52)
at de.thecode.pdf.pdfbox.ExtractImages.main(ExtractImages.java:30)
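For context, the ZipException at the top of this trace is raised by java.util.zip itself: InflaterInputStream throws it when the bytes it is handed are not a valid zlib stream. The following is only a minimal JDK-only sketch of that failure mode, not PDFBox code. The class name FlateProbe and the methods canInflate and deflate are hypothetical helpers made up for illustration, and the "garbage" bytes are an assumption standing in for whatever the image stream in the PDF actually contains.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.zip.DeflaterOutputStream;
import java.util.zip.InflaterInputStream;
import java.util.zip.ZipException;

// Hypothetical helper class, not part of PDFBox.
public class FlateProbe {

    // Returns true if the bytes can be read to the end as a zlib stream,
    // false if java.util.zip rejects them with a ZipException.
    public static boolean canInflate(byte[] data) throws IOException {
        try (InputStream in = new InflaterInputStream(new ByteArrayInputStream(data))) {
            byte[] buffer = new byte[4096];
            while (in.read(buffer) != -1) {
                // Decoded bytes are discarded; only decodability matters here.
            }
            return true;
        } catch (ZipException e) {
            // Same exception type as in the trace above.
            return false;
        }
    }

    // Compresses bytes into a valid zlib stream, used only to build test input.
    public static byte[] deflate(byte[] raw) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (DeflaterOutputStream deflater = new DeflaterOutputStream(out)) {
            deflater.write(raw);
        }
        return out.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        byte[] valid = deflate("hello".getBytes("US-ASCII"));
        // Assumed stand-in for stream bytes that are not zlib data.
        byte[] garbage = {0x00, 0x01, 0x02, 0x03};

        System.out.println("valid zlib data inflates: " + canInflate(valid));
        System.out.println("garbage inflates: " + canInflate(garbage));
    }
}
```

If the same kind of check were applied to the raw bytes of the failing image stream, it might help tell apart a damaged or unusually encoded stream in the PDF from a problem in the extraction code.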
Does anybody know how to get the image extraction to work correctly?
Best regards,
Abid
--
Abid Hussain