[ https://issues.apache.org/jira/browse/PDFBOX-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ravi updated PDFBOX-3727:
-------------------------

Hi,
I can't upload the document as it's a bit confidential, but I can email it to an individual. Please let me know.
Thx

From: Tilman Hausherr (JIRA) <j...@apache.org>
To: ravi_d...@yahoo.com
Sent: Tuesday, March 21, 2017 1:00 AM
Subject: [jira] [Commented] (PDFBOX-3727) "premature EOF, image will be incomplete"

[ https://issues.apache.org/jira/browse/PDFBOX-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15934092#comment-15934092 ]

Tilman Hausherr commented on PDFBOX-3727:
-----------------------------------------

Please attach the PDF, to verify that the images are really too short.

--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

> "premature EOF, image will be incomplete"
> -----------------------------------------
>
>                 Key: PDFBOX-3727
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3727
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing, Text extraction
>    Affects Versions: 2.0.4, 2.0.5
>         Environment: Windows 10/X64
>            Reporter: Ravi
>
> I am trying to extract all the embedded images from a PDF file. Sometimes
> extracting an image logs the warning below.
> {code}
> [main] WARN o.a.p.p.g.image.SampledImageReader - premature EOF, image will be incomplete
> {code}
> The affected extracted images are only half complete (half greyed out).
> I would like to know whether a solution is available for this. My code
> snippet is below.
> Any help is greatly appreciated.
> {code}
> import java.io.File;
> import javax.imageio.ImageIO;
> import org.apache.pdfbox.cos.COSName;
> import org.apache.pdfbox.pdmodel.PDDocument;
> import org.apache.pdfbox.pdmodel.PDPage;
> import org.apache.pdfbox.pdmodel.PDResources;
> import org.apache.pdfbox.pdmodel.graphics.PDXObject;
> import org.apache.pdfbox.pdmodel.graphics.image.PDImageXObject;
>
> public static void testPDFBoxExtractImages() throws Exception {
>     // try-with-resources closes the document even if extraction fails
>     try (PDDocument document = PDDocument.load(new File(fileName))) {
>         for (PDPage page : document.getPages()) {
>             PDResources pdResources = page.getResources();
>             System.out.println(page.getRotation());
>             if (pdResources == null) {
>                 continue; // page has no resource dictionary
>             }
>             for (COSName c : pdResources.getXObjectNames()) {
>                 PDXObject o = pdResources.getXObject(c);
>                 if (o instanceof PDImageXObject) {
>                     // write each image XObject to its own PNG file
>                     File file = new File("C:/temp/" + System.nanoTime() + ".png");
>                     ImageIO.write(((PDImageXObject) o).getImage(), "png", file);
>                 }
>             }
>         }
>     }
> }
> {code}
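
One way to check whether an image is "really too short" without sharing the confidential PDF is to compare the number of decoded bytes in the image stream with the number of bytes its dimensions call for. The sketch below is only an illustration, not the PDFBox implementation: the class `ImageSampleCheck` and both of its methods are hypothetical names. It assumes the image stream decodes to raw sample data (for example a Flate-encoded image, not JPEG) and that each row of samples is padded to a whole byte. The inputs would come from `PDImageXObject` (`getWidth()`, `getHeight()`, `getBitsPerComponent()`, `getColorSpace().getNumberOfComponents()`), and the actual length from counting the bytes read from `createInputStream()`.

```java
// Hypothetical helper for illustration only; not part of PDFBox.
final class ImageSampleCheck {

    private ImageSampleCheck() {
    }

    // Bytes of raw sample data expected for an image, assuming every row
    // is padded up to a whole number of bytes.
    static long expectedByteCount(int width, int height,
                                  int bitsPerComponent, int numComponents) {
        long bitsPerRow = (long) width * bitsPerComponent * numComponents;
        long bytesPerRow = (bitsPerRow + 7) / 8; // round up to a byte boundary
        return bytesPerRow * height;
    }

    // True when the decoded stream holds fewer bytes than the image
    // dimensions require, i.e. the sample data ends early.
    static boolean isTruncated(long actualDecodedBytes, int width, int height,
                               int bitsPerComponent, int numComponents) {
        return actualDecodedBytes
                < expectedByteCount(width, height, bitsPerComponent, numComponents);
    }

    public static void main(String[] args) {
        // Example values: a 100x50 image with three 8-bit components,
        // whose decoded stream contains only 7500 bytes.
        System.out.println(expectedByteCount(100, 50, 8, 3));
        System.out.println(isTruncated(7500, 100, 50, 8, 3));
    }
}
```

If `isTruncated` reports true for the affected images, the streams in the file itself are short, and the half-grey output reflects missing data in the PDF, not an extraction error.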