[ https://issues.apache.org/jira/browse/PDFBOX-958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andreas Lehmkühler reopened PDFBOX-958:
---------------------------------------

> convertToImage mangles images which were in the PDF
> ---------------------------------------------------
>
>                 Key: PDFBOX-958
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-958
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 1.2.1, 1.4.0, 1.5.0
>         Environment: RHEL5 and WinXP, java version "1.6.0_23"
>            Reporter: Eric Schwarzenbach
>            Assignee: Andreas Lehmkühler
>            Priority: Critical
>             Fix For: 1.6.0
>
>         Attachments: Image of Page 13.jpeg, Image of Page 13.png,
>                      PDFBOX958-WrycanLoremIpsumTest.pdf
>
>
> Of the PDFs we've tried running through PDFBox to generate page images, a
> number of them (coming from disparate sources and methods of creation)
> produce page images in which an image that was embedded in the PDF page
> appears somewhat mangled. The embedded image seems to be divided into
> horizontal stripes, where some stripes look normal and others show some
> kind of "smearing" effect. See the attached images and the original PDF
> (the images are of page 13).
>
> I marked this as critical because we are trying to use PDFBox in a project
> where page images are crucial, and the inability to produce reasonable
> looking page images is pretty much a deal breaker.
>
> The code we use to extract the images looks more or less like the following:
>
>         BufferedImage image = page.convertToImage();
>         SmartDeferredFileOutputStream outStream = new SmartDeferredFileOutputStream();
>         String[] writerFormatNames = ImageIO.getWriterFormatNames();
>         ImageIO.write(image, "jpeg", outStream);
>         outStream.close();
>
> We've also tried specifying "png". In both the "jpeg" and "png" cases we get
> an image file that is indeed in the correct format, and both images look
> exactly the same.

--
This message was sent by Atlassian JIRA
(v6.2#6252)