Is the file encrypted with a user or owner password?
Is it an indexed file (such as a GIF or indexed PNG)?

---- 
Thanks,
Adam





From:
"Eric Schwarzenbach (JIRA)" <j...@apache.org>
To:
dev@pdfbox.apache.org
Date:
03/04/2011 17:07
Subject:
[jira] Commented: (PDFBOX-958) convertToImage mangles images which were in 
the PDF




    [ 
https://issues.apache.org/jira/browse/PDFBOX-958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13002898#comment-13002898
 
] 

Eric Schwarzenbach commented on PDFBOX-958:
-------------------------------------------

We also have another document (from a client so I can't share it), with a 
number of black and white photos in it, all of which come out with the 
colors inverted (like a negative). There is also a non-photographic image, 
a map, which gets messed up in an entirely different way. In general our 
success rate just trying random PDFs had not been great for images. Could 
one of the PDFBox developers comment? Is this an area of the tool that is 
in rudimentary state?

> convertToImage mangles images which were in the PDF
> ---------------------------------------------------
>
>                 Key: PDFBOX-958
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-958
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 1.2.1, 1.4.0, 1.5.0
>         Environment: RHEL5 and WinXP, java version "1.6.0_23"
>            Reporter: Eric Schwarzenbach
>            Priority: Critical
>         Attachments: Image of Page 13.jpeg, Image of Page 13.png, 
Wrycan® Lorem Ipsum Test.pdf
>
>
> Of the PDFs we've tried running through PDFBox and generating page 
images, a number of them (coming from disparate sources and method of 
creation) seem to produce images where an image that was embedded in the 
page of the PDF shows somewhat mangled. It seems to be divided by 
horizontal stripes, where some stripes look normal, others seem to have 
some kind of "smearing" effect going on. See attached images and original 
PDF (image is of page 13).
> I marked this as critical as we are trying to use PDFBox in a project 
where page images are crucial, and inability to produce reasonable looking 
page images is pretty much a deal breaker. 
> The code we use to extract the images looks more or less like the 
following:
>  BufferedImage image = page.convertToImage();
>  
>  SmartDeferredFileOutputStream outStream = new 
SmartDeferredFileOutputStream();
>  String[] writerFormatNames = ImageIO.getWriterFormatNames();
>  ImageIO.write(image, "jpeg", outStream);
>  outStream.close()
> We've also tried specifying "png". In both "jpg" and "png" cases we get 
an image file that is indeed the correct format, and both images look 
exactly the same. 

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

 





- FHA 203b; 203k; HECM; VA; USDA; Conventional 
- Warehouse Lines; FHA-Authorized Originators 
- Lending and Servicing in over 45 States 
www.swmc.com   -  www.simplehecmcalculator.com   
Visit  www.swmc.com/resources   for helpful links on Training, Webinars, Lender 
Alerts and Submitting Conditions  

This email and any content within or attached hereto from Sun West Mortgage 
Company, Inc. is confidential and/or legally privileged. The information is 
intended only for the use of the individual or entity named on this email. If 
you are not the intended recipient, you are hereby notified that any 
disclosure, copying, distribution or taking any action in reliance on the 
contents of this email information is strictly prohibited, and that the 
documents should be returned to this office immediately by email. Receipt by 
anyone other than the intended recipient is not a waiver of any privilege. 
Please do not include your social security number, account number, or any other 
personal or financial information in the content of the email. Should you have 
any questions, please call (800) 453 7884.  

Reply via email to