[ 
https://issues.apache.org/jira/browse/PDFBOX-2043?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13984155#comment-13984155
 ] 

Venkatesan commented on PDFBOX-2043:
------------------------------------

Hi

The above-mentioned solution works fine. However, for the same PDF file, the 
PDFTextStripper.getText() method trims the spaces between the values. 
Please see the attached screenshot and the result text.

We also have some PDF files that are read successfully with PDFBox 0.7.3, but 
with PDFBox 1.8.4 a WrappedIOException is thrown.

Thanks and Regards
Venkatesan

> While Reading a PDF which contains Image the Content of the PDF is misaligned 
> in the resulting text.
> ----------------------------------------------------------------------------------------------------
>
>                 Key: PDFBOX-2043
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2043
>             Project: PDFBox
>          Issue Type: Bug
>         Environment: Visual Studio 2005
>            Reporter: Venkatesan
>         Attachments: Result.txt, Sample.pdf, Space.jpg
>
>
> We are trying to read the content of a PDF file. The PDF has images in the 
> header. We use the PDFTextStripper.getText() method. After calling this 
> method, the resulting text is misaligned compared to the original PDF.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
