[ 
https://issues.apache.org/jira/browse/PDFBOX-2043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler closed PDFBOX-2043.
--------------------------------------

    Resolution: Not a Problem
      Assignee: Andreas Lehmkühler

[~venkateshmpr] There is some space but no spaces. The given pdf doesn't 
contain any spaces at the postion you've mentioned and PDFBox doesn't add some 
spaces to preserve the (visual) formatting. It simply extracts the whole text 
of a pdf. So everything works as expected.

Set to closed

> While Reading a PDF which contains Image the Content of the PDF is misaligned 
> in the resulting text.
> ----------------------------------------------------------------------------------------------------
>
>                 Key: PDFBOX-2043
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2043
>             Project: PDFBox
>          Issue Type: Bug
>         Environment: Visual Studio 2005
>            Reporter: Venkatesan
>            Assignee: Andreas Lehmkühler
>         Attachments: Result.txt, Sample.pdf, Space.jpg
>
>
> We are trying to read content of a PDF file, The PDF has images in the 
> header. We use the PDFTextStripper.getText() method. After calling this 
> method the resulting text is misaligned compare to the Original PDF.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to