Vitalie Bureanu created PDFBOX-1858:
---------------------------------------

             Summary: Extracted text does not have spaces
                 Key: PDFBOX-1858
                 URL: https://issues.apache.org/jira/browse/PDFBOX-1858
             Project: PDFBox
          Issue Type: Bug
          Components: Parsing, Text extraction
    Affects Versions: 1.8.3
         Environment: Linux 64bit, Java
            Reporter: Vitalie Bureanu


Extracted text does not have spaces between some words.

To reproduce, please use the text on line 74a... of the attached test.pdf.

It will be extracted as: "74a Amount of line73youwant refunded toyou . If 
Form8888 isattached , checkhere"

The result is not correct: the words are "glued" together.

I tried using the PDFTextStripper class, but the result remains the same.
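
For illustration, "glued" words like these typically appear when a text extractor decides that the horizontal gap between two pieces of text is too small to count as a word break. The sketch below is a simplified, hypothetical model of such a gap check, not PDFBox's actual implementation. The Chunk class, the join method and the tolerance values are assumptions made up for this example. In PDFBox itself, tolerance settings such as PDFTextStripper.setSpacingTolerance may be the relevant knobs, but that has not been verified against test.pdf.

```java
import java.util.ArrayList;
import java.util.List;

public class GapSpacingSketch {

    /** Hypothetical stand-in for one positioned piece of text on a line. */
    static final class Chunk {
        final String text;
        final float x;      // left edge, in page units
        final float width;  // total advance width, in page units

        Chunk(String text, float x, float width) {
            this.text = text;
            this.x = x;
            this.width = width;
        }
    }

    /**
     * Joins chunks that are already sorted left to right. A space is inserted
     * only when the gap to the previous chunk exceeds
     * tolerance * (average character width of the previous chunk).
     */
    static String join(List<Chunk> chunks, float tolerance) {
        StringBuilder out = new StringBuilder();
        Chunk prev = null;
        for (Chunk c : chunks) {
            if (prev != null) {
                float gap = c.x - (prev.x + prev.width);
                float avgCharWidth = prev.width / prev.text.length();
                if (gap > tolerance * avgCharWidth) {
                    out.append(' ');
                }
            }
            out.append(c.text);
            prev = c;
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Made-up positions: each gap is 2 units, average character width is 5.
        List<Chunk> line = new ArrayList<>();
        line.add(new Chunk("line", 0f, 20f));
        line.add(new Chunk("73", 22f, 10f));
        line.add(new Chunk("you", 34f, 15f));

        System.out.println(join(line, 0.5f)); // threshold 2.5: gaps too small, words are glued
        System.out.println(join(line, 0.3f)); // threshold 1.5: gaps count as word breaks
    }
}
```

With the made-up positions above, a tolerance of 0.5 yields "line73you", while 0.3 yields "line 73 you", which mirrors the kind of output reported here.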

Can it be solved, please?

With respect,
Vitalie


