[ 
https://issues.apache.org/jira/browse/PDFBOX-2749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr closed PDFBOX-2749.
-----------------------------------
    Resolution: Cannot Reproduce

No answer, closing. Sizes seem to be pretty normal:
{code}
String[90.1,83.20001 fs=12.0 xscale=12.0 height=5.619141 space=4.176 
width=9.276001]A
String[99.388,83.20001 fs=12.0 xscale=12.0 height=5.619141 space=4.176 
width=7.104004]c
String[106.492004,83.20001 fs=12.0 xscale=12.0 height=5.619141 space=4.176 
width=7.104004]c
String[113.59601,83.20001 fs=12.0 xscale=12.0 height=5.619141 space=4.176 
width=8.531998]u
String[122.188,83.20001 fs=12.0 xscale=12.0 height=5.619141 space=4.176 
width=7.1399994]s
String[129.292,83.20001 fs=12.0 xscale=12.0 height=5.619141 space=4.176 
width=8.244003]o
String[137.488,83.20001 fs=12.0 xscale=12.0 height=5.619141 space=4.176 
width=5.220001]f
String[142.78001,83.20001 fs=12.0 xscale=12.0 height=5.619141 space=4.176 
width=5.7360077]t
String[148.48003,83.20001 fs=12.0 xscale=12.0 height=5.619141 space=4.176 
width=4.175995] 
String[152.66801,83.20001 fs=12.0 xscale=12.0 height=5.619141 space=4.176 
width=8.783997]P
String[161.45201,83.20001 fs=12.0 xscale=12.0 height=5.619141 space=4.176 
width=5.9160004]r
{code}

> Annotations character bounding boxes size 3 times higher than expected
> ----------------------------------------------------------------------
>
>                 Key: PDFBOX-2749
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2749
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.8.4
>            Reporter: Hayk Hayryan
>            Priority: Critical
>         Attachments: RESULT.pdf
>
>
> After text extraction the character bounding boxes 3 times higher than 
> expected. For example, see the first few character bounding boxes below:
> [90.1,46,6.64,40.06],[96.7,46,5.09,40.06],[101.79,46,5.8,40.06].
> The values are x, y, width, height. The width of the characters are between 5 
> and 7 pixels, but the height of the characters are 40.6 pixels. The actual 
> height of each line of text appears to be about 12 pixels. The example pdf 
> document attached.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to