[ 
https://issues.apache.org/jira/browse/PDFBOX-2749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14483703#comment-14483703
 ] 

Tilman Hausherr commented on PDFBOX-2749:
-----------------------------------------

Could you please test again with 1.8.9 to see if it has improved? If it still 
happens, please attach the smallest possible code with the problem so that we 
can test if it also happens with 2.0.

> Annotations character bounding boxes size 3 times higher than expected
> ----------------------------------------------------------------------
>
>                 Key: PDFBOX-2749
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2749
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.8.4
>            Reporter: Hayk Hayryan
>            Priority: Critical
>         Attachments: RESULT.pdf
>
>
> After text extraction the character bounding boxes 3 times higher than 
> expected. For example, see the first few character bounding boxes below:
> [90.1,46,6.64,40.06],[96.7,46,5.09,40.06],[101.79,46,5.8,40.06].
> The values are x, y, width, height. The width of the characters are between 5 
> and 7 pixels, but the height of the characters are 40.6 pixels. The actual 
> height of each line of text appears to be about 12 pixels. The example pdf 
> document attached.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org

Reply via email to