[ https://issues.apache.org/jira/browse/PDFBOX-4390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tilman Hausherr resolved PDFBOX-4390. ------------------------------------- Resolution: Fixed > ExtractText loses spaces when rotationMagic option is used > ---------------------------------------------------------- > > Key: PDFBOX-4390 > URL: https://issues.apache.org/jira/browse/PDFBOX-4390 > Project: PDFBox > Issue Type: Bug > Components: Text extraction > Affects Versions: 2.0.12, 2.0.13 > Reporter: Tilman Hausherr > Assignee: Tilman Hausherr > Priority: Major > Fix For: 2.0.14 > > Attachments: PDFBOX-4390-082220-p1.pdf > > > This was detected by looking at the result of a regression test thankfully > done by [~talli...@apache.org] (see at the end of PDFBOX-4371) for his work > in TIKA-2779, there were many new words but some didn't have the spaces. This > is the result of a bad angle (180 instead of 0), because the font matrix > hasn't been considered, for type 3 fonts this is often a rotation or a flip. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org