[ https://issues.apache.org/jira/browse/PDFBOX-3239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15153114#comment-15153114 ]
Tilman Hausherr commented on PDFBOX-3239: ----------------------------------------- Then they are better than Adobe Reader. Or they are using some heuristics, i.e. guessing. Or they are doing OCR. > PDFTextStripper parses document as empty > ---------------------------------------- > > Key: PDFBOX-3239 > URL: https://issues.apache.org/jira/browse/PDFBOX-3239 > Project: PDFBox > Issue Type: Bug > Affects Versions: 1.8.11, 2.0.0 > Environment: OSX 10.10.5, JAVA8 > Reporter: Vojtech Knyttl > > The document is parsed as empty with new lines at the end of each page. > http://pub.goout.cz/malformed_parse.pdf -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org