[ https://issues.apache.org/jira/browse/PDFBOX-3950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tilman Hausherr updated PDFBOX-3950:
------------------------------------
    Attachment: 6MNJMPJVZMZRSTE5A4ENHP3F5SIOG27T.pdf

I'm being lenient on the missing /gs name too because it gives us an advantage in text extraction; see page 4 of the attached file 6MNJMPJVZMZRSTE5A4ENHP3F5SIOG27T.pdf. I'll have to adjust the test too and fix an NPE in font processing that happens with the original file (where my fix won't improve anything).

> NPE in PageIterator.enqueueKids
> -------------------------------
>
>                 Key: PDFBOX-3950
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-3950
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>    Affects Versions: 2.0.8
>            Reporter: Tilman Hausherr
>            Assignee: Andreas Lehmkühler
>              Labels: regression
>             Fix For: 2.0.8, 3.0.0
>
>         Attachments: 23EGDHXSBBYQLKYOKGZUOVYVNE675PRD.pdf, 6MNJMPJVZMZRSTE5A4ENHP3F5SIOG27T.pdf
>
>
> {code}
> Exception in thread "main" java.lang.NullPointerException
>       at java.util.ArrayDeque.addLast(ArrayDeque.java:244)
>       at java.util.ArrayDeque.add(ArrayDeque.java:418)
>       at org.apache.pdfbox.pdmodel.PDPageTree$PageIterator.enqueueKids(PDPageTree.java:178)
>       at org.apache.pdfbox.pdmodel.PDPageTree$PageIterator.enqueueKids(PDPageTree.java:173)
>       at org.apache.pdfbox.pdmodel.PDPageTree$PageIterator.<init>(PDPageTree.java:159)
>       at org.apache.pdfbox.pdmodel.PDPageTree$PageIterator.<init>(PDPageTree.java:153)
>       at org.apache.pdfbox.pdmodel.PDPageTree.iterator(PDPageTree.java:123)
>       at org.apache.pdfbox.text.PDFTextStripper.processPages(PDFTextStripper.java:282)
> {code}
> This worked in 2.0.7. There are about 200 occurrences of this exception in the tests by Tim.
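
For readers unfamiliar with the failure mode in the stack trace: java.util.ArrayDeque does not permit null elements, so add() and addLast() throw a NullPointerException when handed null. The sketch below reproduces that with a tiny tree whose kids list contains a null entry, and shows one lenient way to traverse it by skipping such entries. It is only an illustration under assumptions: Node, NullKidSketch, and the lenient flag are hypothetical names, and this is neither the PDFBox code nor the fix that was committed for this issue.

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.List;
import java.util.Queue;

public class NullKidSketch {
    /** Hypothetical stand-in for a page tree node; not a PDFBox class. */
    static final class Node {
        final String name;
        final List<Node> kids;

        Node(String name, Node... kids) {
            this.name = name;
            this.kids = Arrays.asList(kids); // Arrays.asList tolerates null entries
        }
    }

    /**
     * Recursively queues the leaves below the given node.
     * With lenient == false a null kid reaches queue.add(null), which
     * ArrayDeque rejects with a NullPointerException.
     * With lenient == true null kids are silently skipped.
     */
    static void enqueueKids(Node node, Queue<Node> queue, boolean lenient) {
        if (node == null && lenient) {
            return; // tolerate a broken reference instead of failing
        }
        if (node != null && !node.kids.isEmpty()) {
            for (Node kid : node.kids) {
                enqueueKids(kid, queue, lenient);
            }
        } else {
            queue.add(node); // throws NullPointerException if node is null
        }
    }

    public static void main(String[] args) {
        // A tree with one broken (null) kid between two valid leaves.
        Node root = new Node("root", new Node("page1"), null, new Node("page2"));

        Queue<Node> queue = new ArrayDeque<>();
        try {
            enqueueKids(root, queue, false);
        } catch (NullPointerException e) {
            System.out.println("strict traversal failed: NullPointerException");
        }

        queue.clear();
        enqueueKids(root, queue, true);
        System.out.println("lenient traversal queued " + queue.size() + " pages");
    }
}
```

Whether skipping is the right policy depends on the caller: it keeps the remaining pages reachable, but it also hides the fact that the file is damaged, so a real implementation might prefer to log the skipped entry.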