Johan Delouvée created PDFBOX-3845: --------------------------------------
Summary: Could not find referenced cmap stream H Key: PDFBOX-3845 URL: https://issues.apache.org/jira/browse/PDFBOX-3845 Project: PDFBox Issue Type: Bug Components: FontBox, PDModel Affects Versions: 2.0.6, 2.0.2 Environment: Windows 7 Reporter: Johan Delouvée Attachments: refman.pdf The PDFTextStripperByArea.extractRegions fails for the first page of the attached document. java.io.IOException: Error: Could not find referenced cmap stream H at org.apache.fontbox.cmap.CMapParser.getExternalCMap(CMapParser.java:415) at org.apache.fontbox.cmap.CMapParser.parsePredefined(CMapParser.java:87) at org.apache.pdfbox.pdmodel.font.CMapManager.getPredefinedCMap(CMapManager.java:54) at org.apache.pdfbox.pdmodel.font.PDType0Font.readEncoding(PDType0Font.java:181) at org.apache.pdfbox.pdmodel.font.PDType0Font.<init>(PDType0Font.java:129) at org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:83) at org.apache.pdfbox.pdmodel.PDResources.getFont(PDResources.java:123) at org.apache.pdfbox.contentstream.operator.text.SetFontAndSize.process(SetFontAndSize.java:60) at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:815) at com.geensoft.traceability.converters.pdf.strippers.AreaStripper.processOperator(AreaStripper.java:324) at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:472) at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:446) at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:149) at org.apache.pdfbox.text.PDFTextStreamEngine.processPage(PDFTextStreamEngine.java:136) at org.apache.pdfbox.text.PDFTextStripper.processPage(PDFTextStripper.java:391) at org.apache.pdfbox.text.PDFTextStripperByArea.extractRegions(PDFTextStripperByArea.java:132) -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org