Johan Delouvée created PDFBOX-3845:
--------------------------------------

             Summary: Could not find referenced cmap stream H
                 Key: PDFBOX-3845
                 URL: https://issues.apache.org/jira/browse/PDFBOX-3845
             Project: PDFBox
          Issue Type: Bug
          Components: FontBox, PDModel
    Affects Versions: 2.0.6, 2.0.2
         Environment: Windows 7
            Reporter: Johan Delouvée
         Attachments: refman.pdf

The PDFTextStripperByArea.extractRegions fails for the first page of the 
attached document.

java.io.IOException: Error: Could not find referenced cmap stream H
        at 
org.apache.fontbox.cmap.CMapParser.getExternalCMap(CMapParser.java:415)
        at 
org.apache.fontbox.cmap.CMapParser.parsePredefined(CMapParser.java:87)
        at 
org.apache.pdfbox.pdmodel.font.CMapManager.getPredefinedCMap(CMapManager.java:54)
        at 
org.apache.pdfbox.pdmodel.font.PDType0Font.readEncoding(PDType0Font.java:181)
        at 
org.apache.pdfbox.pdmodel.font.PDType0Font.<init>(PDType0Font.java:129)
        at 
org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(PDFontFactory.java:83)
        at org.apache.pdfbox.pdmodel.PDResources.getFont(PDResources.java:123)
        at 
org.apache.pdfbox.contentstream.operator.text.SetFontAndSize.process(SetFontAndSize.java:60)
        at 
org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(PDFStreamEngine.java:815)
        at 
com.geensoft.traceability.converters.pdf.strippers.AreaStripper.processOperator(AreaStripper.java:324)
        at 
org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDFStreamEngine.java:472)
        at 
org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDFStreamEngine.java:446)
        at 
org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDFStreamEngine.java:149)
        at 
org.apache.pdfbox.text.PDFTextStreamEngine.processPage(PDFTextStreamEngine.java:136)
        at 
org.apache.pdfbox.text.PDFTextStripper.processPage(PDFTextStripper.java:391)
        at 
org.apache.pdfbox.text.PDFTextStripperByArea.extractRegions(PDFTextStripperByArea.java:132)



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org

Reply via email to