[ https://issues.apache.org/jira/browse/PDFBOX-4324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16629720#comment-16629720 ]
Amit Maheshwari commented on PDFBOX-4324:
-----------------------------------------

[~tilman] Got that. Thanks. I'll let you know if any further clarification is needed.

> while extracting text from region : "Error: expected hex character and not s:115"
> ---------------------------------------------------------------------------------
>
>                 Key: PDFBOX-4324
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4324
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 2.0.2
>            Reporter: Amit Maheshwari
>            Priority: Major
>         Attachments: SRI NAGAR.PDF, ToUnicode.txt
>
>
> I am getting the following error when I try to extract text from any specific
> region of the 2nd page of the attached PDF (the 1st page works fine).
>
> Error Message:
> "Error: expected hex character and not s:115"
>
> Stack-trace:
> at org.apache.pdfbox.contentstream.PDFStreamEngine.operatorException(Operator operator, List operands, IOException e)
> at org.apache.pdfbox.contentstream.PDFStreamEngine.processOperator(Operator operator, List operands)
> at org.apache.pdfbox.contentstream.PDFStreamEngine.processStreamOperators(PDContentStream )
> at org.apache.pdfbox.contentstream.PDFStreamEngine.processStream(PDContentStream )
> at org.apache.pdfbox.contentstream.PDFStreamEngine.processPage(PDPage page)
> at org.apache.pdfbox.text.PDFTextStreamEngine.processPage(PDPage )
> at org.apache.pdfbox.text.PDFTextStripper.processPage(PDPage page)
> at org.apache.pdfbox.text.PDFTextStripperByArea.extractRegions(PDPage page)
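
For readers decoding the quoted error message: the text "s:115" appears to be the offending character followed by its numeric code (115 is the ASCII code of 's'), which is what a parser would report when it meets a non-hex character inside a `<...>` hex string. The following is only a minimal sketch of that behaviour, not PDFBox code. The class `HexTokenSketch` and the method `parseHex` are hypothetical names invented for this illustration, and the assumption that the real error arises in a similar way (for example while reading the font's ToUnicode data) is not confirmed by this message.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;

// Hypothetical illustration only; this is NOT the PDFBox implementation.
class HexTokenSketch {

    // Parses a hex string token such as "<0041>" into its bytes.
    // Throws an IOException whose message mimics the format seen in the report.
    static byte[] parseHex(String token) throws IOException {
        if (token.length() < 2
                || token.charAt(0) != '<'
                || token.charAt(token.length() - 1) != '>') {
            throw new IOException("Error: token is not delimited by < and >");
        }
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        int pending = -1; // high nibble waiting for its partner, or -1 if none
        for (int i = 1; i < token.length() - 1; i++) {
            char c = token.charAt(i);
            if (Character.isWhitespace(c)) {
                continue; // whitespace between hex digits is skipped
            }
            int digit = Character.digit(c, 16);
            if (digit < 0) {
                // Report the character and its numeric code, e.g. "s:115"
                throw new IOException(
                        "Error: expected hex character and not " + c + ":" + (int) c);
            }
            if (pending < 0) {
                pending = digit;
            } else {
                out.write((pending << 4) | digit);
                pending = -1;
            }
        }
        if (pending >= 0) {
            out.write(pending << 4); // odd digit count: treat the missing digit as 0
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        try {
            System.out.println(parseHex("<0041>").length); // well-formed token
            parseHex("<00s1>");                            // 's' is not a hex digit
        } catch (IOException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

Feeding the sketch a token that contains an 's' reproduces a message of the same shape as the one reported, which suggests looking for malformed hex strings in the data used by the 2nd page.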