[ https://issues.apache.org/jira/browse/PDFBOX-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tilman Hausherr updated PDFBOX-1298: ------------------------------------ Description: I get this exception with page.convertToImage(): Apr 27, 2012 10:58:46 AM org.apache.pdfbox.util.PDFStreamEngine processOperator Warnung: java.lang.IllegalArgumentException: fromIndex(0) > toIndex(-2) java.lang.IllegalArgumentException: fromIndex(0) > toIndex(-2) at java.util.ArrayList.subListRangeCheck(ArrayList.java:924) at java.util.ArrayList$SubList.subList(ArrayList.java:1109) at org.apache.fontbox.cff.CharStringConverter.handleType2Command(CharStringConverter.java:259) at org.apache.fontbox.cff.CharStringConverter.handleCommand(CharStringConverter.java:81) at org.apache.fontbox.cff.CharStringHandler.handleSequence(CharStringHandler.java:53) at org.apache.fontbox.cff.CharStringConverter.convert(CharStringConverter.java:64) at org.apache.fontbox.cff.CFFFont$Mapping.toType1Sequence(CFFFont.java:374) at org.apache.fontbox.cff.Type1FontFormatter.printEexecFontDictionary(Type1FontFormatter.java:177) at org.apache.fontbox.cff.Type1FontFormatter.printFontDictionary(Type1FontFormatter.java:127) at org.apache.fontbox.cff.Type1FontFormatter.printFont(Type1FontFormatter.java:57) at org.apache.fontbox.cff.Type1FontFormatter.format(Type1FontFormatter.java:47) at org.apache.pdfbox.pdmodel.font.PDType1CFont.prepareAwtFont(PDType1CFont.java:546) at org.apache.pdfbox.pdmodel.font.PDType1CFont.getawtFont(PDType1CFont.java:287) at org.apache.pdfbox.pdmodel.font.PDType1Font.getawtFont(PDType1Font.java:216) at org.apache.pdfbox.pdmodel.font.PDSimpleFont.drawString(PDSimpleFont.java:101) at org.apache.pdfbox.pdfviewer.PageDrawer.processTextPosition(PageDrawer.java:194) at org.apache.pdfbox.util.PDFStreamEngine.processEncodedText(PDFStreamEngine.java:494) at org.apache.pdfbox.util.operator.ShowText.process(ShowText.java:45) at org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:551) at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:274) at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:251) at org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:225) at org.apache.pdfbox.pdfviewer.PageDrawer.drawPage(PageDrawer.java:107) at org.apache.pdfbox.pdmodel.PDPage.convertToImage(PDPage.java:722) at org.apache.pdfbox.pdmodel.PDPage.convertToImage(PDPage.java:693) at pdfboxpageimageextraction.ExtractImages.doPdf(ExtractImages.java:78) at pdfboxpageimageextraction.ExtractImages.main(ExtractImages.java:52) the code: ... document = PDDocument.load(pdfFilename); List pages = document.getDocumentCatalog().getAllPages(); int p = 0; for (Object pobj : pages) { PDPage page = (PDPage) pobj; ++p; BufferedImage bim = page.convertToImage(); .... This happens on page 25, (and others) on the attached PDF. (Note that when viewing that file with Acrobat Reader, the page number is "xxiv"). It seems to deal with special characters. In the attached PNG file, at least one such character is missing. was: I get this exception with page.convertToImage(): Apr 27, 2012 10:58:46 AM org.apache.pdfbox.util.PDFStreamEngine processOperator Warnung: java.lang.IllegalArgumentException: fromIndex(0) > toIndex(-2) java.lang.IllegalArgumentException: fromIndex(0) > toIndex(-2) at java.util.ArrayList.subListRangeCheck(ArrayList.java:924) at java.util.ArrayList$SubList.subList(ArrayList.java:1109) at org.apache.fontbox.cff.CharStringConverter.handleType2Command(CharStringConverter.java:259) at org.apache.fontbox.cff.CharStringConverter.handleCommand(CharStringConverter.java:81) at org.apache.fontbox.cff.CharStringHandler.handleSequence(CharStringHandler.java:53) at org.apache.fontbox.cff.CharStringConverter.convert(CharStringConverter.java:64) at org.apache.fontbox.cff.CFFFont$Mapping.toType1Sequence(CFFFont.java:374) at org.apache.fontbox.cff.Type1FontFormatter.printEexecFontDictionary(Type1FontFormatter.java:177) at org.apache.fontbox.cff.Type1FontFormatter.printFontDictionary(Type1FontFormatter.java:127) at org.apache.fontbox.cff.Type1FontFormatter.printFont(Type1FontFormatter.java:57) at org.apache.fontbox.cff.Type1FontFormatter.format(Type1FontFormatter.java:47) at org.apache.pdfbox.pdmodel.font.PDType1CFont.prepareAwtFont(PDType1CFont.java:546) at org.apache.pdfbox.pdmodel.font.PDType1CFont.getawtFont(PDType1CFont.java:287) at org.apache.pdfbox.pdmodel.font.PDType1Font.getawtFont(PDType1Font.java:216) at org.apache.pdfbox.pdmodel.font.PDSimpleFont.drawString(PDSimpleFont.java:101) at org.apache.pdfbox.pdfviewer.PageDrawer.processTextPosition(PageDrawer.java:194) at org.apache.pdfbox.util.PDFStreamEngine.processEncodedText(PDFStreamEngine.java:494) at org.apache.pdfbox.util.operator.ShowText.process(ShowText.java:45) at org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:551) at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:274) at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:251) at org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:225) at org.apache.pdfbox.pdfviewer.PageDrawer.drawPage(PageDrawer.java:107) at org.apache.pdfbox.pdmodel.PDPage.convertToImage(PDPage.java:722) at org.apache.pdfbox.pdmodel.PDPage.convertToImage(PDPage.java:693) at pdfboxpageimageextraction.ExtractImages.doPdf(ExtractImages.java:78) at pdfboxpageimageextraction.ExtractImages.main(ExtractImages.java:52) the code: ... document = PDDocument.load(pdfFilename); List pages = document.getDocumentCatalog().getAllPages(); int p = 0; for (Object pobj : pages) { PDPage page = (PDPage) pobj; ++p; BufferedImage bim = page.convertToImage(); .... This happens on page 25, (and others) on the attached PDF. (Note that in the PDF, the page itself is xxiv). It seems to deal with special characters. In the attached PNG file, at least one such character is missing. > java.lang.IllegalArgumentException: fromIndex(0) > toIndex(-2) > -------------------------------------------------------------- > > Key: PDFBOX-1298 > URL: https://issues.apache.org/jira/browse/PDFBOX-1298 > Project: PDFBox > Issue Type: Bug > Components: FontBox > Affects Versions: 1.6.0 > Environment: XP, JDK 1.7 > Reporter: Tilman Hausherr > Attachments: document-25.png, document.pdf > > > I get this exception with page.convertToImage(): > Apr 27, 2012 10:58:46 AM org.apache.pdfbox.util.PDFStreamEngine > processOperator > Warnung: java.lang.IllegalArgumentException: fromIndex(0) > toIndex(-2) > java.lang.IllegalArgumentException: fromIndex(0) > toIndex(-2) > at java.util.ArrayList.subListRangeCheck(ArrayList.java:924) > at java.util.ArrayList$SubList.subList(ArrayList.java:1109) > at > org.apache.fontbox.cff.CharStringConverter.handleType2Command(CharStringConverter.java:259) > at > org.apache.fontbox.cff.CharStringConverter.handleCommand(CharStringConverter.java:81) > at > org.apache.fontbox.cff.CharStringHandler.handleSequence(CharStringHandler.java:53) > at > org.apache.fontbox.cff.CharStringConverter.convert(CharStringConverter.java:64) > at > org.apache.fontbox.cff.CFFFont$Mapping.toType1Sequence(CFFFont.java:374) > at > org.apache.fontbox.cff.Type1FontFormatter.printEexecFontDictionary(Type1FontFormatter.java:177) > at > org.apache.fontbox.cff.Type1FontFormatter.printFontDictionary(Type1FontFormatter.java:127) > at > org.apache.fontbox.cff.Type1FontFormatter.printFont(Type1FontFormatter.java:57) > at > org.apache.fontbox.cff.Type1FontFormatter.format(Type1FontFormatter.java:47) > at > org.apache.pdfbox.pdmodel.font.PDType1CFont.prepareAwtFont(PDType1CFont.java:546) > at > org.apache.pdfbox.pdmodel.font.PDType1CFont.getawtFont(PDType1CFont.java:287) > at > org.apache.pdfbox.pdmodel.font.PDType1Font.getawtFont(PDType1Font.java:216) > at > org.apache.pdfbox.pdmodel.font.PDSimpleFont.drawString(PDSimpleFont.java:101) > at > org.apache.pdfbox.pdfviewer.PageDrawer.processTextPosition(PageDrawer.java:194) > at > org.apache.pdfbox.util.PDFStreamEngine.processEncodedText(PDFStreamEngine.java:494) > at org.apache.pdfbox.util.operator.ShowText.process(ShowText.java:45) > at > org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFStreamEngine.java:551) > at > org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:274) > at > org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDFStreamEngine.java:251) > at > org.apache.pdfbox.util.PDFStreamEngine.processStream(PDFStreamEngine.java:225) > at org.apache.pdfbox.pdfviewer.PageDrawer.drawPage(PageDrawer.java:107) > at org.apache.pdfbox.pdmodel.PDPage.convertToImage(PDPage.java:722) > at org.apache.pdfbox.pdmodel.PDPage.convertToImage(PDPage.java:693) > at pdfboxpageimageextraction.ExtractImages.doPdf(ExtractImages.java:78) > at pdfboxpageimageextraction.ExtractImages.main(ExtractImages.java:52) > the code: > ... > document = PDDocument.load(pdfFilename); > List pages = document.getDocumentCatalog().getAllPages(); > int p = 0; > for (Object pobj : pages) > { > PDPage page = (PDPage) pobj; > ++p; > BufferedImage bim = page.convertToImage(); > .... > This happens on page 25, (and others) on the attached PDF. (Note that when > viewing that file with Acrobat Reader, the page number is "xxiv"). It seems > to deal with special characters. In the attached PNG file, at least one such > character is missing. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira