Marian Mar created PDFBOX-2544:
----------------------------------
Summary: Building pdfbox 1.8.6 or 1.8.7 using IKVM
Key: PDFBOX-2544
URL: https://issues.apache.org/jira/browse/PDFBOX-2544
Project: PDFBox
Issue Type: Bug
Components: .NET, FontBox, Text extraction
Affects Versions: 1.8.7, 1.8.6
Environment: Windows 8.1 Pro, IKVM 8.0.5415 or IKVM 7.2.4630.5, JDK 7
(or 8) 64 bit, .NET 4.0
Reporter: Marian Mar
I tried to build pdfbox 1.8.6 (and 1.8.7) using IKVM according to this
http://stackoverflow.com/questions/8441991/how-to-build-pdfbox-for-net
instructions. Everything went fine, but If I tested this code below on some pdf
(e.g. http://www.energy.umich.edu/sites/default/files/pdf-sample.pdf) I get
exception on line with writeText. If I tested the same with pdfbox 1.8.5 I had
no problems.
class MyPDFTextStripper : PDFTextStripper {
...
java.io.StringWriter outputStream = new java.io.StringWriter();
writeText(document, outputStream); // Exception
...
}
Exceprtion:
An unhandled exception of type 'System.TypeInitializationException'
occurred in pdfbox-1.8.7.dll
Additional information: The type initializer for
'org.apache.fontbox.cmap.CMap' threw an exception.
Stack trace:
at org.apache.fontbox.cmap.CMap..ctor()
at org.apache.fontbox.cmap.CMapParser.parse(String resourceRoot, InputStream
input)
at org.apache.pdfbox.pdmodel.font.PDFont.parseCmap(String cmapRoot,
InputStream cmapStream)
at org.apache.pdfbox.pdmodel.font.PDSimpleFont.extractToUnicodeEncoding()
at org.apache.pdfbox.pdmodel.font.PDSimpleFont.determineEncoding()
at org.apache.pdfbox.pdmodel.font.PDType1Font.determineEncoding()
at org.apache.pdfbox.pdmodel.font.PDFont..ctor(COSDictionary fontDictionary)
at org.apache.pdfbox.pdmodel.font.PDSimpleFont..ctor(COSDictionary
fontDictionary)
at org.apache.pdfbox.pdmodel.font.PDType1Font..ctor(COSDictionary
fontDictionary)
at org.apache.pdfbox.pdmodel.font.PDFontFactory.createFont(COSDictionary dic)
at org.apache.pdfbox.pdmodel.PDResources.getFonts()
at org.apache.pdfbox.util.PDFStreamEngine.getFonts()
at org.apache.pdfbox.util.operator.SetTextFont.process(PDFOperator operator,
List arguments)
at org.apache.pdfbox.util.PDFStreamEngine.processOperator(PDFOperator
operator, List arguments)
at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(COSStream )
at org.apache.pdfbox.util.PDFStreamEngine.processSubStream(PDPage aPage,
PDResources resources, COSStream cosStream)
at org.apache.pdfbox.util.PDFStreamEngine.processStream(PDPage aPage,
PDResources resources, COSStream cosStream)
at org.apache.pdfbox.util.PDFTextStripper.processPage(PDPage page, COSStream
content)
at org.apache.pdfbox.util.PDFTextStripper.processPages(List pages)
at org.apache.pdfbox.util.PDFTextStripper.writeText(PDDocument doc, Writer
outputStream)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)