[
https://issues.apache.org/jira/browse/PDFBOX-533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12761553#action_12761553
]
Lars Torunski commented on PDFBOX-533:
--------------------------------------
fontbox-0.8.0-incubating.jar and icu4j-3.8.jar are installed, but I'm getting
similar thread dumps:
org.apache.pdfbox.exceptions.WrappedIOException
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:237)
at mycompany....
at java.lang.Thread.run(Thread.java:534)
Caused by: java.util.NoSuchElementException
at java.util.AbstractList$Itr.next(AbstractList.java:426)
at
org.apache.pdfbox.pdfparser.PDFXrefStreamParser.parse(PDFXrefStreamParser.java:115)
at
org.apache.pdfbox.cos.COSDocument.parseXrefStreams(COSDocument.java:538)
at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:203)
... 11 more
PDF file: www.oppenheim.pl/plpl/_download/09_05_11_Archiv.pdf
> PDFTextStripper.writeCharacters is called no where in the class
> ---------------------------------------------------------------
>
> Key: PDFBOX-533
> URL: https://issues.apache.org/jira/browse/PDFBOX-533
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 0.8.0-incubator
> Reporter: Navendu Garg
> Attachments: TestPDFTextStripperPerf.java
>
>
> It seems writeCharacters method is not called anywhere in the PDFTextStripper
> class. This makes it impossible for handling character TextPosition as well
> as Line Separator because processLineSeparator method is no longer there and
> writeLineSeparator is called when actual writing happens.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.