[ https://issues.apache.org/jira/browse/PDFBOX-1835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13900421#comment-13900421 ]
Henning Bermeyer commented on PDFBOX-1835:
------------------------------------------
I tried to break the recursion by detecting a simple self-reference, using the
code of 1.8.4, but as I expected this did not help.
Side note: the problem persists in 1.8.4.
The problem smells a little like an invalid reuse of collection values, a
violation of immutability. Unfortunately I have no idea how the PDF
parser works or how the dictionary is used, so it is hard for me to debug this.
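
One possible reason a simple self-reference check does not help is that the
cycle may be indirect (A contains B, B contains A), so no dictionary ever
contains itself directly. The following is only a sketch of a toString that
tracks every dictionary on the current recursion path. The Dict class and the
describe method are hypothetical stand-ins invented for illustration; they are
not the PDFBox COSDictionary and this is not a proposed patch.

```java
import java.util.Collections;
import java.util.IdentityHashMap;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

class CycleSafeToString {

    /** Hypothetical dictionary-like class (assumption, not COSDictionary). */
    static final class Dict {
        final Map<String, Object> items = new LinkedHashMap<String, Object>();

        Dict put(String key, Object value) {
            items.put(key, value);
            return this;
        }

        @Override
        public String toString() {
            return describe(this);
        }
    }

    /** Renders a Dict, printing a marker instead of recursing into a cycle. */
    static String describe(Dict root) {
        // Identity-based set: compares by reference, so it never calls
        // equals/hashCode on the (possibly cyclic) dictionaries.
        Set<Dict> onPath =
                Collections.newSetFromMap(new IdentityHashMap<Dict, Boolean>());
        StringBuilder out = new StringBuilder();
        append(root, onPath, out);
        return out.toString();
    }

    private static void append(Dict dict, Set<Dict> onPath, StringBuilder out) {
        if (!onPath.add(dict)) {
            // dict is already being printed further up the call stack.
            out.append("<cycle>");
            return;
        }
        out.append('{');
        boolean first = true;
        for (Map.Entry<String, Object> e : dict.items.entrySet()) {
            if (!first) {
                out.append(", ");
            }
            first = false;
            out.append(e.getKey()).append(": ");
            Object value = e.getValue();
            if (value instanceof Dict) {
                append((Dict) value, onPath, out);
            } else {
                out.append(value);
            }
        }
        out.append('}');
        // Leaving this dictionary: it may legitimately appear again elsewhere.
        onPath.remove(dict);
    }

    public static void main(String[] args) {
        Dict a = new Dict();
        Dict b = new Dict();
        a.put("name", "a").put("child", b);
        b.put("name", "b").put("parent", a); // indirect cycle: a -> b -> a
        System.out.println(a); // {name: a, child: {name: b, parent: <cycle>}}
    }
}
```

Removing a dictionary from the set on the way out means a shared dictionary
that is referenced twice without forming a cycle is still printed in full both
times; only a true back-reference is cut off.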
> Stackoverflow pdfbox 1.8.3
> --------------------------
>
> Key: PDFBOX-1835
> URL: https://issues.apache.org/jira/browse/PDFBOX-1835
> Project: PDFBox
> Issue Type: Bug
> Affects Versions: 1.8.3
> Environment: CentOS6
> Java 7
> Reporter: Johnny Minty
>
> Hi guys, I'm using Apache Tika in an application, which uses PDFBox for PDF
> parsing.
> When trying to read a specific PDF attachment I'm receiving the following
> stack overflow:
> Caused by: java.lang.StackOverflowError
>     at java.lang.String.valueOf(String.java:2854)
>     at java.lang.StringBuilder.append(StringBuilder.java:128)
>     at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1450)
>     at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
>     at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
>     at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
>     at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
>     at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
>     at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
>     at org.apache.pdfbox.cos.COSDictionary.toString(COSDictionary.java:1453)
--
This message was sent by Atlassian JIRA (v6.1.5#6160)