[ https://issues.apache.org/jira/browse/PDFBOX-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12922897#action_12922897 ]
Jukka Zitting commented on PDFBOX-874:
--------------------------------------
A good way to debug this is to set
<argLine>-Xmx128m -XX:+HeapDumpOnOutOfMemoryError</argLine>
in the surefire plugin configuration. I ran the tests with that setting, and the
resulting heap dump shows font and cmap objects taking up a lot of memory.
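
For reference, a minimal sketch of where that setting might go in pdfbox/pom.xml.
The plugin coordinates are the standard ones for maven-surefire-plugin; the
surrounding layout of the actual pom.xml is an assumption, so merge the
<argLine> element into whatever surefire configuration is already there.

```xml
<!-- Sketch only: placement within the existing <build><plugins> section is assumed. -->
<plugin>
  <groupId>org.apache.maven.plugins</groupId>
  <artifactId>maven-surefire-plugin</artifactId>
  <configuration>
    <!-- Cap the test JVM heap at 128 MB and write a heap dump when it runs out. -->
    <argLine>-Xmx128m -XX:+HeapDumpOnOutOfMemoryError</argLine>
  </configuration>
</plugin>
```

On a HotSpot JVM the dump is written by default to the working directory of the
forked test JVM as java_pid<pid>.hprof, and can then be opened in a heap
analyzer. Adding -XX:HeapDumpPath=<path> should redirect it elsewhere if needed.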
> OutOfMemoryError in text extraction tests
> -----------------------------------------
>
> Key: PDFBOX-874
> URL: https://issues.apache.org/jira/browse/PDFBOX-874
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 1.3.0
> Reporter: Jukka Zitting
>
> As reported on dev@:
> TestTextStripper failed:
> testExtract(org.apache.pdfbox.util.TestTextStripper)  Time elapsed: 7.32 sec  <<< ERROR!
> java.lang.OutOfMemoryError: Java heap space
>     at com.ibm.icu.impl.UCharacterNameReader.read(UCharacterNameReader.java:90)
> I can reproduce it by adding a <argLine>-Xmx128m</argLine> option to the
> surefire plugin configuration in pdfbox/pom.xml. The same problem doesn't
> occur with 1.2.1.