[ https://issues.apache.org/jira/browse/TIKA-1462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
James Hardwick closed TIKA-1462. -------------------------------- Resolution: Duplicate Looks like this was already handled via TIKA-1424 > PDFont consumes all heap space > ------------------------------ > > Key: TIKA-1462 > URL: https://issues.apache.org/jira/browse/TIKA-1462 > Project: Tika > Issue Type: Bug > Components: parser > Affects Versions: 1.6 > Reporter: James Hardwick > Priority: Critical > > See https://issues.apache.org/jira/browse/PDFBOX-2200 for more details. > In short, PDFont will not release resources, and will eventually amass enough > objects to consume all available memory. We are encountering this in > productions environments, causing our solr server to crash when ingesting > large amounts of PDF documents. > The fix is supposedly in for the 2.0.0 release of PDFBox, but that version > has been outstanding for so long that I'd suggest implementing the workaround > as proposed in the PDFBox issue. -- This message was sent by Atlassian JIRA (v6.3.4#6332)