[ https://issues.apache.org/jira/browse/TIKA-521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12917819#action_12917819 ]
Sjoerd Smeets commented on TIKA-521: ------------------------------------ I'm facing the same issue. Increasing the heapssize to the maximum will cover for a certain amount of xlsx files, but there are still a lot of files causing an OutOfMemoryError (> 10 Mb XLS files). The XSSFEventBasedExcelExtractor indeed processes these files as we would like to. What would be the draw back of using XSSFEventBasedExcelExtractor? > OutOfMemoryError Parsing XSLX File > ---------------------------------- > > Key: TIKA-521 > URL: https://issues.apache.org/jira/browse/TIKA-521 > Project: Tika > Issue Type: Bug > Affects Versions: 0.7, 0.8 > Reporter: Stephen Duncan Jr > Attachments: memory-test.xlsx > > > I have several XSLX files I'm trying to parse with Tika that are failing with > an OutOfMemoryError even when using a large heap size. For instance the > attached 1.26MB excel file fails using a 512MB heap. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.