[ https://issues.apache.org/jira/browse/TIKA-636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13017311#comment-13017311 ]
Maxim Valyanskiy commented on TIKA-636:
---------------------------------------

It is a known problem in POI. AFAIK there is no event model for parsing docx, so we had to build the complete object (XMLBeans) tree to process it.

> Taking very high heap space while parsing docx - Resulting in OOM in tha app
> ----------------------------------------------------------------------------
>
> Key: TIKA-636
> URL: https://issues.apache.org/jira/browse/TIKA-636
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 0.9
> Environment: Linux box
> JDK 1.6
> Reporter: Jayesh K Rajpurohit
>
> I am using the Tika-core-0.9 jar together with the poi 3.2-Final and poi-3.7 jars
> to parse documents. While parsing a 3 MB docx file it uses 500 MB of RAM,
> which is too high and results in an OOM in the application.
> Do I need to tweak something to reduce the memory consumption?

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
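
To illustrate the difference between the object-tree approach described in the comment and an event model, here is a minimal sketch that bypasses POI entirely and streams the document body with the JDK's own StAX and ZIP classes. It is not a Tika or POI API: the class name `DocxTextStreamer` and the method `extractText` are hypothetical. It also assumes the main body lives in the conventional `word/document.xml` part, and it only collects the text of `w:t` runs, ignoring headers, footnotes, and formatting.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical helper, not part of Tika or POI.
public final class DocxTextStreamer {
    // WordprocessingML main namespace used by w:p and w:t elements.
    private static final String W_NS =
            "http://schemas.openxmlformats.org/wordprocessingml/2006/main";

    private DocxTextStreamer() {}

    // Returns the body text, with one '\n' appended after each paragraph.
    public static String extractText(InputStream docx)
            throws IOException, XMLStreamException {
        StringBuilder out = new StringBuilder();
        try (ZipInputStream zip = new ZipInputStream(docx)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                // Assumption: the body is stored under the conventional part name.
                if ("word/document.xml".equals(entry.getName())) {
                    appendBodyText(zip, out);
                    break;
                }
            }
        }
        return out.toString();
    }

    // Pulls XML events one at a time; no document tree is built.
    private static void appendBodyText(InputStream xmlPart, StringBuilder out)
            throws XMLStreamException {
        XMLInputFactory factory = XMLInputFactory.newInstance();
        factory.setProperty(XMLInputFactory.SUPPORT_DTD, Boolean.FALSE);
        XMLStreamReader xml = factory.createXMLStreamReader(xmlPart);
        try {
            boolean inText = false;
            while (xml.hasNext()) {
                switch (xml.next()) {
                    case XMLStreamConstants.START_ELEMENT:
                        if (isW(xml, "t")) inText = true;
                        break;
                    case XMLStreamConstants.CHARACTERS:
                    case XMLStreamConstants.CDATA:
                        if (inText) out.append(xml.getText());
                        break;
                    case XMLStreamConstants.END_ELEMENT:
                        if (isW(xml, "t")) inText = false;
                        else if (isW(xml, "p")) out.append('\n');
                        break;
                    default:
                        break;
                }
            }
        } finally {
            xml.close(); // closes the reader only, not the underlying ZIP stream
        }
    }

    private static boolean isW(XMLStreamReader xml, String localName) {
        return localName.equals(xml.getLocalName())
                && W_NS.equals(xml.getNamespaceURI());
    }
}
```

A call such as `DocxTextStreamer.extractText(new FileInputStream("report.docx"))` (file name made up for illustration) would return the paragraph text. Because events are consumed as they arrive, memory use should scale with the extracted text rather than with a full object tree, at the cost of losing everything POI's model would otherwise provide.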