[jira] [Updated] (TIKA-734) Out of memory exception with Xlsx file less than 5 MB

Anirban Mitra (Updated) (JIRA) Tue, 11 Oct 2011 11:09:36 -0700

     [ 
https://issues.apache.org/jira/browse/TIKA-734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Anirban Mitra updated TIKA-734:
-------------------------------

    Attachment: Sample BIG Excel 2007 File.xls

Hi,
The out-of-memory issue is now resolved, but we are seeing a serious 
performance problem when 10 concurrent users parse the attached 10 MB xlsx 
file: it takes around 15 minutes on average for 10 concurrent users to parse 
the document. After profiling the code with JProfiler, we found that 
AutoDetectParser.parse() takes most of the time and that many threads are 
waiting or blocked. I am using the XMLBeans jar xmlbeans-2.3.0.jar and 
xml-apis-1.0.b2.jar. Any suggestions would be helpful.
Thanks,
Anirban
                
> Out of memory exception with Xlsx file less than 5 MB
> -----------------------------------------------------
>
>                 Key: TIKA-734
>                 URL: https://issues.apache.org/jira/browse/TIKA-734
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.7
>         Environment: Windows Vista, JUnit test cases running in RAD, JVM 
> heap memory: 500 MB
>            Reporter: Anirban Mitra
>         Attachments: Sample BIG Excel 2007 File.xls
>
>
> I am trying to parse Xlsx files and extract a pattern from them. I tried 
> using a 5 MB file, and when I run my JUnit test cases they fail with a heap 
> out-of-memory exception. Is there a resolution for this?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
