[jira] [Resolved] (TIKA-734) Out of memory exception with Xlsx file less than 5 MB

Jukka Zitting (Resolved) (JIRA) Thu, 13 Oct 2011 12:35:40 -0700

     [ 
https://issues.apache.org/jira/browse/TIKA-734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Jukka Zitting resolved TIKA-734.
--------------------------------

    Resolution: Cannot Reproduce

I can't reproduce the problem you're describing.

On my computer, the following code, which parses the attached file 400 times in 
total across 20 concurrent threads, completes in less than a minute and 
requires less than 200MB of memory (about 10MB per thread).

{code}
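import java.io.File;
import org.apache.tika.Tika;

public class ParseLoadTest {
    public static void main(String[] args) {
        // One Tika facade and one input file, shared by all threads
        final Tika tika = new Tika();
        final File file = new File("Sample BIG Excel 2007 File.xls");
        // 20 threads x 20 parses each = 400 parses in total
        for (int i = 0; i < 20; i++) {
            new Thread(new Runnable() {
                public void run() {
                    for (int j = 0; j < 20; j++) {
                        try {
                            tika.parseToString(file);
                        } catch (Exception e) {
                            e.printStackTrace();
                        }
                    }
                }
            }).start();
        }
    }
}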
{code}
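
To put a rough number on the memory claim above, a harness along the following lines could be used. This is only a sketch, not part of the original test: the class name ConcurrentParseHarness, its run method, and the Result fields are hypothetical, and the heap figure is a coarse sample of used heap (it includes garbage that has not been collected yet), so it overstates what the parser strictly needs.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical helper: runs a task on several threads and samples heap usage.
final class ConcurrentParseHarness {

    // Outcome of one run: task calls that succeeded or failed, and the
    // largest used-heap sample observed after any single task call.
    static final class Result {
        final int succeeded;
        final int failed;
        final long peakUsedHeapBytes;

        Result(int succeeded, int failed, long peakUsedHeapBytes) {
            this.succeeded = succeeded;
            this.failed = failed;
            this.peakUsedHeapBytes = peakUsedHeapBytes;
        }
    }

    static Result run(int threads, final int iterationsPerThread, final Runnable task)
            throws InterruptedException {
        final AtomicInteger succeeded = new AtomicInteger();
        final AtomicInteger failed = new AtomicInteger();
        final AtomicLong peak = new AtomicLong();
        final CountDownLatch done = new CountDownLatch(threads);
        final Runtime rt = Runtime.getRuntime();

        for (int t = 0; t < threads; t++) {
            new Thread(() -> {
                try {
                    for (int j = 0; j < iterationsPerThread; j++) {
                        try {
                            task.run();
                            succeeded.incrementAndGet();
                        } catch (RuntimeException | OutOfMemoryError e) {
                            // Count the failure and keep going
                            failed.incrementAndGet();
                        }
                        // Used heap = allocated heap minus free space within it
                        long used = rt.totalMemory() - rt.freeMemory();
                        peak.accumulateAndGet(used, Math::max);
                    }
                } finally {
                    done.countDown();
                }
            }).start();
        }
        done.await(); // wait for every worker thread to finish
        return new Result(succeeded.get(), failed.get(), peak.get());
    }
}
```

To mirror the test above, the task passed to run(20, 20, ...) would wrap tika.parseToString(file) and rethrow its checked exceptions as a RuntimeException. Starting the JVM with -Xmx500m would approximate the 500MB heap from the reported environment.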
                
> Out of memory exception with Xlsx file less than 5 MB
> -----------------------------------------------------
>
>                 Key: TIKA-734
>                 URL: https://issues.apache.org/jira/browse/TIKA-734
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.7
>         Environment: Windows Vista, JUnit test cases running in RAD, JVM 
> heap memory - 500MB
>            Reporter: Anirban Mitra
>         Attachments: Sample BIG Excel 2007 File.xls
>
>
> I am trying to parse and extract a pattern from Xlsx files. I tried using a 5 
> MB file, and when I run my JUnit test cases, they fail with a heap 
> out-of-memory exception. Is there any resolution for this?

