Hi Sangri, One more thing - you should post your question to the [email protected] list too, as this is more of a user question...
Thanks, and HTH! Cheers, Chris On 3/22/10 6:53 PM, "sangri" <[email protected]> wrote: Hello I'm using Tika on my final year project. I want to parse an XML document that is very large around 90MB. I have Apache Tika 0.6 and when I run the command: java -jar tika-app-0.6.jar -g theXMLfile.xml I see the output on the command prompt, showing the data extracted from the XML file. But after like 30 minutes, Tika crashes with an OutOfMemory Exception. Can someone help me with this issue? How can I fix this, is there a way to set the heap size when running Tika? Thanks in advance. ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Chris Mattmann, Ph.D. Senior Computer Scientist NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 171-266B, Mailstop: 171-246 Email: [email protected] WWW: http://sunset.usc.edu/~mattmann/ ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Adjunct Assistant Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
