On Thu, 12 Aug 2010 14:32:19 +0200 Lannig Carina <lan...@ssi-schaefer-noell.com> wrote:
> Hi,
>
> I'm trying to index a txt-File (~150MB) using Solr Cell/Tika.
> The curl command aborts due to a java.lang.OutOfMemoryError.
[...]
> AFAIK Tika keeps the whole file in RAM and posts it as one single
> string to Solr. I'm using JVM-args: Xmx1024M and solr default
> config with
[...]

Do not know about Tika, but what is the size of your Solr index,
and the number of documents in it? Solr seems to need RAM, and
while we did not do real benchmarks then, even with a few tens
of thousands of documents, performance seemed to improve by
allocating 2GB RAM.

Besides, unless you are on a very tight budget, throwing a few
GB more RAM at the problem seems to be an easy, and not very
expensive, way out.

Regards,
Gora