On Thu, 12 Aug 2010 14:32:19 +0200
Lannig Carina <lan...@ssi-schaefer-noell.com> wrote:

> Hi,
> 
> I'm trying to index a txt-File (~150MB) using Solr Cell/Tika.
> The curl command aborts due to a java.lang.OutOfMemoryError.
[...]
> AFAIK Tika keeps the whole file in RAM and posts it as one single
> string to Solr. I'm using JVM-args: Xmx1024M and solr default
> config with
[...]

Do not know about Tika, but what is the size of your Solr index,
and the number of documents in it? Solr seems to need RAM, and
while we did not do real benchmarks then, even with a few tens of
thousands of documents, performance seemed to improve by allocating
2GB RAM. Besides, unless you are on a very tight budget, throwing a
few GB more RAM at the problem seems to be an easy, and not
very expensive, way out.

Regards,
Gora

Reply via email to