Re: Re: Re: Solr Cell Questions

2012-09-25 Thread Erick Erickson
> http://wiki.apache.org/solr/ExtractingRequestHandler#SolrJ > > > Erick Erickson wrote on 25.09.2012 15:47:34: > >> From: >> >> Erick Erickson >> >> To: >> >> solr-user@lucene.apache.org >> >> Date: >> >> 25.09.2012 15:48 >>

Re: Solr Cell Questions

2012-09-25 Thread Jack Krupansky
a separate process) to minimize thread issues, GC issues, hung parsers, etc. -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Tuesday, September 25, 2012 10:24 AM To: solr-user@lucene.apache.org Subject: Re: Solr Cell Questions Are you by any chance committing

Re: Solr Cell Questions

2012-09-25 Thread Alexandre Rafalovitch
Are you by any chance committing after every file is indexed? That could cause the speed issues. Also, have you tried optimizing your indexer's Java memory params? I use this for mine, which used to run out of memory as well: java -server -Xms512m -Xmx2048m Regards, Alex. P.s. I may have so
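To illustrate the point about committing after every file, here is a minimal sketch of batching documents and committing once at the end. It deliberately does not use SolrJ: IndexClient, CountingClient, indexAll, and the batch size of 1000 are hypothetical names and values chosen for illustration, and the in-memory client only counts calls so the sketch runs without a Solr server.

```java
import java.util.ArrayList;
import java.util.List;

public class BatchCommitSketch {

    /** Hypothetical stand-in for an indexing client; this is NOT the SolrJ API. */
    interface IndexClient {
        void add(List<String> docs);
        void commit();
    }

    /** In-memory client that only counts calls, so no server is needed. */
    static class CountingClient implements IndexClient {
        int addCalls = 0;
        int commitCalls = 0;
        int docsReceived = 0;

        @Override
        public void add(List<String> docs) {
            addCalls++;
            docsReceived += docs.size();
        }

        @Override
        public void commit() {
            commitCalls++;
        }
    }

    /** Sends docs in batches of batchSize and commits once after the last batch. */
    static void indexAll(IndexClient client, List<String> docs, int batchSize) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        List<String> batch = new ArrayList<>(batchSize);
        for (String doc : docs) {
            batch.add(doc);
            if (batch.size() == batchSize) {
                client.add(new ArrayList<>(batch)); // hand over a copy, then reuse the buffer
                batch.clear();
            }
        }
        if (!batch.isEmpty()) {
            client.add(new ArrayList<>(batch)); // flush the final partial batch
        }
        client.commit(); // a single commit instead of one per file
    }

    public static void main(String[] args) {
        List<String> docs = new ArrayList<>();
        for (int i = 0; i < 2500; i++) {
            docs.add("doc-" + i);
        }
        CountingClient client = new CountingClient();
        indexAll(client, docs, 1000);
        System.out.println(client.addCalls + " add calls, " + client.commitCalls + " commit");
    }
}
```

With a real client, the same shape would apply: the add and commit calls would go to the server, and the batch size would need tuning against document size and available heap.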

Reply: Re: Re: Solr Cell Questions

2012-09-25 Thread Johannes . Schwendinger
2 15:47:34: > From: > > Erick Erickson > > To: > > solr-user@lucene.apache.org > > Date: > > 25.09.2012 15:48 > > Subject: > > Re: Re: Solr Cell Questions > > bq: how many documents per minute, second, whatever can I put into Solr >

Re: Re: Solr Cell Questions

2012-09-25 Thread Erick Erickson
ings? > > > Best > Johannes > > Erick Erickson wrote on 25.09.2012 00:22:26: > >> From: >> >> Erick Erickson >> >> To: >> >> solr-user@lucene.apache.org >> >> Date: >> >> 25.09.2012 00:23 >> >> Bet

Reply: Re: Solr Cell Questions

2012-09-25 Thread Johannes . Schwendinger
kson > > To: > > solr-user@lucene.apache.org > > Date: > > 25.09.2012 00:23 > > Subject: > > Re: Solr Cell Questions > > If you're concerned about throughput, consider moving all the > SolrCell (Tika) processing off the server. SolrC

Re: Solr Cell Questions

2012-09-24 Thread Erick Erickson
If you're concerned about throughput, consider moving all the SolrCell (Tika) processing off the server. SolrCell is way cool for showing what can be done, but its downside is you're moving all the processing of the structured documents to the same machine doing the indexing. Pretty soon, especiall