Reply: Re: Solr Cell Questions

2012-09-25 Thread Johannes . Schwendinger
erickerick...@gmail.com To: solr-user@lucene.apache.org Date: 25.09.2012 00:23 Subject: Re: Solr Cell Questions If you're concerned about throughput, consider moving all the SolrCell (Tika) processing off the server. SolrCell is way cool for showing what can be done, but its

Re: Re: Solr Cell Questions

2012-09-25 Thread Erick Erickson
...@gmail.com wrote on 25.09.2012 00:22:26: From: Erick Erickson erickerick...@gmail.com To: solr-user@lucene.apache.org Date: 25.09.2012 00:23 Subject: Re: Solr Cell Questions If you're concerned about throughput, consider moving all the SolrCell (Tika) processing off the server

Reply: Re: Re: Solr Cell Questions

2012-09-25 Thread Johannes . Schwendinger
on 25.09.2012 15:47:34: From: Erick Erickson erickerick...@gmail.com To: solr-user@lucene.apache.org Date: 25.09.2012 15:48 Subject: Re: Re: Solr Cell Questions bq: how many documents per minute, second, what ever can i put into solr Too many variables to say. I've seen

Re: Solr Cell Questions

2012-09-25 Thread Alexandre Rafalovitch
Are you by any chance committing after every file being indexed? That could cause the speed issues. Also, have you tried to optimize your indexer's Java memory params? I use this for mine, which used to run out of memory as well: java -server -Xms512m -Xmx2048m Regards, Alex. P.S. I may have
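The point about committing after every file can be sketched roughly as follows. This is only an illustration, not code from the thread: `index_in_batches`, `send_batch`, `commit`, and the batch size of 100 are all hypothetical names and values, and the two callables stand in for whatever client actually talks to Solr.

```python
from typing import Callable, Iterable, List


def index_in_batches(
    docs: Iterable[dict],
    send_batch: Callable[[List[dict]], None],  # hypothetical: posts one batch of docs
    commit: Callable[[], None],                # hypothetical: issues a single commit
    batch_size: int = 100,                     # assumed value, tune for your setup
) -> int:
    """Send docs in batches and commit once at the end.

    Returns the number of batches sent.
    """
    batch: List[dict] = []
    batches_sent = 0
    for doc in docs:
        batch.append(doc)
        if len(batch) >= batch_size:
            send_batch(batch)
            batches_sent += 1
            batch = []  # start a fresh list so the sent batch is left untouched
    if batch:
        # Flush the final, partially filled batch.
        send_batch(batch)
        batches_sent += 1
    # One commit for the whole run instead of one per file.
    commit()
    return batches_sent
```

The only design choice here is that the commit happens once per run rather than once per document; whether a periodic commit in between makes sense depends on how soon the documents need to be searchable.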

Re: Solr Cell Questions

2012-09-25 Thread Jack Krupansky
as a separate process) to minimize thread issues, GC issues, hung parsers, etc. -- Jack Krupansky -Original Message- From: Alexandre Rafalovitch Sent: Tuesday, September 25, 2012 10:24 AM To: solr-user@lucene.apache.org Subject: Re: Solr Cell Questions Are you by any chance committing

Re: Re: Re: Solr Cell Questions

2012-09-25 Thread Erick Erickson
://wiki.apache.org/solr/ExtractingRequestHandler#SolrJ Erick Erickson erickerick...@gmail.com wrote on 25.09.2012 15:47:34: From: Erick Erickson erickerick...@gmail.com To: solr-user@lucene.apache.org Date: 25.09.2012 15:48 Subject: Re: Re: Solr Cell Questions bq: how many documents per

Solr Cell Questions

2012-09-24 Thread Johannes . Schwendinger
Hi, I'm currently experimenting with Solr Cell to index files to Solr. Some questions came up along the way. 1. Is it possible (and wise) to connect to Solr Cell with multiple threads at the same time to index several documents at the same time? This question came up because my program takes

Re: Solr Cell Questions

2012-09-24 Thread Erick Erickson
If you're concerned about throughput, consider moving all the SolrCell (Tika) processing off the server. SolrCell is way cool for showing what can be done, but its downside is you're moving all the processing of the structured documents to the same machine doing the indexing. Pretty soon,
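A rough sketch of what moving the extraction off the server could look like on the client side, which also touches the earlier question about using several threads. Everything here is an assumption for illustration: `extract_off_server` and `extract_text` are hypothetical names (the latter standing in for a Tika-based extractor running on the client), and the `id` and `text` field names are made up rather than taken from any schema in the thread.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, List


def extract_off_server(
    paths: Iterable[str],
    extract_text: Callable[[str], str],  # hypothetical client-side extractor
    max_workers: int = 4,                # assumed value, tune for your hardware
) -> List[dict]:
    """Extract text on the client in parallel and return ready-to-index docs."""

    def to_doc(path: str) -> dict:
        # Field names "id" and "text" are illustrative only.
        return {"id": path, "text": extract_text(path)}

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map yields results in input order, even if workers finish out of order.
        return list(pool.map(to_doc, paths))
```

With this shape, the indexing server only receives plain fields, and the parsing load can be spread over as many client machines as needed.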