Re: concept and choice: custom sharding or auto sharding?

2015-09-23 Thread scott chu
From: Erick Erickson To: solr-user Date: 2015-09-04, 01:47:23 Subject: Re: Re: Re: Re: Re: concept and choice: custom sharding or auto sharding? Ah, that may make my suggestions unworkable re: just reindexing. Still, how much time are we talking about here? I've very often foun

Re: Re: Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Erick Erickson
Ah, that may make my suggestions unworkable re: just reindexing. Still, how much time are we talking about here? I've very often found that indexing performance isn't gated by the Solr processing, but by whatever is feeding Solr. A quick test is to fire up your indexing and see if the CPU utilizat

Re: Re: Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Toke Eskildsen
scott chu wrote: > No, both. But first I have to face the indexing performance problem. > Where can I see information about concurrent/parallel indexing on Solr? Depends on how you index. If you use a Java program, http://lucene.apache.org/solr/5_2_0/solr-solrj/index.html?org/apache/solr/client/s

Re: Re: Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread scott chu
: Re: Re: Re: concept and choice: custom sharding or auto sharding? scott chu wrote: > I keep forgetting to mention one thing during the discussion. > Our data is Chinese news articles and we use CJK tokenizer > (i.e. 2-gram) currently. Indexing is quite slow,

Re: Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Toke Eskildsen
scott chu wrote: > I keep forgetting to mention one thing during the discussion. > Our data is Chinese news articles and we use CJK tokenizer > (i.e. 2-gram) currently. Indexing is quite slow, > compared to indexing English articles. That's why I am so > worried about in
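
For readers unfamiliar with the 2-gram approach mentioned above, here is a minimal sketch in Python of overlapping bigram tokenization. It is only an illustration, not Lucene's CJK tokenizer; the function name `cjk_bigrams` is hypothetical, and the sketch assumes the input is a run of CJK characters (whitespace is simply dropped).

```python
def cjk_bigrams(text):
    """Split text into overlapping 2-character tokens.

    Simplified stand-in for 2-gram CJK tokenization: whitespace is
    dropped and every adjacent pair of characters becomes one token.
    """
    chars = [c for c in text if not c.isspace()]
    if len(chars) < 2:
        # A lone character (or empty input) cannot form a pair.
        return ["".join(chars)] if chars else []
    return ["".join(chars[i:i + 2]) for i in range(len(chars) - 1)]


tokens = cjk_bigrams("新聞文章")
print(tokens)  # ['新聞', '聞文', '文章']
```

A run of n characters produces n-1 tokens, so the index receives roughly one token per character, which may be part of why such text indexes more slowly than whitespace-delimited English.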

Re: Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread scott chu
year using MMSeg algorithm or 1-ngram+query-preprocessor). - Original Message - From: Erick Erickson To: solr-user Date: 2015-09-04, 00:07:43 Subject: Re: Re: Re: concept and choice: custom sharding or auto sharding? bq: If you switch to SolrCloud, will you still keep numShards para

Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Erick Erickson
> to SolrCloud possibly because I want to take advantage of the ZooKeeper > functionality for HA/DR. > > - Original Message ----- > From: Toke Eskildsen > To: solr-user > Date: 2015-09-03, 18:33:39 > Subject: Re: Re: concept and choice: custom sharding or auto sharding? >

Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread scott chu
--- Original Message - From: Toke Eskildsen To: solr-user Date: 2015-09-03, 18:33:39 Subject: Re: Re: concept and choice: custom sharding or auto sharding? On Thu, 2015-09-03 at 18:24 +0800, Scott Chu wrote: > Do you use master-slave or SolrCloud for that single shard? Due to legacy reasons we

Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Toke Eskildsen
On Thu, 2015-09-03 at 18:24 +0800, Scott Chu wrote: > Do you use master-slave or SolrCloud for that single shard? Due to legacy reasons we are just using 2 fully independent Solrs, each indexing independently, with an Apache load balancer in front for the searches. It does give us the occasional h

Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Scott Chu
5/9/3 - Original Message - From: Toke Eskildsen To: solr-user Date: 2015-09-03, 17:46:22 Subject: Re: Re: concept and choice: custom sharding or auto sharding? On Wed, 2015-09-02 at 08:30 -0700, Erick Erickson wrote: > Because I routinely see 50M docs on a single node and I've seen o

Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Toke Eskildsen
On Wed, 2015-09-02 at 08:30 -0700, Erick Erickson wrote: > Because I routinely see 50M docs on a single node and I've seen over 300M docs > on a single node with sub-second responses. For what it's worth, we also do article-based search of newspaper based material (old OCR'ed papers). We use a sin

Re: concept and choice: custom sharding or auto sharding?

2015-09-03 Thread Charlie Hull
On 02/09/2015 15:47, scott chu wrote: > I posted a question on Stack Overflow > http://stackoverflow.com/questions/32343813/custom-sharding-or-auto-sharding-on-solrcloud: > However, since this is a mailing list, I am reposting the question below to ask > for suggestions and the subtler concepts of SolrClou

Re: concept and choice: custom sharding or auto sharding?

2015-09-02 Thread Shawn Heisey
On 9/2/2015 9:19 AM, scott chu wrote: > Mail > Do you mean I only have to put 10M documents in one index and copy > it to many slaves in a classic Solr master-slave architecture to > provide querying service on the internet, and it won't have obvious > degradation of query performance? But I did have add

Re: Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-02 Thread scott chu
solr-user, hello. Sorry, wrong again. Auto sharding is not the implicit router. - Original Message - From: scott chu To: solr-user Date: 2015-09-02, 23:50:20 Subject: Re: Re: Re: concept and choice: custom sharding or auto sharding? solr-user, hello. Thanks! I'll go back to check m

Re: Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-02 Thread scott chu
o use SolrCloud after all, I may have to use auto sharding (i.e. implicit router). - Original Message - From: Erick Erickson To: solr-user Date: 2015-09-02, 23:30:53 Subject: Re: Re: concept and choice: custom sharding or auto sharding? bq: Why do you say: "at 10M documents there'

Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-02 Thread Erick Erickson
nce is kinda sad. Why do you say: "at 10M documents there's rarely a > need to shard at all?" Do I provide too few slaves? What amount of documents > is suitable for a need for shard in SolrCloud? > > - Original Message ----- > > From: Erick Erickson > To:

Re: Re: concept and choice: custom sharding or auto sharding?

2015-09-02 Thread scott chu
oud? - Original Message - From: Erick Erickson To: solr-user Date: 2015-09-02, 23:00:29 Subject: Re: concept and choice: custom sharding or auto sharding? Frankly, at 10M documents there's rarely a need to shard at all. Why do you think you need to? This seems like adding comp

Re: concept and choice: custom sharding or auto sharding?

2015-09-02 Thread Erick Erickson
Frankly, at 10M documents there's rarely a need to shard at all. Why do you think you need to? This seems like adding complexity for no good reason. Sharding should only really be used when you have too many documents to fit on a single shard as it adds some overhead, restricts some possibilities (

concept and choice: custom sharding or auto sharding?

2015-09-02 Thread scott chu
I posted a question on Stack Overflow http://stackoverflow.com/questions/32343813/custom-sharding-or-auto-sharding-on-solrcloud: However, since this is a mailing list, I am reposting the question below to ask for suggestions and the subtler concepts of SolrCloud's behavior on document routing. I want to es