Re: Number of shards in 4 node Cluster

2015-03-18 Thread Georgi Ivanov
My rules is : 1 primary shard per server. Also make some estimation how big will be the single index/shard I think it is not good if single shard exceed 10 GB, although there is no exact limit. Georgi On Tuesday, March 17, 2015 at 7:00:23 PM UTC+1, John S wrote: Hi All, Is there any

Re: Number of shards in 4 node Cluster

2015-03-18 Thread Mark Walkom
We recommend shards no larger than 50GB, but as you mention there is no exact limit. On 18 March 2015 at 04:09, Georgi Ivanov georgi.r.iva...@gmail.com wrote: My rules is : 1 primary shard per server. Also make some estimation how big will be the single index/shard I think it is not good if

Re: Number of shards in 4 node Cluster

2015-03-18 Thread Andrew Selden
As Mark said, there is no hard limit on how big a single shard can be, but just so it’s clear, 10GB is actually quite small for a single shard. It’s not at all uncommon for me to see shards with upwards of 60 GB or more. On Mar 18, 2015, at 4:09 AM, Georgi Ivanov georgi.r.iva...@gmail.com

Re: Number of shards in 4 node Cluster

2015-03-18 Thread Andrej Rosenheinrich
Hi Mark, may I ask what the reason for this recommendation is? Thanks, Andrej Am Mittwoch, 18. März 2015 17:50:09 UTC+1 schrieb Mark Walkom: We recommend shards no larger than 50GB, but as you mention there is no exact limit. On 18 March 2015 at 04:09, Georgi Ivanov georgi@gmail.com

Re: Number of shards in 4 node Cluster

2015-03-18 Thread Mark Walkom
Part of it is based on knowledge picked up from our customers, part of it is that once you have to start shifting files larger than this around (during reallocation or recovery) it can take excessive time. There is also a ~2 billion hard limit for documents in a single shard, which is a lucene

Re: Number of shards in 4 node Cluster

2015-03-17 Thread Mark Walkom
What sort of data do you have, time based or static? If it's the former then going with any arbitrary number is less of a problem as you can change this the next roll over period. If it's static then 4 would be a good start. There aren't any metrics around this, other than *not* creating a large

Re: Number of shards in 4 node Cluster

2015-03-17 Thread Andrew Selden
I typically suggest to start with the default of 5 shards. A single shard can hold several tens of gigabytes. Certainly in your case it seems like 20 shards is overkill for a 4 node cluster. On Mar 17, 2015, at 11:00 AM, John S bun...@gmail.com wrote: Hi All, Is there any best