Thanks Adrien!

Very much appreciate your time and help.

Chris


On Mon, Aug 25, 2014 at 3:44 AM, Adrien Grand <
adrien.gr...@elasticsearch.com> wrote:

> I meant tens of shards per node. So if you have N nodes with I indices
> which have S shards and R replicas, that would be (I * S * (1 + R)) / N.
>
> One shard per node is optimal but doesn't allows for growth: if you add
> one more node, you cannot spread the indexing work load, that is why it is
> common to have a few shards per node in order to allow elasticsearch to
> spread the load in case you would introduce a new node in your cluster to
> improve your cluster capacity.
>
>
> On Mon, Aug 25, 2014 at 12:07 AM, Chris Neal <chris.n...@derbysoft.net>
> wrote:
>
>> Adrien,
>>
>> Thanks so much for the response.  It was very helpful.  I will check out
>> those links on capacity planning for sure.
>>
>> One followup question.  You mention that tens of shards per node would be
>> ok.  Are you meaning tens of shards from tens of indexes?  Or tens of
>> shards for a single index?  Right now I have two servers configured with
>> the index getting 2 shards (one per server), and 1 replica (per server).
>>
>> Chris
>>
>>
>> On Fri, Aug 22, 2014 at 5:58 PM, Adrien Grand <
>> adrien.gr...@elasticsearch.com> wrote:
>>
>>> Hi Chris,
>>>
>>> Usually, the problem is not that much in terms of indices but shards,
>>> which are the physical units of data storage (an index being a logical view
>>> over several shards).
>>>
>>> Something to beware of is that shards typically have some constant
>>> overhead (disk space, file descriptors, memory usage) that does not depend
>>> on the amount of data that they store. Although it would be ok to have up
>>> to a few tens of shards per nodes, you should avoid to have eg. thousands
>>> of shards per node.
>>>
>>> if you plan on always adding a filter for a specific application in your
>>> search requests, then splitting by application makes sense since this will
>>> make the filter useless at search time, you will just need to query the
>>> application-specific index. On the other hand if you don't filter by
>>> application, then splitting data by yourself into smaller indices would be
>>> pretty equivalent to storing everything in a single index with a higher
>>> number of shards.
>>>
>>> You might want to check out the following resources that talk about
>>> capacity planning:
>>>  - http://www.elasticsearch.org/videos/big-data-search-and-analytics/
>>>  -
>>> http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/capacity-planning.html
>>>
>>>
>>>
>>> On Fri, Aug 22, 2014 at 9:08 PM, Chris Neal <chris.n...@derbysoft.net>
>>> wrote:
>>>
>>>> Hi all,
>>>>
>>>> As the subject says, I'm wondering about index size vs. number of
>>>> indexes.
>>>>
>>>> I'm indexing many application log files, currently with an index by day
>>>> for all logs, which will make a very large index.  For just a few
>>>> applications in Development, the index is 55GB a day (across 2 servers).
>>>>  In prod with all applications, it will be "much more than that".  1TB a
>>>> day maybe?
>>>>
>>>> I'm wondering if there is value in splitting the indexes by day and by
>>>> application, which would produce more indexes per day, but they would be
>>>> smaller, vs. value in having a single, mammoth index by day alone.
>>>>
>>>> Is it just a resource question?  If I have enough RAM/disk/CPU to
>>>> support a "mammoth" index, then I'm fine?  Or are there other reasons to
>>>> (or to not) split up indexes?
>>>>
>>>> Very much appreciate your time.
>>>> Chris
>>>>
>>>> --
>>>> You received this message because you are subscribed to the Google
>>>> Groups "elasticsearch" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>> an email to elasticsearch+unsubscr...@googlegroups.com.
>>>> To view this discussion on the web visit
>>>> https://groups.google.com/d/msgid/elasticsearch/CAND3DphfsYx0LW0M-yvLWGauRSzVWG0etaBkiTrN7zVafq7tMA%40mail.gmail.com
>>>> <https://groups.google.com/d/msgid/elasticsearch/CAND3DphfsYx0LW0M-yvLWGauRSzVWG0etaBkiTrN7zVafq7tMA%40mail.gmail.com?utm_medium=email&utm_source=footer>
>>>> .
>>>> For more options, visit https://groups.google.com/d/optout.
>>>>
>>>
>>>
>>>
>>> --
>>> Adrien Grand
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "elasticsearch" group.
>>> To unsubscribe from this group and stop receiving emails from it, send
>>> an email to elasticsearch+unsubscr...@googlegroups.com.
>>> To view this discussion on the web visit
>>> https://groups.google.com/d/msgid/elasticsearch/CAL6Z4j5i7AAnasMYZgR83aTXvELan%3DkR6OLvGYKfs9d5Subi4A%40mail.gmail.com
>>> <https://groups.google.com/d/msgid/elasticsearch/CAL6Z4j5i7AAnasMYZgR83aTXvELan%3DkR6OLvGYKfs9d5Subi4A%40mail.gmail.com?utm_medium=email&utm_source=footer>
>>> .
>>>
>>> For more options, visit https://groups.google.com/d/optout.
>>>
>>
>>  --
>> You received this message because you are subscribed to the Google Groups
>> "elasticsearch" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to elasticsearch+unsubscr...@googlegroups.com.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/elasticsearch/CAND3Dph9Z1My%2B2%2BQ-NM-sWNn2vT1qktDi6%2BmR-b9rFN-Xc-_pw%40mail.gmail.com
>> <https://groups.google.com/d/msgid/elasticsearch/CAND3Dph9Z1My%2B2%2BQ-NM-sWNn2vT1qktDi6%2BmR-b9rFN-Xc-_pw%40mail.gmail.com?utm_medium=email&utm_source=footer>
>> .
>>
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>
>
> --
> Adrien Grand
>
> --
> You received this message because you are subscribed to the Google Groups
> "elasticsearch" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to elasticsearch+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/elasticsearch/CAL6Z4j5KGu34xCh6e5PKFm30U8mNAf-0acd7%3DQMAVuriL3msyA%40mail.gmail.com
> <https://groups.google.com/d/msgid/elasticsearch/CAL6Z4j5KGu34xCh6e5PKFm30U8mNAf-0acd7%3DQMAVuriL3msyA%40mail.gmail.com?utm_medium=email&utm_source=footer>
> .
>
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAND3DpjrpaV7NB87%3DLxRmAa%2B0RUgQ3oY%3DtuaB9Bc274e9jG9og%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to