Hi Chris,

Usually, the problem is not that much in terms of indices but shards, which
are the physical units of data storage (an index being a logical view over
several shards).

Something to beware of is that shards typically have some constant overhead
(disk space, file descriptors, memory usage) that does not depend on the
amount of data that they store. Although it would be ok to have up to a few
tens of shards per nodes, you should avoid to have eg. thousands of shards
per node.

if you plan on always adding a filter for a specific application in your
search requests, then splitting by application makes sense since this will
make the filter useless at search time, you will just need to query the
application-specific index. On the other hand if you don't filter by
application, then splitting data by yourself into smaller indices would be
pretty equivalent to storing everything in a single index with a higher
number of shards.

You might want to check out the following resources that talk about
capacity planning:
 - http://www.elasticsearch.org/videos/big-data-search-and-analytics/
 -
http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/capacity-planning.html



On Fri, Aug 22, 2014 at 9:08 PM, Chris Neal <chris.n...@derbysoft.net>
wrote:

> Hi all,
>
> As the subject says, I'm wondering about index size vs. number of indexes.
>
> I'm indexing many application log files, currently with an index by day
> for all logs, which will make a very large index.  For just a few
> applications in Development, the index is 55GB a day (across 2 servers).
>  In prod with all applications, it will be "much more than that".  1TB a
> day maybe?
>
> I'm wondering if there is value in splitting the indexes by day and by
> application, which would produce more indexes per day, but they would be
> smaller, vs. value in having a single, mammoth index by day alone.
>
> Is it just a resource question?  If I have enough RAM/disk/CPU to support
> a "mammoth" index, then I'm fine?  Or are there other reasons to (or to
> not) split up indexes?
>
> Very much appreciate your time.
> Chris
>
> --
> You received this message because you are subscribed to the Google Groups
> "elasticsearch" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to elasticsearch+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/elasticsearch/CAND3DphfsYx0LW0M-yvLWGauRSzVWG0etaBkiTrN7zVafq7tMA%40mail.gmail.com
> <https://groups.google.com/d/msgid/elasticsearch/CAND3DphfsYx0LW0M-yvLWGauRSzVWG0etaBkiTrN7zVafq7tMA%40mail.gmail.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>



-- 
Adrien Grand

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAL6Z4j5i7AAnasMYZgR83aTXvELan%3DkR6OLvGYKfs9d5Subi4A%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to