We have huge amount of data (5Billion records, 3TB in size) organized in parent / child type in one index to enable the joins. My first question is, how should I allocate shards for this big index in order to make the parent/child query more efficient? Right now doing queries will cause out of memory on several nodes, and I have 7 VMs, with 64GMem, and 1T disk. Each Es has 32Gmem allocated to it. The index has 20 shards.
Any insights are helpful! Thanks, Chen -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscr...@googlegroups.com. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CACim9RkMgWAxAZnLagKjnZd_saoQdP0Gof7t0-MsK97d4F--yw%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.