For the record, the bottleneck would not be on the master node (the node that manages the cluster state) but on the node that coordinates the execution of the search request, which is the node that your client contacts. So if you are doing costly terms aggregations with high shard sizes, it would help to round-robin between several nodes.
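As a rough illustration of the round-robin idea, here is a minimal sketch in Python. The node addresses, the index name `articles`, and the field name `author` are all made up for the example, and it only builds URLs and request bodies rather than sending anything. Many client libraries can also rotate over a list of hosts for you, so check yours before rolling your own.

```python
import itertools
import json


def make_url_picker(nodes, index):
    """Return a function that yields the _search URL of the next node in turn."""
    node_cycle = itertools.cycle(nodes)

    def next_search_url():
        # Each call moves on to the next node, so the reduce work of
        # consecutive requests is spread over all listed nodes.
        return "{}/{}/_search".format(next(node_cycle), index)

    return next_search_url


def top_authors_body(size, shard_size):
    """Build a terms aggregation body; the field name 'author' is an assumption."""
    return {
        "size": 0,  # no hits needed, only the aggregation
        "aggs": {
            "top_authors": {
                "terms": {"field": "author", "size": size, "shard_size": shard_size}
            }
        },
    }


if __name__ == "__main__":
    # Hypothetical node addresses; replace them with your own nodes.
    nodes = ["http://es-node-1:9200", "http://es-node-2:9200", "http://es-node-3:9200"]
    next_search_url = make_url_picker(nodes, "articles")
    body = json.dumps(top_authors_body(size=10, shard_size=200))
    for _ in range(4):
        print(next_search_url(), body)
```

Each request would then be sent to whatever URL the picker returns, so no single node ends up coordinating every expensive aggregation.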
If you are interested in the accuracy issues of the terms aggregation, I would recommend reading http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/search-aggregations-bucket-terms-aggregation.html#search-aggregations-bucket-terms-aggregation-approximate-counts and upgrading to Elasticsearch 1.4, which now returns an error bound on the counts, so that you know how far off the counts might be. The only way to improve accuracy is to increase the shard size, but as you noted, this raises issues too.

On Thu, Dec 18, 2014 at 8:27 AM, yang ming <ymb...@gmail.com> wrote:
>
> Hi All
>
> We use the terms aggregation to get the top n authors, but the
> aggregation may not return the top n authors.
>
> As the Elasticsearch guide says, the aggregated results are not always
> accurate.
>
> Indeed we can increase the shard size to get more accurate results,
> but if the bucket lists returned by each shard are big enough, there will
> be a bottleneck on the master node when it reduces the final result.
>
> Is there another way to improve the accuracy of the terms aggregation?
>
> Is there a good way to reduce the pressure on the master node when
> executing the reduce phase?
>
> Thanks
>
> --
> You received this message because you are subscribed to the Google Groups
> "elasticsearch" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to elasticsearch+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/elasticsearch/0b83a2d6-8bd0-41dc-9e58-3b797949ca53%40googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.

--
Adrien Grand
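To make the shard size trade-off in the reply concrete, here is a small pure-Python simulation. It is a simplified model under stated assumptions, not Elasticsearch's actual code: each "shard" is a dict of made-up author counts, each shard returns only its `shard_size` most frequent terms, and the coordinating node sums what it receives. The error bound is modelled as the sum, over shards that did not return a term, of the smallest count each of those shards did return.

```python
from collections import Counter


def shard_top_terms(shard_counts, shard_size):
    """Return the shard_size most frequent (term, count) pairs of one shard."""
    return Counter(shard_counts).most_common(shard_size)


def reduce_on_coordinator(shards, size, shard_size):
    """Merge per-shard top lists into [(term, count, error_upper_bound)]."""
    merged = Counter()
    responses = []
    for shard in shards:
        top = shard_top_terms(shard, shard_size)
        truncated = len(shard) > len(top)
        # A term this shard left out can have at most as many docs as the
        # smallest count it returned; nothing is left out if not truncated.
        cutoff = top[-1][1] if truncated and top else 0
        responses.append((dict(top), cutoff))
        merged.update(dict(top))
    results = []
    for term, count in merged.most_common(size):
        bound = sum(cutoff for returned, cutoff in responses if term not in returned)
        results.append((term, count, bound))
    return results


# Made-up data: "carol" is never first on any shard but is first overall.
SHARDS = [
    {"alice": 12, "bob": 9, "carol": 8},
    {"dave": 11, "erin": 9, "carol": 8},
    {"frank": 10, "grace": 9, "carol": 8},
]

if __name__ == "__main__":
    print(reduce_on_coordinator(SHARDS, size=1, shard_size=2))
    print(reduce_on_coordinator(SHARDS, size=1, shard_size=3))
```

In this toy data, a shard size of 2 makes the merge miss "carol" entirely and report a non-zero error bound for the winner, while a shard size of 3 recovers the true top author with a bound of 0. The cost is that every shard ships a longer list to the coordinating node.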