[ https://issues.apache.org/jira/browse/SOLR-8988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Keith Laban updated SOLR-8988: ------------------------------ Attachment: Screen Shot 2016-04-25 at 2.55.00 PM.png Screen Shot 2016-04-25 at 2.54.47 PM.png SOLR-8988.patch Added second version of patch which has this feature disabled by default but can be enabled with {{facet.distrib.mco=true}}. I also did some benchmarking and under all scenarios tested the new way is either the same or way faster. The test was with 12 shards everything evenly distributed. Two things to note about this test: - All terms have the same count which would be the worst case for refinement which is evident in the shape of each graph. Overrequesting is far more efficient. - All segments are evenly distributed however in the real world, the best performance gains for this patch would be seen when there are many segments which contain no relevant terms for the query. > Improve facet.method=fcs performance in SolrCloud > ------------------------------------------------- > > Key: SOLR-8988 > URL: https://issues.apache.org/jira/browse/SOLR-8988 > Project: Solr > Issue Type: Improvement > Reporter: Keith Laban > Attachments: SOLR-8988.patch, SOLR-8988.patch, Screen Shot 2016-04-25 > at 2.54.47 PM.png, Screen Shot 2016-04-25 at 2.55.00 PM.png > > > This relates to SOLR-8559 -- which improves the algorithm used by fcs > faceting when {{facet.mincount=1}} > This patch allows {{facet.mincount}} to be sent as 1 for distributed queries. > As far as I can tell there is no reason to set {{facet.mincount=0}} for > refinement purposes . After trying to make sense of all the refinement logic, > I cant see how the difference between _no value_ and _value=0_ would have a > negative effect. > *Test perf:* > - ~15million unique terms > - query matches ~3million documents > *Params:* > {code} > facet.mincount=1 > facet.limit=500 > facet.method=fcs > facet.sort=count > {code} > *Average Time Per Request:* > - Before patch: ~20seconds > - After patch: <1 second > *Note*: all tests pass and in my test, the output was identical before and > after patch. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org