[ https://issues.apache.org/jira/browse/SOLR-8768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15174008#comment-15174008 ]
Pablo Anzorena commented on SOLR-8768: -------------------------------------- UPDATE: It does not depend on the range facets. The problem is on the faceting with stats across multiple shards. Following with the example, 'sellers_2014' Tyrion, 1 Jon, 50 PoorNed, 4 'sellers_2015' Tyrion, 100 Jon, 50 PoorNed, 4 'sellers_2016' Tyrion, 1 Jon, 50 PoorNed, 4 If the request is top 2 sellers order by seller_measure desc, the response would be: {code} 1)Jon 150 2)Tyrion 100 {code} But the correct response would be Tyrion with 102, but it is not counting the +2 because Tyrion is not in the top 2 of the other two shards. > Wrong behaviour in json facets > ------------------------------ > > Key: SOLR-8768 > URL: https://issues.apache.org/jira/browse/SOLR-8768 > Project: Solr > Issue Type: Bug > Components: Facet Module > Reporter: Pablo Anzorena > > This bug is quite difficult to explain it, so I will first show it with an > example and then explain it. > I have a core splitted into three shards, let's call them 'sellers_2014', > 'sellers_2015', 'sellers_2016'. > The schema has the following fields: > seller_name, string > seller_measure, double > seller_date, date > With the following data. > 'sellers_2014' > Tyrion, 1, 2014-01-01T00:00:00Z > Jon, 50, 2014-01-01T00:00:00Z > PoorNed, 4, 2014-01-01T00:00:00Z > 'sellers_2015' > Tyrion, 100, 2015-01-01T00:00:00Z > Jon, 50, 2015-01-01T00:00:00Z > PoorNed, 4, 2015-01-01T00:00:00Z > 'sellers_2016' > Tyrion, 1, 2015-01-01T00:00:00Z > Jon, 50, 2015-01-01T00:00:00Z > PoorNed, 4, 2015-01-01T00:00:00Z > Request: > http://localhost:8983/solr/sellers_2016/select?q=*:*&shards=localhost:8983/solr/sellers_2014,localhost:8983/solr/sellers_2015,localhost:8983/solr/sellers_2016&json.facet= > {code} > { > top_sellers: { > type: terms, > field: seller_name, > limit: 2, > offset: 0, > sort: "seller_measure desc", > facet: { > seller_measure: "sum(seller_measure)", > seller_dates: { > type: range, > field: seller_date, > start: "2014-01-01T00:00:00Z", > end: "2016-12-31T00:00:00Z", > gap: "+1YEARS", > facet: { > seller_measure: "sum(seller_measure)" > } > } > } > } > } > {code} > So... With the request I want to know the top 2 sellers across the three > shards and for each seller, their seller_measure for each year. > The response I'm getting is: > {code} > "val": "Jon", > "count": 3, > "seller_measure": 150, > "seller_dates": { > "buckets": [ > { > "val": "2014-01-01T00:00:00Z", > "count": 1, > "seller_measure": 50 > }, > { > "val": "2015-01-01T00:00:00Z", > "count": 1, > "seller_measure": 50 > }, > { > "val": "2016-01-01T00:00:00Z", > "count": 1, > "seller_measure": 50 > } > ] > }, > "val": "Tyrion", > "count": 3, > "seller_measure": 102, > "seller_dates": { > "buckets": [ > { > "val": "2015-01-01T00:00:00Z", > "count": 1, > "seller_measure": 100 > } > ] > } > {code} > which is incorrect, because the two buckets of 2014 and 2016 in Tyrion are > missing. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org