[ 
https://issues.apache.org/jira/browse/SOLR-2894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man updated SOLR-2894:
---------------------------

    Attachment: pivot_mincount_problem.sh

Something's wonky with the way mincount is handled - if you run the attached 
"pivot_mincount_problem.sh" script while a 2 node cluster is running with th 
example configs you can see the problem by comparing these 3 URLs...


* Pivot1: 
http://localhost:8983/solr/select?rows=0&wt=json&indent=true&q=single_7_s:%284%205%206%29&facet=true&facet.pivot=multi_50_ss,single_100_s&facet.limit=10
* Filter: 
http://localhost:8983/solr/select?rows=0&wt=json&indent=true&q=single_7_s:%284%205%206%29&fq=multi_50_ss:35&fq=single_100_s:79
* Pivot2: 
http://localhost:8983/solr/select?rows=0&wt=json&indent=true&q=single_7_s:%284%205%206%29&facet=true&facet.pivot=multi_50_ss,single_100_s&facet.limit=10&facet.pivot.mincount=10


According to the "Pivot1" URL, there are 4244 total docs matching the query, of 
those 586 match {{multi_50_ss:35}} and of those 13 match {{single_100_s:79}}

This all jives with what the "Filter" URL tells us (where we ignore the pivot 
facets and just apply those as filters)

But if we add {{facet.pivot.mincount=10}} to the original pivot request to get 
the "Pivot2" URL, no values for single_100_s make the cut as sub-facets of the 
586 {{multi_50_ss:35}} docs.

Looking at the logs of the shard queries, it appears that 
{{facet.pivot.mincount=-1}} is set _only_ on the refinement queries, but non in 
the initial sub-shard queries (where the limit over requesting happens to find 
the top terms).  So terms that don't match above the mincount on at least one 
_single_ shard won't be considered at all for the cumulative total.

> Implement distributed pivot faceting
> ------------------------------------
>
>                 Key: SOLR-2894
>                 URL: https://issues.apache.org/jira/browse/SOLR-2894
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Erik Hatcher
>             Fix For: 4.9, 5.0
>
>         Attachments: SOLR-2894-reworked.patch, SOLR-2894.patch, 
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, 
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, 
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, 
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, 
> SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, SOLR-2894.patch, 
> dateToObject.patch, pivot_mincount_problem.sh
>
>
> Following up on SOLR-792, pivot faceting currently only supports 
> undistributed mode.  Distributed pivot faceting needs to be implemented.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to