[ https://issues.apache.org/jira/browse/SOLR-12795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Amrit Sarkar updated SOLR-12795: -------------------------------- Description: Today facetStream takes a "bucketSizeLimit" parameter. Here is what the doc says about this parameter - The number of buckets to include. This value is applied to each dimension. Now let's say we create a facet stream with 3 nested facets. For example "year_i,month_i,day_i" and provide 10 as the bucketSizeLimit. FacetStream would return 10 results to us for this facet expression while the total number of unqiue values are 1000 (10*10*10 ) The API should have a separate parameter "limit" which limits the number of tuples while bucketSizeLimit should be used to specify the size of each bucket in the JSON Facet API. was: Let's look at an observation regarding "bucketSizeLimit" in facetStream; and how we interpret it as a "limit". Suppose for 3 nested facets, bucketSizeLimit = 10, we receive total 1000 rows. since bucketSizeLimit = limit; ONLY the first top-level facet value's count will be returned; out of 10*10*10, 1*1*10th rows will be fetched. And the behavior will be consistent for any bucketSizeLimit we set, How about we have a separate parameter "limit" other than "bucketSizeLimit" which can be set to any arbitrary number (though should be < bucketSizeLimit^no_of_nested_facets), and that limit can be said "500". In this way, we will have the true SQL limit feature in place in FacetStream. > Introduce 'limit' parameter in FacetStream. > ------------------------------------------- > > Key: SOLR-12795 > URL: https://issues.apache.org/jira/browse/SOLR-12795 > Project: Solr > Issue Type: Sub-task > Security Level: Public(Default Security Level. Issues are Public) > Components: streaming expressions > Reporter: Amrit Sarkar > Priority: Major > Attachments: SOLR-12795.patch > > > Today facetStream takes a "bucketSizeLimit" parameter. Here is what the doc > says about this parameter - The number of buckets to include. This value is > applied to each dimension. > Now let's say we create a facet stream with 3 nested facets. For example > "year_i,month_i,day_i" and provide 10 as the bucketSizeLimit. > FacetStream would return 10 results to us for this facet expression while the > total number of unqiue values are 1000 (10*10*10 ) > The API should have a separate parameter "limit" which limits the number of > tuples while bucketSizeLimit should be used to specify the size of each > bucket in the JSON Facet API. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org