[ 
https://issues.apache.org/jira/browse/SOLR-17058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

wei wang updated SOLR-17058:
----------------------------
    Description: 
When distributed IDF is enabled in SolrCloud by adding one of the stats cache 
implementations to solrconfig.xml 
[https://solr.apache.org/guide/solr/latest/deployment-guide/solrcloud-distributed-requests.html#distributedidf],
each Solr query incurs a distributed shard request to fetch term statistics, as 
the debug output shows:

"debug": {
        "track": {
            "rid": "-54",
            "PARSE_QUERY": {
                "http://192.168.0.34:8987/solr/shard2_replica_n1/": {
                    "QTime": "2",
                    "ElapsedTime": "13",
                    "RequestPurpose": "GET_TERM_STATS",
                    …
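
To check whether a given query pays for this extra round trip, the debug track can be scanned for shard requests whose purpose is GET_TERM_STATS. The following is only a rough sketch: the helper name term_stats_requests is hypothetical, the sample repeats just the fields shown above (the elided ones are left out), and treating RequestPurpose as a comma-separated list is an assumption.

```python
import json

# Sample built from the debug output above; fields elided there are omitted here.
SAMPLE = json.loads("""
{
  "debug": {
    "track": {
      "rid": "-54",
      "PARSE_QUERY": {
        "http://192.168.0.34:8987/solr/shard2_replica_n1/": {
          "QTime": "2",
          "ElapsedTime": "13",
          "RequestPurpose": "GET_TERM_STATS"
        }
      }
    }
  }
}
""")


def term_stats_requests(response):
    """Return (phase, shard_url, elapsed_ms) for tracked GET_TERM_STATS requests."""
    track = response.get("debug", {}).get("track", {})
    hits = []
    for phase, shards in track.items():
        if not isinstance(shards, dict):
            continue  # entries such as "rid" are plain strings, not phases
        for shard_url, info in shards.items():
            if not isinstance(info, dict):
                continue
            # Assumption: several purposes may be joined with commas.
            purposes = [p.strip() for p in info.get("RequestPurpose", "").split(",")]
            if "GET_TERM_STATS" in purposes:
                hits.append((phase, shard_url, int(info.get("ElapsedTime", 0))))
    return hits


if __name__ == "__main__":
    for phase, shard_url, elapsed_ms in term_stats_requests(SAMPLE):
        print(phase, shard_url, elapsed_ms)
```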

 

For queries that do not use distributed IDF information for scoring, the 
stats request is unnecessary. For example, when retrieving docs with a terms 
filter:

http://localhost:8987/solr/collection1/select?q=*%3A*&wt=json&fq={!terms f=id}id1,id2

Hence I propose adding a disableDistribStats request param so that the 
distributed stats request can be disabled at query time. 
 # disableDistribStats defaults to false. When the param is absent, the 
current distributed IDF behavior is unchanged. 
 # When disableDistribStats=true is set explicitly, the distributed stats call 
is skipped for the current query.  
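
As an illustration of the intended semantics, here is a minimal client-side sketch. It assumes the proposal is accepted as described: disableDistribStats is the param proposed in this issue and not something Solr necessarily supports today. The helper names build_select_url and should_fetch_distrib_stats are hypothetical, and the boolean check is a simplification of how Solr parses request params.

```python
from urllib.parse import urlencode


def build_select_url(collection_url, ids, disable_distrib_stats=False):
    """Build the terms-filter query from the example, URL-encoding each param."""
    params = [
        ("q", "*:*"),
        ("wt", "json"),
        ("fq", "{!terms f=id}" + ",".join(ids)),
    ]
    if disable_distrib_stats:
        # Proposed param; omitted entirely when the default (false) applies.
        params.append(("disableDistribStats", "true"))
    return collection_url + "/select?" + urlencode(params)


def should_fetch_distrib_stats(params):
    """Proposed rule: fetch term stats unless disableDistribStats=true is set."""
    return params.get("disableDistribStats", "false").lower() != "true"


if __name__ == "__main__":
    base = "http://localhost:8987/solr/collection1"
    print(build_select_url(base, ["id1", "id2"], disable_distrib_stats=True))
```

With the param absent, should_fetch_distrib_stats returns True, which matches item 1 above: existing deployments see no change unless a query opts out.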

  was:
When distributed IDF is enabled in solr cloud by adding one of the cache 
implementations in solrconfig.xml 
[https://solr.apache.org/guide/solr/latest/deployment-guide/solrcloud-distributed-requests.html#distributedidf],
  each solr query will incur a distributed shard request to get term statistics

"debug": {
        "track": {
            "rid": "-54",
            "PARSE_QUERY": {
                "http://192.168.0.34:8987/solr/shard2_replica_n1/": {
                    "QTime": "2",
                    "ElapsedTime": "13",
                    "RequestPurpose": "GET_TERM_STATS",
                    …

For queries that does not use distributed IDF information for scoring, the 
stats request is not necessary.  For example when retrieving docs by terms 
filter:    

http://localhost:8987/solr/collection1/select?q=*%3A*&wt=json&fq={!terms f=id}id1,id2

  Hence I propose to add a disableDistribStats request param so that the 
distributed stats request can be disabled at query time. 
 # disableDistribStats defaults to false. When the param is not present, there 
is no change to current distributed IDF behavior. 
 # When explicitly set disableDistribStats=true, distributed stats call is 
disabled for the current query.  


> Request param to disable distributed stats request at query time
> ----------------------------------------------------------------
>
>                 Key: SOLR-17058
>                 URL: https://issues.apache.org/jira/browse/SOLR-17058
>             Project: Solr
>          Issue Type: New Feature
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: query
>            Reporter: wei wang
>            Priority: Minor
>
> When distributed IDF is enabled in SolrCloud by adding one of the stats cache 
> implementations to solrconfig.xml 
> [https://solr.apache.org/guide/solr/latest/deployment-guide/solrcloud-distributed-requests.html#distributedidf],
> each Solr query incurs a distributed shard request to fetch term statistics, 
> as the debug output shows:
> "debug": {
>         "track": {
>             "rid": "-54",
>             "PARSE_QUERY": {
>                 "http://192.168.0.34:8987/solr/shard2_replica_n1/": {
>                     "QTime": "2",
>                     "ElapsedTime": "13",
>                     "RequestPurpose": "GET_TERM_STATS",
>                     …
>  
> For queries that do not use distributed IDF information for scoring, the 
> stats request is unnecessary. For example, when retrieving docs with a terms 
> filter:
> http://localhost:8987/solr/collection1/select?q=*%3A*&wt=json&fq={!terms f=id}id1,id2
> Hence I propose adding a disableDistribStats request param so that the 
> distributed stats request can be disabled at query time. 
>  # disableDistribStats defaults to false. When the param is absent, the 
> current distributed IDF behavior is unchanged. 
>  # When disableDistribStats=true is set explicitly, the distributed stats 
> call is skipped for the current query.  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
