[ 
https://issues.apache.org/jira/browse/SOLR-6944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14352051#comment-14352051
 ] 

Alan Woodward commented on SOLR-6944:
-------------------------------------

I'm pretty sure this is due to stale connections (it generally occurs on the 
first request after a Jetty has shutdown and then been restarted).  There are 
two places that need fixing:
* in HttpSolrClient - we should always call 
ClientConnectionManager.closeExpiredConnections() if we get a 
NoHttpResponseException, whether or not we retry (see SOLR-7203)
* in CloudSolrClient.  We have a couple of options here: call 
closeExpiredConnections() in a background thread (we already have an 
executorservice here, so we can just use that), or alternatively call it 
whenever there's a change in the cluster state.  I quite like the latter idea - 
it would be useful to have a generic 'register a callback that gets called 
whenever the cluster state changes' API on ZkStateReader, and CloudSolrClient 
could just use that.

> ReplicationFactorTest and HttpPartitionTest both fail with 
> org.apache.http.NoHttpResponseException: The target server failed to respond
> ---------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-6944
>                 URL: https://issues.apache.org/jira/browse/SOLR-6944
>             Project: Solr
>          Issue Type: Test
>            Reporter: Mark Miller
>            Assignee: Mark Miller
>         Attachments: SOLR-6944.patch
>
>
> Our only recourse is to do a client side retry on such errors. I have some 
> retry code for this from SOLR-4509 that I will pull over here.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to