[ 
https://issues.apache.org/jira/browse/SOLR-2422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13005975#comment-13005975
 ] 

Robert Muir commented on SOLR-2422:
-----------------------------------

The first problem I found here is in testConnect, it has a timeout of 100ms

Our lucene slave is pretty busy (lots of cores, so lots of tests going on at 
once in parallel).

By changing this timeout to AbstractZkTestCase.TIMEOUT (10000ms), I found the 
test to be significantly more reliable. This is consistent with the other test 
cases, they seem to use this timeout.

I tested this on hudson and it seems a big improvement, so I committed the 
trivial change in r1080852 (sorry for the heavy-commit, I know we are all sick 
of the hudson instability).

> Improve reliability of ZkSolrClientTest
> ---------------------------------------
>
>                 Key: SOLR-2422
>                 URL: https://issues.apache.org/jira/browse/SOLR-2422
>             Project: Solr
>          Issue Type: Bug
>          Components: Build
>    Affects Versions: 4.0
>            Reporter: Robert Muir
>
> The ZKSolrClient test is pretty unreliable, it seems to fail a significant 
> portion of the time on hudson (often on my local as well).
> Additionally it seems to somehow sometimes (maybe depending upon retry loop?) 
> leave a lot of zookeeper threads running.
> I ran into these issues when i discovered that trying to interrupt() these 
> threads after the test completed was triggering a JRE bug, but by working 
> through it I saw how unreliable the test is.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to