[
https://issues.apache.org/jira/browse/SOLR-8914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15219154#comment-15219154
]
Erick Erickson commented on SOLR-8914:
--------------------------------------
Nope, completely re-did the test and got another failure on run 31. Here are
the failure snippets
[junit4] FAILURE 94.1s | TestStressLiveNodes.testStress <<<
[junit4] > Throwable #1: java.lang.AssertionError: iter1263:
[127.0.0.1:53372_solr, thrasher-T1262_0-0] expected:<1> but was:<2>
[junit4] > at
__randomizedtesting.SeedInfo.seed([3E42073F9773C752:2B597AFE14843F29]:0)
[junit4] > at
org.apache.solr.cloud.TestStressLiveNodes.testStress(TestStressLiveNodes.java:137)
[junit4] > at java.lang.Thread.run(Thread.java:745)
***********************
[junit4] FAILURE 75.7s | TestStressLiveNodes.testStress <<<
[junit4] > Throwable #1: java.lang.AssertionError: iter2373 6 != 1
expected:<[127.0.0.1:61240_solr, thrasher-T2373_0-0, thrasher-T2373_1-0,
thrasher-T2373_2-0, thrasher-T2373_3-0, thrasher-T2373_4-0]> but
was:<[127.0.0.1:61240_solr]>
[junit4] > at
__randomizedtesting.SeedInfo.seed([20645E3B72A746D3:357F23FAF150BEA8]:0)
[junit4] > at
org.apache.solr.cloud.TestStressLiveNodes.testStress(TestStressLiveNodes.java:200)
[junit4] > at java.lang.Thread.run(Thread.java:745)
**************************
[junit4] 2> NOTE: reproduce with: ant test -Dtestcase=TestStressLiveNodes
-Dtests.method=testStress -Dtests.seed=E803236A2774DE4C -Dtests.nightly=true
-Dtests.slow=true -Dtests.locale=sr-RS -Dtests.timezone=Pacific/Majuro
-Dtests.asserts=true -Dtests.file.encoding=UTF-8
[junit4] ERROR 62.7s | TestStressLiveNodes.testStress <<<
[junit4] > Throwable #1: org.apache.solr.common.SolrException:
java.util.concurrent.TimeoutException: Could not connect to ZooKeeper
127.0.0.1:62845/solr within 30000 ms
[junit4] > at
__randomizedtesting.SeedInfo.seed([E803236A2774DE4C:FD185EABA4832637]:0)
[junit4] > at
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:181)
[junit4] > at
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:115)
[junit4] > at
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:110)
[junit4] > at
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:97)
[junit4] > at
org.apache.solr.cloud.TestStressLiveNodes.newSolrZkClient(TestStressLiveNodes.java:87)
[junit4] > at
org.apache.solr.cloud.TestStressLiveNodes.access$000(TestStressLiveNodes.java:54)
[junit4] > at
org.apache.solr.cloud.TestStressLiveNodes$LiveNodeTrasher.<init>(TestStressLiveNodes.java:225)
[junit4] > at
org.apache.solr.cloud.TestStressLiveNodes.testStress(TestStressLiveNodes.java:174)
[junit4] > at java.lang.Thread.run(Thread.java:745)
[junit4] > Caused by: java.util.concurrent.TimeoutException: Could not
connect to ZooKeeper 127.0.0.1:62845/solr within 30000 ms
[junit4] > at
org.apache.solr.common.cloud.ConnectionManager.waitForConnected(ConnectionManager.java:228)
[junit4] > at
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:173)
[junit4] > ... 46 more
[ju
***********************************
[junit4] 2> NOTE: reproduce with: ant test -Dtestcase=TestStressLiveNodes
-Dtests.method=testStress -Dtests.seed=6B315674F529C1E2 -Dtests.nightly=true
-Dtests.slow=true -Dtests.locale=ar-IQ -Dtests.timezone=Europe/Bratislava
-Dtests.asserts=true -Dtests.file.encoding=UTF-8
[junit4] ERROR 113s | TestStressLiveNodes.testStress <<<
[junit4] > Throwable #1: org.apache.solr.common.SolrException:
java.util.concurrent.TimeoutException: Could not connect to ZooKeeper
127.0.0.1:62849/solr within 30000 ms
[junit4] > at
__randomizedtesting.SeedInfo.seed([6B315674F529C1E2:7E2A2BB576DE3999]:0)
[junit4] > at
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:181)
[junit4] > at
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:115)
[junit4] > at
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:110)
[junit4] > at
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:97)
[junit4] > at
org.apache.solr.cloud.TestStressLiveNodes.newSolrZkClient(TestStressLiveNodes.java:87)
[junit4] > at
org.apache.solr.cloud.TestStressLiveNodes.access$000(TestStressLiveNodes.java:54)
[junit4] > at
org.apache.solr.cloud.TestStressLiveNodes$LiveNodeTrasher.<init>(TestStressLiveNodes.java:225)
[junit4] > at
org.apache.solr.cloud.TestStressLiveNodes.testStress(TestStressLiveNodes.java:174)
[junit4] > at java.lang.Thread.run(Thread.java:745)
[junit4] > Caused by: java.util.concurrent.TimeoutException: Could not
connect to ZooKeeper 127.0.0.1:62849/solr within 30000 ms
[junit4] > at
org.apache.solr.common.cloud.ConnectionManager.waitForConnected(ConnectionManager.java:228)
[junit4] > at
org.apache.solr.common.cloud.SolrZkClient.<init>(SolrZkClient.java:173)
[junit4] > ... 46 more
[junit4] 2> 116534 INFO (jetty-launcher-1-thread-2) [ ]
o.e.j.s.ServerConnector Stopped
ServerConnector@5aa5196c{HTTP/1.1,[http/1.1]}{127.0.0.1:0}
[juni
> ZkStateReader's refreshLiveNodes(Watcher) is not thread safe
> ------------------------------------------------------------
>
> Key: SOLR-8914
> URL: https://issues.apache.org/jira/browse/SOLR-8914
> Project: Solr
> Issue Type: Bug
> Reporter: Hoss Man
> Attachments: SOLR-8914.patch, SOLR-8914.patch, SOLR-8914.patch,
> SOLR-8914.patch, jenkins.thetaphi.de_Lucene-Solr-6.x-Solaris_32.log.txt,
> live_node_mentions_port56361_with_threadIds.log.txt,
> live_nodes_mentions.log.txt
>
>
> Jenkin's encountered a failure in TestTolerantUpdateProcessorCloud over the
> weekend....
> {noformat}
> http://jenkins.thetaphi.de/job/Lucene-Solr-6.x-Solaris/32/consoleText
> Checking out Revision c46d7686643e7503304cb35dfe546bce9c6684e7
> (refs/remotes/origin/branch_6x)
> Using Java: 64bit/jdk1.8.0 -XX:+UseCompressedOops -XX:+UseG1GC
> {noformat}
> The failure happened during the static setup of the test, when a
> MiniSolrCloudCluster & several clients are initialized -- before any code
> related to TolerantUpdateProcessor is ever used.
> I can't reproduce this, or really make sense of what i'm (not) seeing here in
> the logs, so i'm filing this jira with my analysis in the hopes that someone
> else can help make sense of it.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]