[ 
https://issues.apache.org/jira/browse/SOLR-8249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14996991#comment-14996991
 ] 

Steve Rowe commented on SOLR-8249:
----------------------------------

I've been running the following, over 5000 iterations each for patched and 
unpatched trunk, and only had one failure on unpatched - I'll include the log 
below.  Here's the cmdline I'm using:

{noformat}
(for a in {1..1000} ; do ant test -Dtestcase=OverseerTest 
-Dtests.method=testOverseerStatsReset -Dtests.dups=10 -Dtests.jvms=10 ; done) 
2>&1 | tee ../../test.output
{noformat}

Here's the failure from unpatched trunk:

{noformat}
   [junit4] Suite: org.apache.solr.cloud.OverseerTest
   [junit4]   2> Creating dataDir: 
/home/sarowe/svn/lucene/dev/trunk/solr/build/solr-core/test/J3/temp/solr.cloud.OverseerTest_D0EFDBDB7BF104A-001/init-core-data-001
   [junit4]   2> 0    INFO  (SUITE-OverseerTest-seed#[D0EFDBDB7BF104A]-worker) 
[    ] o.a.s.SolrTestCaseJ4 Randomized ssl (false) and clientAuth (true)
   [junit4]   2> 69   INFO  (SUITE-OverseerTest-seed#[D0EFDBDB7BF104A]-worker) 
[    ] o.a.s.SolrTestCaseJ4 ####initCore
   [junit4]   2> 72   INFO  (SUITE-OverseerTest-seed#[D0EFDBDB7BF104A]-worker) 
[    ] o.a.s.SolrTestCaseJ4 ####initCore end
   [junit4]   2> 83   INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.SolrTestCaseJ4 ###Starting testOverseerStatsReset
   [junit4]   2> 92   INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.ZkTestServer STARTING ZK TEST SERVER
   [junit4]   2> 105  INFO  (Thread-1) [    ] o.a.s.c.ZkTestServer client 
port:0.0.0.0/0.0.0.0:0
   [junit4]   2> 106  INFO  (Thread-1) [    ] o.a.s.c.ZkTestServer Starting 
server
   [junit4]   2> 194  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.ZkTestServer start zk server on port:36981
   [junit4]   2> 213  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient Using default ZkCredentialsProvider
   [junit4]   2> 245  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.ConnectionManager Waiting for client to connect to ZooKeeper
   [junit4]   2> 384  INFO  (zkCallback-1-thread-1) [    ] 
o.a.s.c.c.ConnectionManager Watcher 
org.apache.solr.common.cloud.ConnectionManager@121063b4 
name:ZooKeeperConnection Watcher:127.0.0.1:36981 got event WatchedEvent 
state:SyncConnected type:None path:null path:null type:None
   [junit4]   2> 385  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.ConnectionManager Client is connected to ZooKeeper
   [junit4]   2> 386  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient Using default ZkACLProvider
   [junit4]   2> 414  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient Using default ZkCredentialsProvider
   [junit4]   2> 415  WARN  (NIOServerCxn.Factory:0.0.0.0/0.0.0.0:0) [    ] 
o.a.z.s.NIOServerCnxn caught end of stream exception
   [junit4]   2> EndOfStreamException: Unable to read additional data from 
client sessionid 0x150ed0c2d8d0000, likely client has closed socket
   [junit4]   2>        at 
org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:228)
   [junit4]   2>        at 
org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:208)
   [junit4]   2>        at java.lang.Thread.run(Thread.java:745)
   [junit4]   2> 454  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.ConnectionManager Waiting for client to connect to ZooKeeper
   [junit4]   2> 457  INFO  (zkCallback-2-thread-1) [    ] 
o.a.s.c.c.ConnectionManager Watcher 
org.apache.solr.common.cloud.ConnectionManager@4c1e7367 
name:ZooKeeperConnection Watcher:127.0.0.1:36981 got event WatchedEvent 
state:SyncConnected type:None path:null path:null type:None
   [junit4]   2> 457  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.ConnectionManager Client is connected to ZooKeeper
   [junit4]   2> 457  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient Using default ZkACLProvider
   [junit4]   2> 458  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /solr
   [junit4]   2> 467  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient Using default ZkCredentialsProvider
   [junit4]   2> 468  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.ConnectionManager Waiting for client to connect to ZooKeeper
   [junit4]   2> 470  INFO  (zkCallback-3-thread-1) [    ] 
o.a.s.c.c.ConnectionManager Watcher 
org.apache.solr.common.cloud.ConnectionManager@485ae506 
name:ZooKeeperConnection Watcher:127.0.0.1:36981/solr got event WatchedEvent 
state:SyncConnected type:None path:null path:null type:None
   [junit4]   2> 470  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.ConnectionManager Client is connected to ZooKeeper
   [junit4]   2> 470  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient Using default ZkACLProvider
   [junit4]   2> 488  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /live_nodes
   [junit4]   2> 496  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /collections
   [junit4]   2> 498  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /aliases.json
   [junit4]   2> 500  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /clusterstate.json
   [junit4]   2> 503  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /security.json
   [junit4]   2> 509  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.ZkStateReader Updating cluster state from ZooKeeper... 
   [junit4]   2> 534  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient Using default ZkCredentialsProvider
   [junit4]   2> 541  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.ConnectionManager Waiting for client to connect to ZooKeeper
   [junit4]   2> 545  INFO  (zkCallback-4-thread-1) [    ] 
o.a.s.c.c.ConnectionManager Watcher 
org.apache.solr.common.cloud.ConnectionManager@3e920eb0 
name:ZooKeeperConnection Watcher:127.0.0.1:36981/solr got event WatchedEvent 
state:SyncConnected type:None path:null path:null type:None
   [junit4]   2> 547  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.ConnectionManager Client is connected to ZooKeeper
   [junit4]   2> 547  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient Using default ZkACLProvider
   [junit4]   2> 550  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.ZkStateReader Updating cluster state from ZooKeeper... 
   [junit4]   2> 552  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /live_nodes/node1
   [junit4]   2> 557  INFO  (zkCallback-4-thread-1) [    ] 
o.a.s.c.c.ZkStateReader A live node change: WatchedEvent state:SyncConnected 
type:NodeChildrenChanged path:/live_nodes, has occurred - updating... (live 
nodes size: 0)
   [junit4]   2> 561  INFO  (zkCallback-3-thread-1) [    ] 
o.a.s.c.c.ZkStateReader A live node change: WatchedEvent state:SyncConnected 
type:NodeChildrenChanged path:/live_nodes, has occurred - updating... (live 
nodes size: 0)
   [junit4]   2> 767  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.u.UpdateShardHandler Creating UpdateShardHandler HTTP client with params: 
socketTimeout=600000&connTimeout=60000&retry=true
   [junit4]   2> 846  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /overseer_elect
   [junit4]   2> 850  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /overseer_elect/election
   [junit4]   2> 858  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.Overseer Overseer (id=null) closing
   [junit4]   2> 862  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.LeaderElector Joined leadership election with path: 
/overseer_elect/election/94836228734386178-127.0.0.1:36981_solr-n_0000000000
   [junit4]   2> 877  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.OverseerElectionContext I am going to be the leader 127.0.0.1:36981_solr
   [junit4]   2> 881  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /overseer_elect/leader
   [junit4]   2> 885  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.Overseer Overseer 
(id=94836228734386178-127.0.0.1:36981_solr-n_0000000000) starting
   [junit4]   2> 908  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /overseer/queue
   [junit4]   2> 912  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /overseer/queue-work
   [junit4]   2> 929  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /overseer/collection-map-failure
   [junit4]   2> 932  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /overseer/collection-map-running
   [junit4]   2> 935  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /overseer/collection-map-completed
   [junit4]   2> 944  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /overseer/collection-queue-work
   [junit4]   2> 965  INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.OverseerAutoReplicaFailoverThread Starting 
OverseerAutoReplicaFailoverThread autoReplicaFailoverWorkLoopDelay=10000 
autoReplicaFailoverWaitAfterExpiration=30000 
autoReplicaFailoverBadNodeExpiration=60000
   [junit4]   2> 1062 INFO  
(OverseerCollectionConfigSetProcessor-94836228734386178-127.0.0.1:36981_solr-n_0000000000)
 [    ] o.a.s.c.OverseerTaskProcessor Process current queue of overseer 
operations
   [junit4]   2> 1079 INFO  
(OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [    
] o.a.s.c.Overseer Starting to work on the main queue
   [junit4]   2> 1087 INFO  
(OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [    
] o.a.s.c.Overseer processMessage: queueSize: 1, message = {
   [junit4]   2>   "operation":"state",
   [junit4]   2>   "state":"recovering",
   [junit4]   2>   "node_name":"node1",
   [junit4]   2>   "core":"core1",
   [junit4]   2>   "core_node_name":"core_node1",
   [junit4]   2>   "collection":"collection1",
   [junit4]   2>   "numShards":"1",
   [junit4]   2>   "base_url":"http://node1/solr/"} current state version: 0
   [junit4]   2> 1094 INFO  
(OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [    
] o.a.s.c.o.ReplicaMutator Update state numShards=1 message={
   [junit4]   2>   "operation":"state",
   [junit4]   2>   "state":"recovering",
   [junit4]   2>   "node_name":"node1",
   [junit4]   2>   "core":"core1",
   [junit4]   2>   "core_node_name":"core_node1",
   [junit4]   2>   "collection":"collection1",
   [junit4]   2>   "numShards":"1",
   [junit4]   2>   "base_url":"http://node1/solr/"}
   [junit4]   2> 1095 INFO  
(OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [    
] o.a.s.c.o.ClusterStateMutator building a new cName: collection1
   [junit4]   2> 1102 INFO  
(OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [    
] o.a.s.c.o.ReplicaMutator Assigning new node to shard shard=shard1
   [junit4]   2> 1107 INFO  (zkCallback-4-thread-1) [    ] 
o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent 
state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred 
- updating... (live nodes size: 1)
   [junit4]   2> 1107 INFO  (zkCallback-3-thread-1) [    ] 
o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent 
state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred 
- updating... (live nodes size: 1)
   [junit4]   2> 1578 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: 
/collections/collection1/leader_elect/shard1/election
   [junit4]   2> 1599 INFO  (zkCallback-3-thread-1) [    ] 
o.a.s.c.c.ZkStateReader A collections change: WatchedEvent state:SyncConnected 
type:NodeChildrenChanged path:/collections, has occurred - updating...
   [junit4]   2> 1599 INFO  (zkCallback-4-thread-1) [    ] 
o.a.s.c.c.ZkStateReader A collections change: WatchedEvent state:SyncConnected 
type:NodeChildrenChanged path:/collections, has occurred - updating...
   [junit4]   2> 1607 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.LeaderElector Joined leadership election with path: 
/collections/collection1/leader_elect/shard1/election/94836228734386179-node1_core1-n_0000000000
   [junit4]   2> 1611 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /collections/collection1/leaders/shard1
   [junit4]   2> 1616 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.ShardLeaderElectionContextBase Creating leader registration node
   [junit4]   2> 1628 INFO  
(OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [    
] o.a.s.c.Overseer processMessage: queueSize: 1, message = {
   [junit4]   2>   "operation":"state",
   [junit4]   2>   "state":"active",
   [junit4]   2>   "shard":"shard1",
   [junit4]   2>   "collection":"collection1",
   [junit4]   2>   "base_url":"http://node1/solr/";,
   [junit4]   2>   "node_name":"node1",
   [junit4]   2>   "core_node_name":"core_node1",
   [junit4]   2>   "core":"core1"} current state version: 1
   [junit4]   2> 1629 INFO  
(OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [    
] o.a.s.c.o.ReplicaMutator Update state numShards=null message={
   [junit4]   2>   "operation":"state",
   [junit4]   2>   "state":"active",
   [junit4]   2>   "shard":"shard1",
   [junit4]   2>   "collection":"collection1",
   [junit4]   2>   "base_url":"http://node1/solr/";,
   [junit4]   2>   "node_name":"node1",
   [junit4]   2>   "core_node_name":"core_node1",
   [junit4]   2>   "core":"core1"}
   [junit4]   2> 1646 INFO  
(OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [    
] o.a.s.c.Overseer processMessage: queueSize: 1, message = {
   [junit4]   2>   "operation":"leader",
   [junit4]   2>   "shard":"shard1",
   [junit4]   2>   "collection":"collection1",
   [junit4]   2>   "base_url":"http://node1/solr/";,
   [junit4]   2>   "core":"core1"} current state version: 1
   [junit4]   2> 1665 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.Overseer Overseer 
(id=94836228734386178-127.0.0.1:36981_solr-n_0000000000) closing
   [junit4]   2> 1666 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.ElectionContext Canceling election 
/overseer_elect/election/94836228734386178-127.0.0.1:36981_solr-n_0000000000
   [junit4]   2> 1675 INFO  
(OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [    
] o.a.s.c.Overseer Overseer Loop exiting : 127.0.0.1:36981_solr
   [junit4]   2> 1676 INFO  (zkCallback-3-thread-1) [    ] 
o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent 
state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred 
- updating... (live nodes size: 1)
   [junit4]   2> 1677 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.LeaderElector Joined leadership election with path: 
/overseer_elect/election/94836228734386178-127.0.0.1:36981_solr-n_0000000001
   [junit4]   2> 1677 INFO  (zkCallback-4-thread-1) [    ] 
o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent 
state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred 
- updating... (live nodes size: 1)
   [junit4]   2> 1682 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.OverseerElectionContext I am going to be the leader 127.0.0.1:36981_solr
   [junit4]   2> 1683 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.c.SolrZkClient makePath: /overseer_elect/leader
   [junit4]   2> 1683 INFO  (OverseerExitThread) [    ] o.a.s.c.Overseer I'm 
exiting , but I'm still the leader
   [junit4]   2> 1686 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.Overseer Overseer 
(id=94836228734386178-127.0.0.1:36981_solr-n_0000000001) starting
   [junit4]   2> 1705 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.OverseerAutoReplicaFailoverThread Starting 
OverseerAutoReplicaFailoverThread autoReplicaFailoverWorkLoopDelay=10000 
autoReplicaFailoverWaitAfterExpiration=30000 
autoReplicaFailoverBadNodeExpiration=60000
   [junit4]   2> 1705 INFO  
(OverseerCollectionConfigSetProcessor-94836228734386178-127.0.0.1:36981_solr-n_0000000001)
 [    ] o.a.s.c.OverseerTaskProcessor Process current queue of overseer 
operations
   [junit4]   2> 1706 INFO  
(OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000001) [    
] o.a.s.c.Overseer Starting to work on the main queue
   [junit4]   2> 1708 INFO  
(OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000001) [    
] o.a.s.c.Overseer processMessage: workQueueSize: 2, message = {
   [junit4]   2>   "operation":"state",
   [junit4]   2>   "state":"active",
   [junit4]   2>   "shard":"shard1",
   [junit4]   2>   "collection":"collection1",
   [junit4]   2>   "base_url":"http://node1/solr/";,
   [junit4]   2>   "node_name":"node1",
   [junit4]   2>   "core_node_name":"core_node1",
   [junit4]   2>   "core":"core1"}
   [junit4]   2> 1709 INFO  
(OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000001) [    
] o.a.s.c.o.ReplicaMutator Update state numShards=null message={
   [junit4]   2>   "operation":"state",
   [junit4]   2>   "state":"active",
   [junit4]   2>   "shard":"shard1",
   [junit4]   2>   "collection":"collection1",
   [junit4]   2>   "base_url":"http://node1/solr/";,
   [junit4]   2>   "node_name":"node1",
   [junit4]   2>   "core_node_name":"core_node1",
   [junit4]   2>   "core":"core1"}
   [junit4]   2> 1709 INFO  (zkCallback-3-thread-1) [    ] 
o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent 
state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred 
- updating... (live nodes size: 1)
   [junit4]   2> 1709 INFO  (zkCallback-4-thread-1) [    ] 
o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent 
state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred 
- updating... (live nodes size: 1)
   [junit4]   2> 1711 INFO  
(OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000001) [    
] o.a.s.c.Overseer processMessage: workQueueSize: 2, message = {
   [junit4]   2>   "operation":"leader",
   [junit4]   2>   "shard":"shard1",
   [junit4]   2>   "collection":"collection1",
   [junit4]   2>   "base_url":"http://node1/solr/";,
   [junit4]   2>   "core":"core1"}
   [junit4]   2> 1712 INFO  (zkCallback-3-thread-1) [    ] 
o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent 
state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred 
- updating... (live nodes size: 1)
   [junit4]   2> 1747 INFO  (zkCallback-3-thread-1) [    ] 
o.a.s.c.c.ZkStateReader A live node change: WatchedEvent state:SyncConnected 
type:NodeChildrenChanged path:/live_nodes, has occurred - updating... (live 
nodes size: 1)
   [junit4]   2> 1758 INFO  (zkCallback-4-thread-2) [    ] 
o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent 
state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred 
- updating... (live nodes size: 1)
   [junit4]   2> 1759 WARN  (zkCallback-4-thread-2) [    ] 
o.a.s.c.c.ZkStateReader ZooKeeper watch triggered, but Solr cannot talk to ZK
   [junit4]   2> 1762 ERROR 
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]-EventThread) [ 
   ] o.a.z.ClientCnxn Error while calling watcher 
   [junit4]   2> java.util.concurrent.RejectedExecutionException: Task 
org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor$1@28ba4cca 
rejected from 
org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor@6db8d7b9[Shutting
 down, pool size = 2, active threads = 0, queued tasks = 0, completed tasks = 7]
   [junit4]   2>        at 
java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:2047)
   [junit4]   2>        at 
java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:823)
   [junit4]   2>        at 
java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:1369)
   [junit4]   2>        at 
org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.execute(ExecutorUtil.java:214)
   [junit4]   2>        at 
java.util.concurrent.AbstractExecutorService.submit(AbstractExecutorService.java:112)
   [junit4]   2>        at 
org.apache.solr.common.cloud.SolrZkClient$3.process(SolrZkClient.java:266)
   [junit4]   2>        at 
org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:522)
   [junit4]   2>        at 
org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:498)
   [junit4]   2> 1772 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.ZkTestServer connecting to 127.0.0.1:36981 36981
   [junit4]   2> 1896 INFO  (Thread-1) [    ] o.a.s.c.ZkTestServer connecting 
to 127.0.0.1:36981 36981
   [junit4]   2> 1905 WARN  (Thread-1) [    ] o.a.s.c.ZkTestServer Watch limit 
violations: 
   [junit4]   2> Maximum concurrent create/delete watches above limit:
   [junit4]   2> 
   [junit4]   2>        2       /solr/aliases.json
   [junit4]   2> 
   [junit4]   2> Maximum concurrent data watches above limit:
   [junit4]   2> 
   [junit4]   2>        2       /solr/clusterstate.json
   [junit4]   2> 
   [junit4]   2> Maximum concurrent children watches above limit:
   [junit4]   2> 
   [junit4]   2>        2       /solr/live_nodes
   [junit4]   2>        2       /solr/collections
   [junit4]   2>        2       /solr/overseer/collection-queue-work
   [junit4]   2> 
   [junit4]   2> 1905 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.SolrTestCaseJ4 ###Ending testOverseerStatsReset
   [junit4]   2> 1916 INFO  
(TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [    ] 
o.a.s.c.Overseer Overseer 
(id=94836228734386178-127.0.0.1:36981_solr-n_0000000001) closing
   [junit4]   2> 1916 INFO  
(OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000001) [    
] o.a.s.c.Overseer Overseer Loop exiting : 127.0.0.1:36981_solr
   [junit4]   2> 1917 ERROR (OverseerExitThread) [    ] o.a.s.c.Overseer could 
not read the data
   [junit4]   2> org.apache.zookeeper.KeeperException$SessionExpiredException: 
KeeperErrorCode = Session expired for /overseer_elect/leader
   [junit4]   2>        at 
org.apache.zookeeper.KeeperException.create(KeeperException.java:127)
   [junit4]   2>        at 
org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
   [junit4]   2>        at 
org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:1155)
   [junit4]   2>        at 
org.apache.solr.common.cloud.SolrZkClient$7.execute(SolrZkClient.java:353)
   [junit4]   2>        at 
org.apache.solr.common.cloud.SolrZkClient$7.execute(SolrZkClient.java:350)
   [junit4]   2>        at 
org.apache.solr.common.cloud.ZkCmdExecutor.retryOperation(ZkCmdExecutor.java:61)
   [junit4]   2>        at 
org.apache.solr.common.cloud.SolrZkClient.getData(SolrZkClient.java:350)
   [junit4]   2>        at 
org.apache.solr.cloud.Overseer$ClusterStateUpdater.checkIfIamStillLeader(Overseer.java:304)
   [junit4]   2>        at 
org.apache.solr.cloud.Overseer$ClusterStateUpdater.access$300(Overseer.java:87)
   [junit4]   2>        at 
org.apache.solr.cloud.Overseer$ClusterStateUpdater$2.run(Overseer.java:265)
   [junit4]   2> NOTE: download the large Jenkins line-docs file by running 
'ant get-jenkins-line-docs' in the lucene directory.
   [junit4]   2> NOTE: reproduce with: ant test  -Dtestcase=OverseerTest 
-Dtests.method=testOverseerStatsReset -Dtests.seed=D0EFDBDB7BF104A 
-Dtests.slow=true 
-Dtests.linedocsfile=/home/jenkins/lucene-data/enwiki.random.lines.txt 
-Dtests.locale=en_IN -Dtests.timezone=Antarctica/McMurdo -Dtests.asserts=true 
-Dtests.file.encoding=ISO-8859-1
   [junit4] FAILURE 1.85s J3  | OverseerTest.testOverseerStatsReset <<<
   [junit4]    > Throwable #1: java.lang.AssertionError: expected:<0> but 
was:<1>
   [junit4]    >        at 
__randomizedtesting.SeedInfo.seed([D0EFDBDB7BF104A:A65A1A812265B344]:0)
   [junit4]    >        at 
org.apache.solr.cloud.OverseerTest.testOverseerStatsReset(OverseerTest.java:741)
   [junit4]    >        at java.lang.Thread.run(Thread.java:745)
   [junit4]   2> 4921 INFO  (SUITE-OverseerTest-seed#[D0EFDBDB7BF104A]-worker) 
[    ] o.a.s.SolrTestCaseJ4 ###deleteCore
   [junit4]   2> NOTE: leaving temporary files on disk at: 
/home/sarowe/svn/lucene/dev/trunk/solr/build/solr-core/test/J3/temp/solr.cloud.OverseerTest_D0EFDBDB7BF104A-001
   [junit4]   2> NOTE: test params are: 
codec=DummyCompressingStoredFields(storedFieldsFormat=CompressingStoredFieldsFormat(compressionMode=DUMMY,
 chunkSize=1, maxDocsPerChunk=860, blockSize=7), 
termVectorsFormat=CompressingTermVectorsFormat(compressionMode=DUMMY, 
chunkSize=1, blockSize=7)), 
sim=RandomSimilarityProvider(queryNorm=false,coord=yes): {}, locale=en_IN, 
timezone=Antarctica/McMurdo
   [junit4]   2> NOTE: Linux 4.1.0-custom2-amd64 amd64/Oracle Corporation 
1.8.0_45 (64-bit)/cpus=16,threads=1,free=423825696,total=514850816
   [junit4]   2> NOTE: All tests run in this JVM: [OverseerTest]
   [junit4] Completed [1/10] on J3 in 6.32s, 1 test, 1 failure <<< FAILURES!
{noformat}

I don't understand why this test fails so frequently when run with other Solr 
tests, but then so rarely fails when run in isolation - note that unlike 
standard Solr tests, only one test is being run per JVM with the way I'm 
running it.

> OverseerTest failures
> ---------------------
>
>                 Key: SOLR-8249
>                 URL: https://issues.apache.org/jira/browse/SOLR-8249
>             Project: Solr
>          Issue Type: Bug
>            Reporter: Noble Paul
>            Assignee: Noble Paul
>         Attachments: SOLR-8249.patch, SOLR-8249.patch, SOLR-8249.patch, 
> SOLR-8249.patch, test.output
>
>
> http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Windows/5377/
> This is related to SOLR-7989 . [~ichattopadhyaya] we need to fix the testcase



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to