[ https://issues.apache.org/jira/browse/SOLR-8249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14996991#comment-14996991 ]
Steve Rowe commented on SOLR-8249: ---------------------------------- I've been running the following, over 5000 iterations each for patched and unpatched trunk, and only had one failure on unpatched - I'll include the log below. Here's the cmdline I'm using: {noformat} (for a in {1..1000} ; do ant test -Dtestcase=OverseerTest -Dtests.method=testOverseerStatsReset -Dtests.dups=10 -Dtests.jvms=10 ; done) 2>&1 | tee ../../test.output {noformat} Here's the failure from unpatched trunk: {noformat} [junit4] Suite: org.apache.solr.cloud.OverseerTest [junit4] 2> Creating dataDir: /home/sarowe/svn/lucene/dev/trunk/solr/build/solr-core/test/J3/temp/solr.cloud.OverseerTest_D0EFDBDB7BF104A-001/init-core-data-001 [junit4] 2> 0 INFO (SUITE-OverseerTest-seed#[D0EFDBDB7BF104A]-worker) [ ] o.a.s.SolrTestCaseJ4 Randomized ssl (false) and clientAuth (true) [junit4] 2> 69 INFO (SUITE-OverseerTest-seed#[D0EFDBDB7BF104A]-worker) [ ] o.a.s.SolrTestCaseJ4 ####initCore [junit4] 2> 72 INFO (SUITE-OverseerTest-seed#[D0EFDBDB7BF104A]-worker) [ ] o.a.s.SolrTestCaseJ4 ####initCore end [junit4] 2> 83 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.SolrTestCaseJ4 ###Starting testOverseerStatsReset [junit4] 2> 92 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.ZkTestServer STARTING ZK TEST SERVER [junit4] 2> 105 INFO (Thread-1) [ ] o.a.s.c.ZkTestServer client port:0.0.0.0/0.0.0.0:0 [junit4] 2> 106 INFO (Thread-1) [ ] o.a.s.c.ZkTestServer Starting server [junit4] 2> 194 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.ZkTestServer start zk server on port:36981 [junit4] 2> 213 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient Using default ZkCredentialsProvider [junit4] 2> 245 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.ConnectionManager Waiting for client to connect to ZooKeeper [junit4] 2> 384 INFO (zkCallback-1-thread-1) [ ] o.a.s.c.c.ConnectionManager Watcher org.apache.solr.common.cloud.ConnectionManager@121063b4 name:ZooKeeperConnection Watcher:127.0.0.1:36981 got event WatchedEvent state:SyncConnected type:None path:null path:null type:None [junit4] 2> 385 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.ConnectionManager Client is connected to ZooKeeper [junit4] 2> 386 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient Using default ZkACLProvider [junit4] 2> 414 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient Using default ZkCredentialsProvider [junit4] 2> 415 WARN (NIOServerCxn.Factory:0.0.0.0/0.0.0.0:0) [ ] o.a.z.s.NIOServerCnxn caught end of stream exception [junit4] 2> EndOfStreamException: Unable to read additional data from client sessionid 0x150ed0c2d8d0000, likely client has closed socket [junit4] 2> at org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:228) [junit4] 2> at org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:208) [junit4] 2> at java.lang.Thread.run(Thread.java:745) [junit4] 2> 454 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.ConnectionManager Waiting for client to connect to ZooKeeper [junit4] 2> 457 INFO (zkCallback-2-thread-1) [ ] o.a.s.c.c.ConnectionManager Watcher org.apache.solr.common.cloud.ConnectionManager@4c1e7367 name:ZooKeeperConnection Watcher:127.0.0.1:36981 got event WatchedEvent state:SyncConnected type:None path:null path:null type:None [junit4] 2> 457 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.ConnectionManager Client is connected to ZooKeeper [junit4] 2> 457 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient Using default ZkACLProvider [junit4] 2> 458 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /solr [junit4] 2> 467 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient Using default ZkCredentialsProvider [junit4] 2> 468 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.ConnectionManager Waiting for client to connect to ZooKeeper [junit4] 2> 470 INFO (zkCallback-3-thread-1) [ ] o.a.s.c.c.ConnectionManager Watcher org.apache.solr.common.cloud.ConnectionManager@485ae506 name:ZooKeeperConnection Watcher:127.0.0.1:36981/solr got event WatchedEvent state:SyncConnected type:None path:null path:null type:None [junit4] 2> 470 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.ConnectionManager Client is connected to ZooKeeper [junit4] 2> 470 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient Using default ZkACLProvider [junit4] 2> 488 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /live_nodes [junit4] 2> 496 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /collections [junit4] 2> 498 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /aliases.json [junit4] 2> 500 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /clusterstate.json [junit4] 2> 503 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /security.json [junit4] 2> 509 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.ZkStateReader Updating cluster state from ZooKeeper... [junit4] 2> 534 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient Using default ZkCredentialsProvider [junit4] 2> 541 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.ConnectionManager Waiting for client to connect to ZooKeeper [junit4] 2> 545 INFO (zkCallback-4-thread-1) [ ] o.a.s.c.c.ConnectionManager Watcher org.apache.solr.common.cloud.ConnectionManager@3e920eb0 name:ZooKeeperConnection Watcher:127.0.0.1:36981/solr got event WatchedEvent state:SyncConnected type:None path:null path:null type:None [junit4] 2> 547 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.ConnectionManager Client is connected to ZooKeeper [junit4] 2> 547 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient Using default ZkACLProvider [junit4] 2> 550 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.ZkStateReader Updating cluster state from ZooKeeper... [junit4] 2> 552 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /live_nodes/node1 [junit4] 2> 557 INFO (zkCallback-4-thread-1) [ ] o.a.s.c.c.ZkStateReader A live node change: WatchedEvent state:SyncConnected type:NodeChildrenChanged path:/live_nodes, has occurred - updating... (live nodes size: 0) [junit4] 2> 561 INFO (zkCallback-3-thread-1) [ ] o.a.s.c.c.ZkStateReader A live node change: WatchedEvent state:SyncConnected type:NodeChildrenChanged path:/live_nodes, has occurred - updating... (live nodes size: 0) [junit4] 2> 767 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.u.UpdateShardHandler Creating UpdateShardHandler HTTP client with params: socketTimeout=600000&connTimeout=60000&retry=true [junit4] 2> 846 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /overseer_elect [junit4] 2> 850 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /overseer_elect/election [junit4] 2> 858 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.Overseer Overseer (id=null) closing [junit4] 2> 862 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.LeaderElector Joined leadership election with path: /overseer_elect/election/94836228734386178-127.0.0.1:36981_solr-n_0000000000 [junit4] 2> 877 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.OverseerElectionContext I am going to be the leader 127.0.0.1:36981_solr [junit4] 2> 881 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /overseer_elect/leader [junit4] 2> 885 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.Overseer Overseer (id=94836228734386178-127.0.0.1:36981_solr-n_0000000000) starting [junit4] 2> 908 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /overseer/queue [junit4] 2> 912 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /overseer/queue-work [junit4] 2> 929 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /overseer/collection-map-failure [junit4] 2> 932 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /overseer/collection-map-running [junit4] 2> 935 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /overseer/collection-map-completed [junit4] 2> 944 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /overseer/collection-queue-work [junit4] 2> 965 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.OverseerAutoReplicaFailoverThread Starting OverseerAutoReplicaFailoverThread autoReplicaFailoverWorkLoopDelay=10000 autoReplicaFailoverWaitAfterExpiration=30000 autoReplicaFailoverBadNodeExpiration=60000 [junit4] 2> 1062 INFO (OverseerCollectionConfigSetProcessor-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [ ] o.a.s.c.OverseerTaskProcessor Process current queue of overseer operations [junit4] 2> 1079 INFO (OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [ ] o.a.s.c.Overseer Starting to work on the main queue [junit4] 2> 1087 INFO (OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [ ] o.a.s.c.Overseer processMessage: queueSize: 1, message = { [junit4] 2> "operation":"state", [junit4] 2> "state":"recovering", [junit4] 2> "node_name":"node1", [junit4] 2> "core":"core1", [junit4] 2> "core_node_name":"core_node1", [junit4] 2> "collection":"collection1", [junit4] 2> "numShards":"1", [junit4] 2> "base_url":"http://node1/solr/"} current state version: 0 [junit4] 2> 1094 INFO (OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [ ] o.a.s.c.o.ReplicaMutator Update state numShards=1 message={ [junit4] 2> "operation":"state", [junit4] 2> "state":"recovering", [junit4] 2> "node_name":"node1", [junit4] 2> "core":"core1", [junit4] 2> "core_node_name":"core_node1", [junit4] 2> "collection":"collection1", [junit4] 2> "numShards":"1", [junit4] 2> "base_url":"http://node1/solr/"} [junit4] 2> 1095 INFO (OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [ ] o.a.s.c.o.ClusterStateMutator building a new cName: collection1 [junit4] 2> 1102 INFO (OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [ ] o.a.s.c.o.ReplicaMutator Assigning new node to shard shard=shard1 [junit4] 2> 1107 INFO (zkCallback-4-thread-1) [ ] o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred - updating... (live nodes size: 1) [junit4] 2> 1107 INFO (zkCallback-3-thread-1) [ ] o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred - updating... (live nodes size: 1) [junit4] 2> 1578 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /collections/collection1/leader_elect/shard1/election [junit4] 2> 1599 INFO (zkCallback-3-thread-1) [ ] o.a.s.c.c.ZkStateReader A collections change: WatchedEvent state:SyncConnected type:NodeChildrenChanged path:/collections, has occurred - updating... [junit4] 2> 1599 INFO (zkCallback-4-thread-1) [ ] o.a.s.c.c.ZkStateReader A collections change: WatchedEvent state:SyncConnected type:NodeChildrenChanged path:/collections, has occurred - updating... [junit4] 2> 1607 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.LeaderElector Joined leadership election with path: /collections/collection1/leader_elect/shard1/election/94836228734386179-node1_core1-n_0000000000 [junit4] 2> 1611 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /collections/collection1/leaders/shard1 [junit4] 2> 1616 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.ShardLeaderElectionContextBase Creating leader registration node [junit4] 2> 1628 INFO (OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [ ] o.a.s.c.Overseer processMessage: queueSize: 1, message = { [junit4] 2> "operation":"state", [junit4] 2> "state":"active", [junit4] 2> "shard":"shard1", [junit4] 2> "collection":"collection1", [junit4] 2> "base_url":"http://node1/solr/", [junit4] 2> "node_name":"node1", [junit4] 2> "core_node_name":"core_node1", [junit4] 2> "core":"core1"} current state version: 1 [junit4] 2> 1629 INFO (OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [ ] o.a.s.c.o.ReplicaMutator Update state numShards=null message={ [junit4] 2> "operation":"state", [junit4] 2> "state":"active", [junit4] 2> "shard":"shard1", [junit4] 2> "collection":"collection1", [junit4] 2> "base_url":"http://node1/solr/", [junit4] 2> "node_name":"node1", [junit4] 2> "core_node_name":"core_node1", [junit4] 2> "core":"core1"} [junit4] 2> 1646 INFO (OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [ ] o.a.s.c.Overseer processMessage: queueSize: 1, message = { [junit4] 2> "operation":"leader", [junit4] 2> "shard":"shard1", [junit4] 2> "collection":"collection1", [junit4] 2> "base_url":"http://node1/solr/", [junit4] 2> "core":"core1"} current state version: 1 [junit4] 2> 1665 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.Overseer Overseer (id=94836228734386178-127.0.0.1:36981_solr-n_0000000000) closing [junit4] 2> 1666 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.ElectionContext Canceling election /overseer_elect/election/94836228734386178-127.0.0.1:36981_solr-n_0000000000 [junit4] 2> 1675 INFO (OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000000) [ ] o.a.s.c.Overseer Overseer Loop exiting : 127.0.0.1:36981_solr [junit4] 2> 1676 INFO (zkCallback-3-thread-1) [ ] o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred - updating... (live nodes size: 1) [junit4] 2> 1677 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.LeaderElector Joined leadership election with path: /overseer_elect/election/94836228734386178-127.0.0.1:36981_solr-n_0000000001 [junit4] 2> 1677 INFO (zkCallback-4-thread-1) [ ] o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred - updating... (live nodes size: 1) [junit4] 2> 1682 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.OverseerElectionContext I am going to be the leader 127.0.0.1:36981_solr [junit4] 2> 1683 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.c.SolrZkClient makePath: /overseer_elect/leader [junit4] 2> 1683 INFO (OverseerExitThread) [ ] o.a.s.c.Overseer I'm exiting , but I'm still the leader [junit4] 2> 1686 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.Overseer Overseer (id=94836228734386178-127.0.0.1:36981_solr-n_0000000001) starting [junit4] 2> 1705 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.OverseerAutoReplicaFailoverThread Starting OverseerAutoReplicaFailoverThread autoReplicaFailoverWorkLoopDelay=10000 autoReplicaFailoverWaitAfterExpiration=30000 autoReplicaFailoverBadNodeExpiration=60000 [junit4] 2> 1705 INFO (OverseerCollectionConfigSetProcessor-94836228734386178-127.0.0.1:36981_solr-n_0000000001) [ ] o.a.s.c.OverseerTaskProcessor Process current queue of overseer operations [junit4] 2> 1706 INFO (OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000001) [ ] o.a.s.c.Overseer Starting to work on the main queue [junit4] 2> 1708 INFO (OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000001) [ ] o.a.s.c.Overseer processMessage: workQueueSize: 2, message = { [junit4] 2> "operation":"state", [junit4] 2> "state":"active", [junit4] 2> "shard":"shard1", [junit4] 2> "collection":"collection1", [junit4] 2> "base_url":"http://node1/solr/", [junit4] 2> "node_name":"node1", [junit4] 2> "core_node_name":"core_node1", [junit4] 2> "core":"core1"} [junit4] 2> 1709 INFO (OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000001) [ ] o.a.s.c.o.ReplicaMutator Update state numShards=null message={ [junit4] 2> "operation":"state", [junit4] 2> "state":"active", [junit4] 2> "shard":"shard1", [junit4] 2> "collection":"collection1", [junit4] 2> "base_url":"http://node1/solr/", [junit4] 2> "node_name":"node1", [junit4] 2> "core_node_name":"core_node1", [junit4] 2> "core":"core1"} [junit4] 2> 1709 INFO (zkCallback-3-thread-1) [ ] o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred - updating... (live nodes size: 1) [junit4] 2> 1709 INFO (zkCallback-4-thread-1) [ ] o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred - updating... (live nodes size: 1) [junit4] 2> 1711 INFO (OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000001) [ ] o.a.s.c.Overseer processMessage: workQueueSize: 2, message = { [junit4] 2> "operation":"leader", [junit4] 2> "shard":"shard1", [junit4] 2> "collection":"collection1", [junit4] 2> "base_url":"http://node1/solr/", [junit4] 2> "core":"core1"} [junit4] 2> 1712 INFO (zkCallback-3-thread-1) [ ] o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred - updating... (live nodes size: 1) [junit4] 2> 1747 INFO (zkCallback-3-thread-1) [ ] o.a.s.c.c.ZkStateReader A live node change: WatchedEvent state:SyncConnected type:NodeChildrenChanged path:/live_nodes, has occurred - updating... (live nodes size: 1) [junit4] 2> 1758 INFO (zkCallback-4-thread-2) [ ] o.a.s.c.c.ZkStateReader A cluster state change: WatchedEvent state:SyncConnected type:NodeDataChanged path:/clusterstate.json, has occurred - updating... (live nodes size: 1) [junit4] 2> 1759 WARN (zkCallback-4-thread-2) [ ] o.a.s.c.c.ZkStateReader ZooKeeper watch triggered, but Solr cannot talk to ZK [junit4] 2> 1762 ERROR (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]-EventThread) [ ] o.a.z.ClientCnxn Error while calling watcher [junit4] 2> java.util.concurrent.RejectedExecutionException: Task org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor$1@28ba4cca rejected from org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor@6db8d7b9[Shutting down, pool size = 2, active threads = 0, queued tasks = 0, completed tasks = 7] [junit4] 2> at java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:2047) [junit4] 2> at java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:823) [junit4] 2> at java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:1369) [junit4] 2> at org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.execute(ExecutorUtil.java:214) [junit4] 2> at java.util.concurrent.AbstractExecutorService.submit(AbstractExecutorService.java:112) [junit4] 2> at org.apache.solr.common.cloud.SolrZkClient$3.process(SolrZkClient.java:266) [junit4] 2> at org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:522) [junit4] 2> at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:498) [junit4] 2> 1772 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.ZkTestServer connecting to 127.0.0.1:36981 36981 [junit4] 2> 1896 INFO (Thread-1) [ ] o.a.s.c.ZkTestServer connecting to 127.0.0.1:36981 36981 [junit4] 2> 1905 WARN (Thread-1) [ ] o.a.s.c.ZkTestServer Watch limit violations: [junit4] 2> Maximum concurrent create/delete watches above limit: [junit4] 2> [junit4] 2> 2 /solr/aliases.json [junit4] 2> [junit4] 2> Maximum concurrent data watches above limit: [junit4] 2> [junit4] 2> 2 /solr/clusterstate.json [junit4] 2> [junit4] 2> Maximum concurrent children watches above limit: [junit4] 2> [junit4] 2> 2 /solr/live_nodes [junit4] 2> 2 /solr/collections [junit4] 2> 2 /solr/overseer/collection-queue-work [junit4] 2> [junit4] 2> 1905 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.SolrTestCaseJ4 ###Ending testOverseerStatsReset [junit4] 2> 1916 INFO (TEST-OverseerTest.testOverseerStatsReset-seed#[D0EFDBDB7BF104A]) [ ] o.a.s.c.Overseer Overseer (id=94836228734386178-127.0.0.1:36981_solr-n_0000000001) closing [junit4] 2> 1916 INFO (OverseerStateUpdate-94836228734386178-127.0.0.1:36981_solr-n_0000000001) [ ] o.a.s.c.Overseer Overseer Loop exiting : 127.0.0.1:36981_solr [junit4] 2> 1917 ERROR (OverseerExitThread) [ ] o.a.s.c.Overseer could not read the data [junit4] 2> org.apache.zookeeper.KeeperException$SessionExpiredException: KeeperErrorCode = Session expired for /overseer_elect/leader [junit4] 2> at org.apache.zookeeper.KeeperException.create(KeeperException.java:127) [junit4] 2> at org.apache.zookeeper.KeeperException.create(KeeperException.java:51) [junit4] 2> at org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:1155) [junit4] 2> at org.apache.solr.common.cloud.SolrZkClient$7.execute(SolrZkClient.java:353) [junit4] 2> at org.apache.solr.common.cloud.SolrZkClient$7.execute(SolrZkClient.java:350) [junit4] 2> at org.apache.solr.common.cloud.ZkCmdExecutor.retryOperation(ZkCmdExecutor.java:61) [junit4] 2> at org.apache.solr.common.cloud.SolrZkClient.getData(SolrZkClient.java:350) [junit4] 2> at org.apache.solr.cloud.Overseer$ClusterStateUpdater.checkIfIamStillLeader(Overseer.java:304) [junit4] 2> at org.apache.solr.cloud.Overseer$ClusterStateUpdater.access$300(Overseer.java:87) [junit4] 2> at org.apache.solr.cloud.Overseer$ClusterStateUpdater$2.run(Overseer.java:265) [junit4] 2> NOTE: download the large Jenkins line-docs file by running 'ant get-jenkins-line-docs' in the lucene directory. [junit4] 2> NOTE: reproduce with: ant test -Dtestcase=OverseerTest -Dtests.method=testOverseerStatsReset -Dtests.seed=D0EFDBDB7BF104A -Dtests.slow=true -Dtests.linedocsfile=/home/jenkins/lucene-data/enwiki.random.lines.txt -Dtests.locale=en_IN -Dtests.timezone=Antarctica/McMurdo -Dtests.asserts=true -Dtests.file.encoding=ISO-8859-1 [junit4] FAILURE 1.85s J3 | OverseerTest.testOverseerStatsReset <<< [junit4] > Throwable #1: java.lang.AssertionError: expected:<0> but was:<1> [junit4] > at __randomizedtesting.SeedInfo.seed([D0EFDBDB7BF104A:A65A1A812265B344]:0) [junit4] > at org.apache.solr.cloud.OverseerTest.testOverseerStatsReset(OverseerTest.java:741) [junit4] > at java.lang.Thread.run(Thread.java:745) [junit4] 2> 4921 INFO (SUITE-OverseerTest-seed#[D0EFDBDB7BF104A]-worker) [ ] o.a.s.SolrTestCaseJ4 ###deleteCore [junit4] 2> NOTE: leaving temporary files on disk at: /home/sarowe/svn/lucene/dev/trunk/solr/build/solr-core/test/J3/temp/solr.cloud.OverseerTest_D0EFDBDB7BF104A-001 [junit4] 2> NOTE: test params are: codec=DummyCompressingStoredFields(storedFieldsFormat=CompressingStoredFieldsFormat(compressionMode=DUMMY, chunkSize=1, maxDocsPerChunk=860, blockSize=7), termVectorsFormat=CompressingTermVectorsFormat(compressionMode=DUMMY, chunkSize=1, blockSize=7)), sim=RandomSimilarityProvider(queryNorm=false,coord=yes): {}, locale=en_IN, timezone=Antarctica/McMurdo [junit4] 2> NOTE: Linux 4.1.0-custom2-amd64 amd64/Oracle Corporation 1.8.0_45 (64-bit)/cpus=16,threads=1,free=423825696,total=514850816 [junit4] 2> NOTE: All tests run in this JVM: [OverseerTest] [junit4] Completed [1/10] on J3 in 6.32s, 1 test, 1 failure <<< FAILURES! {noformat} I don't understand why this test fails so frequently when run with other Solr tests, but then so rarely fails when run in isolation - note that unlike standard Solr tests, only one test is being run per JVM with the way I'm running it. > OverseerTest failures > --------------------- > > Key: SOLR-8249 > URL: https://issues.apache.org/jira/browse/SOLR-8249 > Project: Solr > Issue Type: Bug > Reporter: Noble Paul > Assignee: Noble Paul > Attachments: SOLR-8249.patch, SOLR-8249.patch, SOLR-8249.patch, > SOLR-8249.patch, test.output > > > http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Windows/5377/ > This is related to SOLR-7989 . [~ichattopadhyaya] we need to fix the testcase -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org