[ 
https://issues.apache.org/jira/browse/SOLR-15399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17347064#comment-17347064
 ] 

Timothy Potter commented on SOLR-15399:
---------------------------------------

Debugging this via beasting ... here's what I'm seeing in the logs so far: 
{code}
  2> 13227 INFO  (TEST-TestPullReplica.testKillLeader-seed#[AF3CA0CC9910FE0]) 
[n:127.0.0.1:63150_solr     ] o.a.s.c.TestPullReplica 
  2> 
  2> >> Sending first doc to collection pull_replica_test_kill_leader
  2> 
  2> 
  2> 13233 DEBUG (indexFetcher-303-thread-1) [     ] o.a.s.h.ReplicationHandler 
Polling for index modifications
  2> 13233 INFO  (indexFetcher-303-thread-1) [     ] o.a.s.h.IndexFetcher Last 
replication failed, so I'll force replication
  2> 13233 INFO  (indexFetcher-303-thread-1) [     ] o.a.s.h.IndexFetcher 
Updated leaderUrl to 
http://127.0.0.1:63150/solr/pull_replica_test_kill_leader_shard1_replica_n1/
  2> 13233 INFO  (qtp831673056-203) [n:127.0.0.1:63150_solr 
c:pull_replica_test_kill_leader s:shard1 r:core_node3 
x:pull_replica_test_kill_leader_shard1_replica_n1 ] o.a.s.c.ZkShardTerms 
Successful update of terms at 
/collections/pull_replica_test_kill_leader/terms/shard1 to 
Terms{values={core_node3=1}, version=1} for ensureHighestTermsAreNotZero
  2> 13233 INFO  (qtp831673056-203) [n:127.0.0.1:63150_solr 
c:pull_replica_test_kill_leader s:shard1 r:core_node3 
x:pull_replica_test_kill_leader_shard1_replica_n1 ] 
o.a.s.u.p.LogUpdateProcessorFactory 
[pull_replica_test_kill_leader_shard1_replica_n1]  webapp=/solr path=/update 
params={wt=javabin&version=2}{add=[1 (1700115601919311872)]} 0 3
  2> 13234 INFO  (qtp831673056-165) [n:127.0.0.1:63150_solr 
c:pull_replica_test_kill_leader s:shard1 r:core_node3 
x:pull_replica_test_kill_leader_shard1_replica_n1 ] o.a.s.c.S.Request 
[pull_replica_test_kill_leader_shard1_replica_n1]  webapp=/solr 
path=/replication 
params={qt=/replication&wt=javabin&version=2&command=indexversion} status=0 
QTime=0
  2> 13235 INFO  (indexFetcher-303-thread-1) [     ] o.a.s.h.IndexFetcher 
Leader's generation: 1
  2> 13235 INFO  (indexFetcher-303-thread-1) [     ] o.a.s.h.IndexFetcher 
Leader's version: 0
  2> 13235 INFO  (indexFetcher-303-thread-1) [     ] o.a.s.h.IndexFetcher 
Follower's generation: 1
  2> 13235 INFO  (indexFetcher-303-thread-1) [     ] o.a.s.h.IndexFetcher 
Follower's version: 0
  2> 13235 INFO  (indexFetcher-303-thread-1) [     ] o.a.s.h.IndexFetcher New 
index in Leader. Deleting mine...
  2> 13236 INFO  (searcherExecutor-292-thread-1) [     ] o.a.s.c.SolrCore 
[pull_replica_test_kill_leader_shard1_replica_p2]  Registered new searcher 
autowarm time: 0 ms
  2> 13236 DEBUG (indexFetcher-303-thread-1) [     ] o.a.s.h.IndexFetcher 
Nothing to replicate, leader's version is 0
  2> 13239 INFO  
(searcherExecutor-294-thread-1-processing-n:127.0.0.1:63150_solr 
x:pull_replica_test_kill_leader_shard1_replica_n1 
c:pull_replica_test_kill_leader s:shard1 r:core_node3) [n:127.0.0.1:63150_solr 
c:pull_replica_test_kill_leader s:shard1 r:core_node3 
x:pull_replica_test_kill_leader_shard1_replica_n1 ] o.a.s.c.SolrCore 
[pull_replica_test_kill_leader_shard1_replica_n1]  Registered new searcher 
autowarm time: 0 ms
  2> 13239 INFO  (qtp831673056-201) [n:127.0.0.1:63150_solr 
c:pull_replica_test_kill_leader s:shard1 r:core_node3 
x:pull_replica_test_kill_leader_shard1_replica_n1 ] 
o.a.s.u.p.LogUpdateProcessorFactory 
[pull_replica_test_kill_leader_shard1_replica_n1]  webapp=/solr path=/update 
params={_stateVer_=pull_replica_test_kill_leader:7&waitSearcher=true&commit=true&softCommit=false&wt=javabin&version=2}{commit=}
 0 4
  2> 13242 INFO  (qtp831673056-146) [n:127.0.0.1:63150_solr 
c:pull_replica_test_kill_leader s:shard1 r:core_node3 
x:pull_replica_test_kill_leader_shard1_replica_n1 ] o.a.s.c.S.Request 
[pull_replica_test_kill_leader_shard1_replica_n1]  webapp=/solr path=/select 
params={q=*:*&wt=javabin&version=2} hits=1 status=0 QTime=0
  2> 13242 INFO  (TEST-TestPullReplica.testKillLeader-seed#[AF3CA0CC9910FE0]) 
[n:127.0.0.1:63150_solr     ] o.a.s.c.TestPullReplica 
  2> 
  2> >> Leader 
http://127.0.0.1:63150/solr/pull_replica_test_kill_leader_shard1_replica_n1/ 
has 1 doc, checking PULL replica at 
http://127.0.0.1:63149/solr/pull_replica_test_kill_leader_shard1_replica_p2/ 
now ...
  2> 
  2> 
  2> 13244 INFO  (qtp954792869-200) [n:127.0.0.1:63149_solr 
c:pull_replica_test_kill_leader s:shard1 r:core_node4 
x:pull_replica_test_kill_leader_shard1_replica_p2 ] o.a.s.c.S.Request 
[pull_replica_test_kill_leader_shard1_replica_p2]  webapp=/solr path=/select 
params={q=*:*&wt=javabin&version=2} hits=0 status=0 QTime=0
  2> 13454 INFO  (qtp954792869-202) [n:127.0.0.1:63149_solr 
c:pull_replica_test_kill_leader s:shard1 r:core_node4 
x:pull_replica_test_kill_leader_shard1_replica_p2 ] o.a.s.c.S.Request 
[pull_replica_test_kill_leader_shard1_replica_p2]  webapp=/solr path=/select 
params={q=*:*&wt=javabin&version=2} hits=0 status=0 QTime=0
  2> 13664 INFO  (qtp954792869-137) [n:127.0.0.1:63149_solr 
c:pull_replica_test_kill_leader s:shard1 r:core_node4 
x:pull_replica_test_kill_leader_shard1_replica_p2 ] o.a.s.c.S.Request 
[pull_replica_test_kill_leader_shard1_replica_p2]  webapp=/solr path=/select 
params={q=*:*&wt=javabin&version=2} hits=0 status=0 QTime=0
  2> 13869 INFO  (qtp954792869-199) [n:127.0.0.1:63149_solr 
c:pull_replica_test_kill_leader s:shard1 r:core_node4 
x:pull_replica_test_kill_leader_shard1_replica_p2 ] o.a.s.c.S.Request 
[pull_replica_test_kill_leader_shard1_replica_p2]  webapp=/solr path=/select 
params={q=*:*&wt=javabin&version=2} hits=0 status=0 QTime=0
  2> 14078 INFO  (qtp954792869-148) [n:127.0.0.1:63149_solr 
c:pull_replica_test_kill_leader s:shard1 r:core_node4 
x:pull_replica_test_kill_leader_shard1_replica_p2 ] o.a.s.c.S.Request 
[pull_replica_test_kill_leader_shard1_replica_p2]  webapp=/solr path=/select 
params={q=*:*&wt=javabin&version=2} hits=0 status=0 QTime=0
  2> 14243 DEBUG (indexFetcher-303-thread-1) [     ] o.a.s.h.ReplicationHandler 
Polling for index modifications
  2> 14243 DEBUG (indexFetcher-303-thread-1) [     ] o.a.s.h.IndexFetcher 
leaderUrl didn't change
  2> 14244 INFO  (qtp831673056-203) [n:127.0.0.1:63150_solr 
c:pull_replica_test_kill_leader s:shard1 r:core_node3 
x:pull_replica_test_kill_leader_shard1_replica_n1 ] o.a.s.c.S.Request 
[pull_replica_test_kill_leader_shard1_replica_n1]  webapp=/solr 
path=/replication 
params={qt=/replication&wt=javabin&version=2&command=indexversion} status=0 
QTime=0
  2> 14245 INFO  (indexFetcher-303-thread-1) [     ] o.a.s.h.IndexFetcher 
Leader's generation: 2
  2> 14245 INFO  (indexFetcher-303-thread-1) [     ] o.a.s.h.IndexFetcher 
Leader's version: 1621356584476
  2> 14245 INFO  (indexFetcher-303-thread-1) [     ] o.a.s.h.IndexFetcher 
Follower's generation: 2
  2> 14245 INFO  (indexFetcher-303-thread-1) [     ] o.a.s.h.IndexFetcher 
Follower's version: 1621356584476
  2> 14245 INFO  (indexFetcher-303-thread-1) [     ] o.a.s.h.IndexFetcher 
Follower in sync with leader.
  2> 14285 INFO  (qtp954792869-305) [n:127.0.0.1:63149_solr 
c:pull_replica_test_kill_leader s:shard1 r:core_node4 
x:pull_replica_test_kill_leader_shard1_replica_p2 ] o.a.s.c.S.Request 
[pull_replica_test_kill_leader_shard1_replica_p2]  webapp=/solr path=/select 
params={q=*:*&wt=javabin&version=2} hits=0 status=0 QTime=0
  2> 14496 INFO  (qtp954792869-237) [n:127.0.0.1:63149_solr 
c:pull_replica_test_kill_leader s:shard1 r:core_node4 
x:pull_replica_test_kill_leader_shard1_replica_p2 ] o.a.s.c.S.Request 
[pull_replica_test_kill_leader_shard1_replica_p2]  webapp=/solr path=/select 
params={q=*:*&wt=javabin&version=2} hits=0 status=0 QTime=0
{code}
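From these logs, it's almost like the PULL replica gets the leader's index (at
14245 the follower reports the same generation and version as the leader) but
doesn't open a new searcher on it: /select on the replica keeps returning
hits=0 afterwards.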
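
To look for the same pattern across many beasting runs, a small script can flag replica queries that still return hits=0 after the fetcher has logged "Follower in sync with leader." The following is only a minimal sketch: the `stale_queries` helper is hypothetical, and the embedded `LOG` is an abbreviated excerpt of the output above (wrapped lines joined, most fields trimmed), not the raw format.

```python
import re

# Abbreviated excerpt of the log above: timestamp, level, then the fields
# that matter here. This trimmed layout is an assumption made for the sketch.
LOG = """\
13242 INFO x:pull_replica_test_kill_leader_shard1_replica_n1 path=/select hits=1
13244 INFO x:pull_replica_test_kill_leader_shard1_replica_p2 path=/select hits=0
14245 INFO o.a.s.h.IndexFetcher Follower in sync with leader.
14285 INFO x:pull_replica_test_kill_leader_shard1_replica_p2 path=/select hits=0
14496 INFO x:pull_replica_test_kill_leader_shard1_replica_p2 path=/select hits=0
"""


def stale_queries(log, replica_core):
    """Return (in_sync_timestamp, [timestamps of hits=0 queries after it]).

    Hypothetical helper: a query counts as stale if it hit `replica_core`
    on /select with hits=0 after an "in sync" message was logged.
    """
    in_sync_at = None
    stale = []
    for line in log.splitlines():
        m = re.match(r"\s*(\d+)\s+(\w+)\s+(.*)", line)
        if not m:
            continue  # skip blank or unrecognized lines
        ts, msg = int(m.group(1)), m.group(3)
        if "Follower in sync with leader" in msg:
            in_sync_at = ts
        elif (in_sync_at is not None
              and replica_core in msg
              and "path=/select" in msg
              and re.search(r"\bhits=0\b", msg)):
            stale.append(ts)
    return in_sync_at, stale


in_sync_at, stale = stale_queries(
    LOG, "pull_replica_test_kill_leader_shard1_replica_p2")
print(in_sync_at, stale)
```

A non-empty list for a run would be consistent with the replica having the leader's index on disk without having opened a new searcher on it, though it does not by itself show why.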

> TestPullReplica and TestPullReplicaWithAuth are flaky
> -----------------------------------------------------
>
>                 Key: SOLR-15399
>                 URL: https://issues.apache.org/jira/browse/SOLR-15399
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Timothy Potter
>            Assignee: Timothy Potter
>            Priority: Major
>
> See:
> https://jenkins.thetaphi.de/job/Solr-main-Linux/537/
> and
> https://jenkins.thetaphi.de/job/Solr-main-Linux/536/
> Unfortunately, these pass 30/30 when beasting locally, but something is
> still awry here ...



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

