[ https://issues.apache.org/jira/browse/SOLR-8275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14999609#comment-14999609 ]
Mike Drob commented on SOLR-8275: --------------------------------- I believe the local exception on node1 was: {noformat} 2015-11-09 13:00:56,161 ERROR org.apache.solr.core.SolrCore: org.apache.solr.common.SolrException: I was asked to wait on state recovering for shard1 in c1 on node2:8983_solr but I still do not see the requested state. I see state: recovering live:true leader from ZK: http://node1:8983/solr/c1_shard1_replica2/ at org.apache.solr.handler.admin.CoreAdminHandler.handleWaitForStateAction(CoreAdminHandler.java:987) at org.apache.solr.handler.admin.CoreAdminHandler.handleRequestInternal(CoreAdminHandler.java:246) at org.apache.solr.handler.admin.CoreAdminHandler.handleRequestBody(CoreAdminHandler.java:189) at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135) at org.apache.solr.servlet.SolrDispatchFilter.handleAdminRequest(SolrDispatchFilter.java:770) at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:262) at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:211) at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:235) at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206) at org.apache.solr.servlet.SolrHadoopAuthenticationFilter$2.doFilter(SolrHadoopAuthenticationFilter.java:288) at org.apache.hadoop.security.authentication.server.AuthenticationFilter.doFilter(AuthenticationFilter.java:592) at org.apache.hadoop.security.token.delegation.web.DelegationTokenAuthenticationFilter.doFilter(DelegationTokenAuthenticationFilter.java:291) at org.apache.hadoop.security.authentication.server.AuthenticationFilter.doFilter(AuthenticationFilter.java:555) at org.apache.solr.servlet.SolrHadoopAuthenticationFilter.doFilter(SolrHadoopAuthenticationFilter.java:293) at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:235) at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206) at org.apache.solr.servlet.HostnameFilter.doFilter(HostnameFilter.java:86) at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:235) at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206) at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233) at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:191) at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127) at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:103) at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109) at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:293) at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:861) at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:620) at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:489) at java.lang.Thread.run(Thread.java:745) {noformat} > Unclear error message during recovery > ------------------------------------- > > Key: SOLR-8275 > URL: https://issues.apache.org/jira/browse/SOLR-8275 > Project: Solr > Issue Type: Bug > Components: SolrCloud > Affects Versions: 4.10.3 > Reporter: Mike Drob > > A SolrCloud install got into a bad state (mostly around LeaderElection, I > think) and during recovery one of the nodes was giving me this message: > {noformat} > 2015-11-09 13:00:56,158 ERROR org.apache.solr.cloud.RecoveryStrategy: Error > while trying to recover. > core=c1_shard1_replica4:java.util.concurrent.ExecutionException: > org.apache.solr.client.solrj.impl.HttpSolrServer$RemoteSolrException: I was > asked to wait on state recovering for shard1 in c1 on node2:8983_solr but I > still do not see the requested state. I see state: recovering live:true > leader from ZK: http://node1:8983/solr/c1_shard1_replica2/ > at java.util.concurrent.FutureTask.report(FutureTask.java:122) > at java.util.concurrent.FutureTask.get(FutureTask.java:192) > at > org.apache.solr.cloud.RecoveryStrategy.sendPrepRecoveryCmd(RecoveryStrategy.java:599) > at > org.apache.solr.cloud.RecoveryStrategy.doRecovery(RecoveryStrategy.java:370) > at org.apache.solr.cloud.RecoveryStrategy.run(RecoveryStrategy.java:236) > Caused by: > org.apache.solr.client.solrj.impl.HttpSolrServer$RemoteSolrException: I was > asked to wait on state recovering for shard1 in c1 on node2:8983_solr but I > still do not see the requested state. I see state: recovering live:true > leader from ZK: http://node1:8983/solr/c1_shard1_replica2/ > at > org.apache.solr.client.solrj.impl.HttpSolrServer.executeMethod(HttpSolrServer.java:621) > at > org.apache.solr.client.solrj.impl.HttpSolrServer$1.call(HttpSolrServer.java:292) > at > org.apache.solr.client.solrj.impl.HttpSolrServer$1.call(HttpSolrServer.java:288) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > {noformat} > The crux of this message: "I was asked to wait on state recovering for shard1 > in c1 on node2:8983_solr but I still do not see the requested state. I see > state: recovering" seems contradictory. At a minimum, we should improve this > error, but there might also be some erroneous logic going on. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org