[ https://issues.apache.org/jira/browse/SOLR-8599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15252720#comment-15252720 ]
Keith Laban commented on SOLR-8599:
-----------------------------------

[~anshumg] There were two separate commits for this ticket. Both are on master, but not both of them landed on 6x or 6.0:

{code}
commit e3b785a906d6f93e04f2cb45c436516158af0425
Author: Dennis Gove <dpg...@gmail.com>
Date:   Sun Mar 20 11:13:56 2016 -0400

    SOLR-8599: Improved the tests for this issue to avoid changing a variable to non-final

commit 2c0a5e30364d83dc82383075a5f7c65200022494
Author: Dennis Gove <dpg...@gmail.com>
Date:   Wed Feb 10 15:02:18 2016 -0500

    SOLR-8599: After a failed connection during construction of SolrZkClient attempt to retry until a connection can be made
{code}

Only the first commit found its way to 6x and 6.0, so please port the second commit there, and remember to port both for 5.5.1. Thanks.

> Errors in construction of SolrZooKeeper cause Solr to go into an inconsistent state
> -----------------------------------------------------------------------------------
>
>                 Key: SOLR-8599
>                 URL: https://issues.apache.org/jira/browse/SOLR-8599
>             Project: Solr
>          Issue Type: Bug
>          Components: SolrCloud
>            Reporter: Keith Laban
>            Assignee: Dennis Gove
>             Fix For: master, 6.0, 5.5.1
>
>         Attachments: SOLR-8599.patch, SOLR-8599.patch, SOLR-8599.patch, SOLR-8599.patch
>
>
> We originally saw this happen due to a DNS exception (see stack trace below).
> However, any exception thrown in the constructor of SolrZooKeeper or its
> parent class, ZooKeeper, will cause DefaultConnectionStrategy to fail to
> update the zookeeper client. Once it gets into this state, it will not try to
> connect again until the process is restarted. The node itself will also
> respond successfully to query requests, but not to update requests.
> Two things should be addressed here:
> 1) Fix the error handling and issue some number of retries
> 2) If we are stuck in a state like this, stop responding to all requests
> {code}
> 2016-01-23 13:49:20.222 ERROR ConnectionManager [main-EventThread] - :java.net.UnknownHostException: HOSTNAME: unknown error
> at java.net.Inet6AddressImpl.lookupAllHostAddr(Native Method)
> at java.net.InetAddress$2.lookupAllHostAddr(InetAddress.java:928)
> at java.net.InetAddress.getAddressesFromNameService(InetAddress.java:1323)
> at java.net.InetAddress.getAllByName0(InetAddress.java:1276)
> at java.net.InetAddress.getAllByName(InetAddress.java:1192)
> at java.net.InetAddress.getAllByName(InetAddress.java:1126)
> at org.apache.zookeeper.client.StaticHostProvider.<init>(StaticHostProvider.java:61)
> at org.apache.zookeeper.ZooKeeper.<init>(ZooKeeper.java:445)
> at org.apache.zookeeper.ZooKeeper.<init>(ZooKeeper.java:380)
> at org.apache.solr.common.cloud.SolrZooKeeper.<init>(SolrZooKeeper.java:41)
> at org.apache.solr.common.cloud.DefaultConnectionStrategy.reconnect(DefaultConnectionStrategy.java:53)
> at org.apache.solr.common.cloud.ConnectionManager.process(ConnectionManager.java:132)
> at org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:522)
> at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:498)
> 2016-01-23 13:49:20.222 INFO ConnectionManager [main-EventThread] - Connected:false
> 2016-01-23 13:49:20.222 INFO ClientCnxn [main-EventThread] - EventThread shut down
> {code}
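
For illustration only, the "issue some number of retries" idea from item 1 of the issue description could be sketched roughly as below. This is not the code from either commit and not Solr's API: `RetryingConnect`, `connectWithRetry`, `maxAttempts`, and `delayMs` are hypothetical names, and the `Callable` stands in for whatever constructs the client (for example the SolrZooKeeper constructor that threw the UnknownHostException in the stack trace).

```java
import java.util.concurrent.Callable;

// Hypothetical helper, not part of Solr: retries a failing construction step
// a bounded number of times instead of giving up after the first exception.
public class RetryingConnect {

    /**
     * Calls factory until it returns normally or maxAttempts calls have failed.
     * Sleeps delayMs between failed attempts; rethrows the last failure.
     */
    public static <T> T connectWithRetry(Callable<T> factory, int maxAttempts, long delayMs)
            throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        Exception last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return factory.call();
            } catch (InterruptedException e) {
                // Do not retry on interruption; restore the flag and propagate.
                Thread.currentThread().interrupt();
                throw e;
            } catch (Exception e) {
                // Remember the failure (e.g. a DNS lookup error) and try again.
                last = e;
                if (attempt < maxAttempts) {
                    Thread.sleep(delayMs);
                }
            }
        }
        throw last;
    }
}
```

A caller would wrap the construction in a lambda, e.g. `connectWithRetry(() -> createClient(), 5, 1000L)`, where `createClient` is again a placeholder. Bounding the attempts keeps the failure visible to the caller, which fits item 2: a node that still cannot connect can then stop serving requests rather than stay in the inconsistent state.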