[ https://issues.apache.org/jira/browse/SOLR-6995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Darren Spehr closed SOLR-6995. ------------------------------ Resolution: Duplicate > On a new cluster setup, new nodes never move out of the 'down' state. > --------------------------------------------------------------------- > > Key: SOLR-6995 > URL: https://issues.apache.org/jira/browse/SOLR-6995 > Project: Solr > Issue Type: Bug > Components: SolrCloud > Affects Versions: 4.10.3 > Reporter: Darren Spehr > > This is related to a question I posted on > [stackoverflow|http://stackoverflow.com/questions/28004832/solr-4-10-3-is-not-proceeding-to-leader-election-on-new-cluster-startup-hangs] > a day ago. > When deploying a new cluster, new nodes never proceed to an 'active' state. I > have used both a custom (yet simple) deployment, as well as using the example > framework you get when downloading the Solr distribution. Further inspection > of the logs, and comparing them to the output of a 4.10.1 start up, it looks > like the nodes never move on to leader election. > For an apples-to-apples comparison I moved the solr/collection1 directory to > solr/ttPoiMDS, and modified the core.properties appropriately. I also removed > the 'conf' directory, assuming that it would pick this up from our ZooKeeper > cluster. > Here is the start up: > {code} > java > -Dcollection.configName=ttPoiMDS > -Djava.util.logging.config.file=etc/logging.properties > -DzkHost=our_zk_host_1:<zk_port> > -Djetty.port=8983 > -jar start.jar > {code} > Here is the output in the logs: > {code} > 0 [main] INFO org.eclipse.jetty.server.Server – jetty-8.1.10.v20130312 > 32 [main] INFO org.eclipse.jetty.deploy.providers.ScanningAppProvider – > Deployment monitor /site/pkgs/solr/solr-4.10.3/example/contexts at interval 0 > 38 [main] INFO org.eclipse.jetty.deploy.DeploymentManager – Deployable > added: /site/pkgs/solr/solr-4.10.3/example/contexts/solr-jetty-context.xml > 114 [main] INFO org.eclipse.jetty.webapp.WebInfConfiguration – Extract > jar:file:/site/pkgs/solr/solr-4.10.3/example/webapps/solr.war!/ to > /site/pkgs/solr/solr-4.10.3/example/solr-webapp/webapp > 1527 [main] INFO org.eclipse.jetty.webapp.StandardDescriptorProcessor – NO > JSP Support for /solr, did not find org.apache.jasper.servlet.JspServlet > 1597 [main] INFO org.apache.solr.servlet.SolrDispatchFilter – > SolrDispatchFilter.init() > 1616 [main] INFO org.apache.solr.core.SolrResourceLoader – JNDI not > configured for solr (NoInitialContextEx) > 1616 [main] INFO org.apache.solr.core.SolrResourceLoader – solr home > defaulted to 'solr/' (could not find system property or JNDI) > 1617 [main] INFO org.apache.solr.core.SolrResourceLoader – new > SolrResourceLoader for directory: 'solr/' > 1748 [main] INFO org.apache.solr.core.ConfigSolr – Loading container > configuration from /site/pkgs/solr/solr-4.10.3/example/solr/solr.xml > 1884 [main] INFO org.apache.solr.core.CoresLocator – Config-defined core > root directory: /site/pkgs/solr/solr-4.10.3/example/solr > 1890 [main] INFO org.apache.solr.core.CoreContainer – New CoreContainer > 378496804 > 1890 [main] INFO org.apache.solr.core.CoreContainer – Loading cores into > CoreContainer [instanceDir=solr/] > 1905 [main] INFO org.apache.solr.handler.component.HttpShardHandlerFactory > – Setting socketTimeout to: 0 > 1905 [main] INFO org.apache.solr.handler.component.HttpShardHandlerFactory > – Setting urlScheme to: null > 1909 [main] INFO org.apache.solr.handler.component.HttpShardHandlerFactory > – Setting connTimeout to: 0 > 1911 [main] INFO org.apache.solr.handler.component.HttpShardHandlerFactory > – Setting maxConnectionsPerHost to: 20 > 1912 [main] INFO org.apache.solr.handler.component.HttpShardHandlerFactory > – Setting corePoolSize to: 0 > 1912 [main] INFO org.apache.solr.handler.component.HttpShardHandlerFactory > – Setting maximumPoolSize to: 2147483647 > 1912 [main] INFO org.apache.solr.handler.component.HttpShardHandlerFactory > – Setting maxThreadIdleTime to: 5 > 1912 [main] INFO org.apache.solr.handler.component.HttpShardHandlerFactory > – Setting sizeOfQueue to: -1 > 1912 [main] INFO org.apache.solr.handler.component.HttpShardHandlerFactory > – Setting fairnessPolicy to: false > 2054 [main] INFO org.apache.solr.update.UpdateShardHandler – Creating > UpdateShardHandler HTTP client with params: > socketTimeout=0&connTimeout=0&retry=false > 2056 [main] INFO org.apache.solr.logging.LogWatcher – SLF4J impl is > org.slf4j.impl.Log4jLoggerFactory > 2057 [main] INFO org.apache.solr.logging.LogWatcher – Registering Log > Listener [Log4j (org.slf4j.impl.Log4jLoggerFactory)] > 2058 [main] INFO org.apache.solr.core.CoreContainer – Host Name: > 2059 [main] INFO org.apache.solr.core.ZkContainer – Zookeeper > client=our_zk_host_1:2195 > 2140 [main] INFO org.apache.solr.common.cloud.ConnectionManager – Waiting > for client to connect to ZooKeeper > 2220 [main-EventThread] INFO org.apache.solr.common.cloud.ConnectionManager > – Watcher org.apache.solr.common.cloud.ConnectionManager@36dadde6 > name:ZooKeeperConnection Watcher:our_zk_host_1:2195 got event WatchedEvent > state:SyncConnected type:None path:null path:null type:None > 2220 [main] INFO org.apache.solr.common.cloud.ConnectionManager – Client is > connected to ZooKeeper > 2293 [main] INFO org.apache.solr.common.cloud.SolrZkClient – makePath: > /overseer/queue > 2447 [main] INFO org.apache.solr.common.cloud.SolrZkClient – makePath: > /overseer/collection-queue-work > 2609 [main] INFO org.apache.solr.common.cloud.SolrZkClient – makePath: > /overseer/collection-map-running > 2758 [main] INFO org.apache.solr.common.cloud.SolrZkClient – makePath: > /overseer/collection-map-completed > 2919 [main] INFO org.apache.solr.common.cloud.SolrZkClient – makePath: > /overseer/collection-map-failure > 3093 [main] INFO org.apache.solr.common.cloud.SolrZkClient – makePath: > /live_nodes > 3173 [main] INFO org.apache.solr.cloud.ZkController – Register node as live > in ZooKeeper:/live_nodes/our_solr_host:8983_solr > 3201 [main] INFO org.apache.solr.common.cloud.SolrZkClient – makePath: > /live_nodes/our_solr_host:8983_solr > 3371 [main] INFO org.apache.solr.common.cloud.SolrZkClient – makePath: > /overseer_elect > 3477 [main] INFO org.apache.solr.common.cloud.SolrZkClient – makePath: > /overseer_elect/election > 3577 [main] INFO org.apache.solr.cloud.Overseer – Overseer (id=null) closing > 3659 [main] INFO org.apache.solr.cloud.ElectionContext – I am going to be > the leader our_solr_host:8983_solr > 3662 [main] INFO org.apache.solr.common.cloud.SolrZkClient – makePath: > /overseer_elect/leader > 3763 [main] INFO org.apache.solr.cloud.Overseer – Overseer > (id=93155855038349319-our_solr_host:8983_solr-n_0000000000) starting > 3887 [main] INFO org.apache.solr.common.cloud.SolrZkClient – makePath: > /overseer/queue-work > 4409 [main] INFO org.apache.solr.cloud.OverseerAutoReplicaFailoverThread – > Starting OverseerAutoReplicaFailoverThread > autoReplicaFailoverWorkLoopDelay=10000 > autoReplicaFailoverWaitAfterExpiration=30000 > autoReplicaFailoverBadNodeExpiration=60000 > 4438 > [OverseerCollectionProcessor-93155855038349319-our_solr_host:8983_solr-n_0000000000] > INFO org.apache.solr.cloud.OverseerCollectionProcessor – Process current > queue of collection creations > 4456 [main] INFO org.apache.solr.common.cloud.SolrZkClient – makePath: > /clusterstate.json > 4559 [main] INFO org.apache.solr.common.cloud.ZkStateReader – Updating > cluster state from ZooKeeper... > 4731 > [OverseerStateUpdate-93155855038349319-our_solr_host:8983_solr-n_0000000000] > INFO org.apache.solr.cloud.Overseer – Starting to work on the main queue > 4736 [main] INFO org.apache.solr.core.CoresLocator – Looking for core > definitions underneath /site/pkgs/solr/solr-4.10.3/example/solr > 4744 [main] INFO org.apache.solr.core.CoresLocator – Found core ttPoiMDS in > /site/pkgs/solr/solr-4.10.3/example/solr/ttPoiMDS/ > 4744 [main] INFO org.apache.solr.core.CoresLocator – Found 1 core > definitions > 4767 [coreLoadExecutor-6-thread-1] INFO org.apache.solr.cloud.ZkController > – publishing core=ttPoiMDS state=down collection=ttPoiMDS > 4767 [coreLoadExecutor-6-thread-1] INFO org.apache.solr.cloud.ZkController > – numShards not found on descriptor - reading it from system property > 4796 [coreLoadExecutor-6-thread-1] INFO org.apache.solr.cloud.ZkController > – look for our core node name > 4797 [zkCallback-2-thread-1] INFO org.apache.solr.cloud.DistributedQueue – > LatchChildWatcher fired on path: /overseer/queue state: SyncConnected type > NodeChildrenChanged > 4897 > [OverseerStateUpdate-93155855038349319-our_solr_host:8983_solr-n_0000000000] > INFO org.apache.solr.cloud.Overseer – Update state numShards=null message={ > "operation":"state", > "shard":null, > "roles":null, > "state":"down", > "core":"ttPoiMDS", > "collection":"ttPoiMDS", > "node_name":"our_solr_host:8983_solr", > "base_url":"http://our_solr_host:8983/solr"} > 4899 > [OverseerStateUpdate-93155855038349319-our_solr_host:8983_solr-n_0000000000] > INFO org.apache.solr.cloud.Overseer – Assigning new node to shard > shard=shard1 > 5052 [zkCallback-2-thread-1] INFO org.apache.solr.common.cloud.ZkStateReader > – A cluster state change: WatchedEvent state:SyncConnected > type:NodeDataChanged path:/clusterstate.json, has occurred - updating... > (live nodes size: 1) > 5800 [coreLoadExecutor-6-thread-1] INFO org.apache.solr.cloud.ZkController > – waiting to find shard id in clusterstate for ttPoiMDS > 5801 [coreLoadExecutor-6-thread-1] INFO org.apache.solr.cloud.ZkController > – Check for collection zkNode:ttPoiMDS > 5924 [coreLoadExecutor-6-thread-1] INFO org.apache.solr.cloud.ZkController > – Collection zkNode exists > 5925 [coreLoadExecutor-6-thread-1] INFO > org.apache.solr.common.cloud.ZkStateReader – Load collection config > from:/collections/ttPoiMDS > 5970 [coreLoadExecutor-6-thread-1] INFO > org.apache.solr.common.cloud.ZkStateReader – path=/collections/ttPoiMDS > configName=ttPoiMDS specified config exists in ZooKeeper > 5970 [coreLoadExecutor-6-thread-1] INFO > org.apache.solr.core.SolrResourceLoader – new SolrResourceLoader for > directory: '/site/pkgs/solr/solr-4.10.3/example/solr/ttPoiMDS/' > 6054 [coreLoadExecutor-6-thread-1] INFO org.apache.solr.core.SolrConfig – > Adding specified lib dirs to ClassLoader > 6055 [coreLoadExecutor-6-thread-1] WARN > org.apache.solr.core.SolrResourceLoader – Can't find (or read) directory to > add to classloader: ../../lib/mq (resolved as: > /site/pkgs/solr/solr-4.10.3/example/solr/ttPoiMDS/../../lib/mq). > 6139 [coreLoadExecutor-6-thread-1] INFO org.apache.solr.core.SolrConfig – > Using Lucene MatchVersion: 4.10.3 > 6238 [coreLoadExecutor-6-thread-1] INFO org.apache.solr.core.Config – > Loaded SolrConfig: solrconfig.xml > 6320 [coreLoadExecutor-6-thread-1] INFO org.apache.solr.schema.IndexSchema > – Reading Solr Schema from /configs/ttPoiMDS/schema.xml > 6347 [coreLoadExecutor-6-thread-1] INFO org.apache.solr.schema.IndexSchema > – [ttPoiMDS] Schema name=poiMDS > 6582 [main] INFO org.apache.solr.servlet.SolrDispatchFilter – > user.dir=/site/pkgs/solr/solr-4.10.3/example > 6582 [main] INFO org.apache.solr.servlet.SolrDispatchFilter – > SolrDispatchFilter.init() done > 6614 [main] INFO org.eclipse.jetty.server.AbstractConnector – Started > SocketConnector@0.0.0.0:8983 > {code} > Startup then seems to hang. The Solr UI is available, but it shows that no > cores are available. In the cloud view the configuration from ZooKeeper is > present. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org