Solrcloud TimeoutException: Idle timeout expired

2019-01-29 Thread Schaum Mallik
I am seeing this error in our logs. Our Solr heap is set to more than 10G.
Any clues which anyone can provide will be very helpful.

Thank you

null:java.io.IOException: java.util.concurrent.TimeoutException: Idle
timeout expired: 12/12 ms
at 
org.eclipse.jetty.server.HttpInput$ErrorState.noContent(HttpInput.java:1075)
at org.eclipse.jetty.server.HttpInput.read(HttpInput.java:313)
at 
org.apache.solr.servlet.ServletInputStreamWrapper.read(ServletInputStreamWrapper.java:74)
at 
org.apache.commons.io.input.ProxyInputStream.read(ProxyInputStream.java:100)
at 
org.apache.solr.common.util.FastInputStream.readWrappedStream(FastInputStream.java:79)
at 
org.apache.solr.common.util.FastInputStream.refill(FastInputStream.java:88)
at 
org.apache.solr.common.util.FastInputStream.peek(FastInputStream.java:60)
at 
org.apache.solr.handler.loader.JavabinLoader.parseAndLoadDocs(JavabinLoader.java:107)
at 
org.apache.solr.handler.loader.JavabinLoader.load(JavabinLoader.java:55)
at 
org.apache.solr.handler.UpdateRequestHandler$1.load(UpdateRequestHandler.java:97)
at 
org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:68)
at 
org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:199)
at org.apache.solr.core.SolrCore.execute(SolrCore.java:2539)
at org.apache.solr.servlet.HttpSolrCall.execute(HttpSolrCall.java:709)
at org.apache.solr.servlet.HttpSolrCall.call(HttpSolrCall.java:515)
at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:377)
at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:323)
at 
org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1634)
at 
org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:533)
at 
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:146)
at 
org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:548)
at 
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:132)
at 
org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:257)
at 
org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:1595)
at 
org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:255)
at 
org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1253)
at 
org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:203)
at 
org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:473)
at 
org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:1564)
at 
org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:201)
at 
org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1155)
at 
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:144)
at 
org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:219)
at 
org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:126)
at 
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:132)
at 
org.eclipse.jetty.rewrite.handler.RewriteHandler.handle(RewriteHandler.java:335)
at 
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:132)
at org.eclipse.jetty.server.Server.handle(Server.java:531)
at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:352)
at 
org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:260)
at 
org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:281)
at org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:102)
at org.eclipse.jetty.io.ChannelEndPoint$2.run(ChannelEndPoint.java:118)
at 
org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.runTask(EatWhatYouKill.java:333)
at 
org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.doProduce(EatWhatYouKill.java:310)
at 
org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.tryProduce(EatWhatYouKill.java:168)
at 
org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.run(EatWhatYouKill.java:126)
at 
org.eclipse.jetty.util.thread.ReservedThreadExecutor$ReservedThread.run(ReservedThreadExecutor.java:366)
at 
org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:760)
at 
org.eclipse.jetty.util.thread.QueuedThreadPool$2.run(QueuedThreadPool.java:678)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.util.concurrent.TimeoutException: Idle timeout
expired: 12/12 ms
at 

Solrcloud TimeoutException: Idle timeout expired

2019-01-28 Thread Schaum Mallik
I am seeing this error in our logs. Our Solr heap is set to more than 10G.
Any clues which anyone can provide will be very helpful.

Thank you

null:java.io.IOException: java.util.concurrent.TimeoutException: Idle
timeout expired: 12/12 ms
at 
org.eclipse.jetty.server.HttpInput$ErrorState.noContent(HttpInput.java:1075)
at org.eclipse.jetty.server.HttpInput.read(HttpInput.java:313)
at 
org.apache.solr.servlet.ServletInputStreamWrapper.read(ServletInputStreamWrapper.java:74)
at 
org.apache.commons.io.input.ProxyInputStream.read(ProxyInputStream.java:100)
at 
org.apache.solr.common.util.FastInputStream.readWrappedStream(FastInputStream.java:79)
at 
org.apache.solr.common.util.FastInputStream.refill(FastInputStream.java:88)
at 
org.apache.solr.common.util.FastInputStream.peek(FastInputStream.java:60)
at 
org.apache.solr.handler.loader.JavabinLoader.parseAndLoadDocs(JavabinLoader.java:107)
at 
org.apache.solr.handler.loader.JavabinLoader.load(JavabinLoader.java:55)
at 
org.apache.solr.handler.UpdateRequestHandler$1.load(UpdateRequestHandler.java:97)
at 
org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:68)
at 
org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:199)
at org.apache.solr.core.SolrCore.execute(SolrCore.java:2539)
at org.apache.solr.servlet.HttpSolrCall.execute(HttpSolrCall.java:709)
at org.apache.solr.servlet.HttpSolrCall.call(HttpSolrCall.java:515)
at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:377)
at 
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:323)
at 
org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1634)
at 
org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:533)
at 
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:146)
at 
org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:548)
at 
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:132)
at 
org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:257)
at 
org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:1595)
at 
org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:255)
at 
org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1253)
at 
org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:203)
at 
org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:473)
at 
org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:1564)
at 
org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:201)
at 
org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1155)
at 
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:144)
at 
org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:219)
at 
org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:126)
at 
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:132)
at 
org.eclipse.jetty.rewrite.handler.RewriteHandler.handle(RewriteHandler.java:335)
at 
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:132)
at org.eclipse.jetty.server.Server.handle(Server.java:531)
at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:352)
at 
org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:260)
at 
org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:281)
at org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:102)
at org.eclipse.jetty.io.ChannelEndPoint$2.run(ChannelEndPoint.java:118)
at 
org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.runTask(EatWhatYouKill.java:333)
at 
org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.doProduce(EatWhatYouKill.java:310)
at 
org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.tryProduce(EatWhatYouKill.java:168)
at 
org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.run(EatWhatYouKill.java:126)
at 
org.eclipse.jetty.util.thread.ReservedThreadExecutor$ReservedThread.run(ReservedThreadExecutor.java:366)
at 
org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:760)
at 
org.eclipse.jetty.util.thread.QueuedThreadPool$2.run(QueuedThreadPool.java:678)
at java.lang.Thread.run(Thread.java:748)
Caused by: java.util.concurrent.TimeoutException: Idle timeout
expired: 12/12 ms
at 

Zookeeper restart on solr basic authentication password change

2019-01-28 Thread Schaum Mallik
I recently updated the password on our 3 node solr cloud. We have 3
zookeeper ensemble serving it.  I have one question. Do I need to restart
the zookeeper ensemble after password change?

Thank you


Re: SolrCoreInitializationException after restart of one solr node

2018-09-20 Thread Schaum Mallik
Sure will do thank you

On Thursday, September 20, 2018, Erick Erickson 
wrote:

> Try "bin/solr zk rm -help"
> On Thu, Sep 20, 2018 at 8:51 AM Schaum Mallik 
> wrote:
> >
> > Sure Shawn will do what you suggested in the previous emails. Thank you
> so
> > much
> >
> > On Thursday, September 20, 2018, Shawn Heisey 
> wrote:
> >
> > > On 9/20/2018 9:32 AM, Schaum Mallik wrote:
> > >
> > >> ‘Then use "bin/solr zk rm" to get rid of it from ZK.‘ <— can you give
> the
> > >> full command for this one if you don’t mind
> > >>
> > >
> > > Before doing this, try what I suggested.  I am not sure that you need
> to
> > > mess with what you have in ZK.
> > >
> > > If you have core.properties files in ZK, that should not cause any
> issues
> > > with any current version of Solr, or the upcoming 8.0 as far as I
> know.  I
> > > suppose it's possible that it might cause problems with a future
> version,
> > > but honestly I doubt that it will ever be a problem.
> > >
> > > The path to delete in ZK would be /configs/XXX/core.properties, where
> XXX
> > > is the name of the config.  The name of the config can be the same as
> the
> > > core, but can also be different.  I think you can probably use this
> command:
> > >
> > > bin/solr zk rm /configs/XXX/core.properties -z ZKHOST
> > >
> > > Where XXX is the name of the collection and ZKHOST is the ZKHOST string
> > > that your Solr installs are using, listing all your ZK servers.  I have
> > > tested this and it should work.
> > >
> > > Thanks,
> > > Shawn
> > >
> > >
>


Re: SolrCoreInitializationException after restart of one solr node

2018-09-20 Thread Schaum Mallik
Sure Shawn will do what you suggested in the previous emails. Thank you so
much

On Thursday, September 20, 2018, Shawn Heisey  wrote:

> On 9/20/2018 9:32 AM, Schaum Mallik wrote:
>
>> ‘Then use "bin/solr zk rm" to get rid of it from ZK.‘ <— can you give the
>> full command for this one if you don’t mind
>>
>
> Before doing this, try what I suggested.  I am not sure that you need to
> mess with what you have in ZK.
>
> If you have core.properties files in ZK, that should not cause any issues
> with any current version of Solr, or the upcoming 8.0 as far as I know.  I
> suppose it's possible that it might cause problems with a future version,
> but honestly I doubt that it will ever be a problem.
>
> The path to delete in ZK would be /configs/XXX/core.properties, where XXX
> is the name of the config.  The name of the config can be the same as the
> core, but can also be different.  I think you can probably use this command:
>
> bin/solr zk rm /configs/XXX/core.properties -z ZKHOST
>
> Where XXX is the name of the collection and ZKHOST is the ZKHOST string
> that your Solr installs are using, listing all your ZK servers.  I have
> tested this and it should work.
>
> Thanks,
> Shawn
>
>


Re: SolrCoreInitializationException after restart of one solr node

2018-09-20 Thread Schaum Mallik
‘Then use "bin/solr zk rm" to get rid of it from ZK.‘ <— can you give the
full command for this one if you don’t mind

Thank you


On Thursday, September 20, 2018, Erick Erickson 
wrote:

> bq. In response to this mistake that I did of keeping the core.properties
> in
> the configuration directory when it was uploaded to zookeeper, how should I
> go about fixing it?
>
> Oh my. You're saying that core.properties is _in_ ZooKeeper? Never
> seen that before ;).
>
> OK, I think I see what's happening. Somehow you have nodes that have
> the core.properties file in your ...solr/server/configsets. Your
> SOLR_HOME some ancestor of that directory and the recursive search is
> finding it and trying to load it.
>
> So just remove it from that directory on all the Solr nodes where it
> exists. I'd have Solr shut down at the time, but that's probably not
> strictly necessary.
>
> Then use "bin/solr zk rm" to get rid of it from ZK.
>
> If possible, I'd do all this with all the Solr instances shut down,
> but that's probably not absolutely necessary, I'm just paranoid.
>
> On Thu, Sep 20, 2018 at 8:13 AM Schaum Mallik 
> wrote:
> >
> > In response to this mistake that I did of keeping the core.properties in
> > the configuration directory when it was uploaded to zookeeper, how
> should I
> > go about fixing it?
> >
> > Thank you
> >
> > On Thursday, September 20, 2018, Schaum Mallik 
> > wrote:
> >
> > > Hi Eric
> > >
> > > I created the collection the way you detailed it in here. The
> > > configuration directory for the collections was copied over from the
> > > standalone solr 6. the core.properties existed in the conf directory
> and I
> > > didn’t realize it would cause this issue.
> > >
> > > Also all the nodes are solr 7.4.
> > >
> > > Thank you
> > >
> > > On Thursday, September 20, 2018, Erick Erickson <
> erickerick...@gmail.com>
> > > wrote:
> > >
> > >> I agree with Shawn that having
> > >> instanceDir=/opt/solr/server/solr/configsets/articles as where your
> > >> core is located is strange, how are you starting Solr? In particular,
> > >> do those nodes use a "-s' parameter to redefine SOLR_HOME?
> > >>
> > >> The "-d" option (assuming you created the collection with bin/solr)
> > >> just tells Solr where the configset you're using is located on your
> > >> local disk, it does _not_ put the instanceDir there, it just looks for
> > >> the configset to upload to ZK.
> > >>
> > >> Check if there's a "core.properties" file in
> > >> "/opt/solr/server/solr/configsets/articles". The presence of that
> file
> > >> indicates that that's the home of the replica.
> > >>
> > >> Now to the very weird bit. This sounds an awful lot like
> > >> https://issues.apache.org/jira/browse/SOLR-11503. Especially if your
> > >> old system was Solr 6.6.1 or 6.6.2.. Short form, "core.properties"
> > >> needs to contain a, you guessed it, "coreNodeName" property that
> > >> corresponds to what's in ZooKeeper.
> > >>
> > >> However, I simply cannot reconcile this being the problem with what
> > >> you're reporting. There's code in 7x that repairs this issue
> > >> automatically. Is there any chance for the node in question that it's
> > >> _not_ running 7.4 and still on 6.6.1/2? Maybe Shawn's latest comment
> > >> would reconcile that?
> > >>
> > >> How to fix. Well, fixing should be automatic unless the fixer-upper
> > >> code is no longer in 7.4, but on a quick check the code _is_ in 7.4.
> > >> > Make very, very sure the Solr you're running on the nodes in
> question
> > >> is 7.4, or at least _NOT_ 6.6.1/2
> > >> > Make very, very sure that you don't specify some strange "-s"
> parameter
> > >> or have defined SOLR_HOME to point to /opt/solr/server/solr/
> configsets/articles,
> > >> although that shouldn't really matter.
> > >> > When you DELETEREPLICA, go to the node and manually see if the
> > >> directory /opt/solr/server/solr/configsets/articles exists. It
> shouldn't
> > >> (see the deleteInstanceDir option in the DELETEREPLICA for why).
> > >>
> > >> If none of that makes any difference, please show us the full output
> of
> > >> > ps aux | grep solr
> > >> > expor

Re: SolrCoreInitializationException after restart of one solr node

2018-09-20 Thread Schaum Mallik
Thanks Shawn. Will follow your advice on this one. Thank you so much

On Thursday, September 20, 2018, Shawn Heisey  wrote:

> On 9/20/2018 9:25 AM, Schaum Mallik wrote:
>
>> ok so that’s the problem. The core.properties in the replicas under
>> /opt/solr/server/solr. So if I remove the file from all the replica
>> folders
>> and also move the directories under /opt/solr/server/solr/configsets to
>> some backup location and restart this node then hopefully it should start
>> working correct?
>>
>
> The cores with names that include 'shard' and 'replica' are likely fine
> and should not be touched.
>
> It seems to be the ones under configsets that are the problem.  I would
> only move instanceDir locations that give you the error in the log.  Any
> directory with core.properties that doesn't give you an error, don't touch.
>
> Thanks,
> Shawn
>
>


Re: SolrCoreInitializationException after restart of one solr node

2018-09-20 Thread Schaum Mallik
ok so that’s the problem. The core.properties in the replicas under
/opt/solr/server/solr. So if I remove the file from all the replica folders
and also move the directories under /opt/solr/server/solr/configsets to
some backup location and restart this node then hopefully it should start
working correct?

Thank you

On Thursday, September 20, 2018, Shawn Heisey  wrote:

> On 9/20/2018 9:13 AM, Schaum Mallik wrote:
>
>> In response to this mistake that I did of keeping the core.properties in
>> the configuration directory when it was uploaded to zookeeper, how should
>> I
>> go about fixing it?
>>
>
> A core.properties file in the config in ZK will not cause any problems.
> It shouldn't be there, but it would be ignored.
>
> This problem is happening because you have a core.properties file in
> directories under /opt/solr/server/solr, so Solr is seeing that and loading
> the location as a core. Because you're in cloud mode, it tries to get
> information about the node from ZK, and it's not there, because when those
> core.properties files were created, the instance was NOT running in cloud
> mode.
>
> Thanks,
> Shawn
>
>


Re: SolrCoreInitializationException after restart of one solr node

2018-09-20 Thread Schaum Mallik
In response to this mistake that I did of keeping the core.properties in
the configuration directory when it was uploaded to zookeeper, how should I
go about fixing it?

Thank you

On Thursday, September 20, 2018, Schaum Mallik 
wrote:

> Hi Eric
>
> I created the collection the way you detailed it in here. The
> configuration directory for the collections was copied over from the
> standalone solr 6. the core.properties existed in the conf directory and I
> didn’t realize it would cause this issue.
>
> Also all the nodes are solr 7.4.
>
> Thank you
>
> On Thursday, September 20, 2018, Erick Erickson 
> wrote:
>
>> I agree with Shawn that having
>> instanceDir=/opt/solr/server/solr/configsets/articles as where your
>> core is located is strange, how are you starting Solr? In particular,
>> do those nodes use a "-s' parameter to redefine SOLR_HOME?
>>
>> The "-d" option (assuming you created the collection with bin/solr)
>> just tells Solr where the configset you're using is located on your
>> local disk, it does _not_ put the instanceDir there, it just looks for
>> the configset to upload to ZK.
>>
>> Check if there's a "core.properties" file in
>> "/opt/solr/server/solr/configsets/articles". The presence of that file
>> indicates that that's the home of the replica.
>>
>> Now to the very weird bit. This sounds an awful lot like
>> https://issues.apache.org/jira/browse/SOLR-11503. Especially if your
>> old system was Solr 6.6.1 or 6.6.2.. Short form, "core.properties"
>> needs to contain a, you guessed it, "coreNodeName" property that
>> corresponds to what's in ZooKeeper.
>>
>> However, I simply cannot reconcile this being the problem with what
>> you're reporting. There's code in 7x that repairs this issue
>> automatically. Is there any chance for the node in question that it's
>> _not_ running 7.4 and still on 6.6.1/2? Maybe Shawn's latest comment
>> would reconcile that?
>>
>> How to fix. Well, fixing should be automatic unless the fixer-upper
>> code is no longer in 7.4, but on a quick check the code _is_ in 7.4.
>> > Make very, very sure the Solr you're running on the nodes in question
>> is 7.4, or at least _NOT_ 6.6.1/2
>> > Make very, very sure that you don't specify some strange "-s" parameter
>> or have defined SOLR_HOME to point to 
>> /opt/solr/server/solr/configsets/articles,
>> although that shouldn't really matter.
>> > When you DELETEREPLICA, go to the node and manually see if the
>> directory /opt/solr/server/solr/configsets/articles exists. It shouldn't
>> (see the deleteInstanceDir option in the DELETEREPLICA for why).
>>
>> If none of that makes any difference, please show us the full output of
>> > ps aux | grep solr
>> > export (looking for you redefining SOLR_HOME)
>> > the "core.properties" file in the indicated directory.
>>
>> Best,
>> Erick
>>
>> On Thu, Sep 20, 2018 at 7:36 AM Shawn Heisey  wrote:
>> >
>> > On 9/20/2018 8:22 AM, Schaum Mallik wrote:
>> > > Thanks for the response Shawn.
>> > >
>> > > My follow up question is how would the zookeeper ensemble know that
>> the
>> > > location of the indexes has changed? Also do I need to apply the same
>> > > changes to the other 2 solr nodes which are working fine?
>> >
>> > This move is not to change the location, it's to stop Solr from trying
>> > to load the indexes completely.  Solr will no longer try to load them.
>> > I think these are invalid indexes from when the Solr instance was NOT
>> > running in cloud mode, and have nothing at all to do with your working
>> > collections.  But just in case I'm wrong, I'm having you move them
>> > instead of delete them, so they can be put back if it turns out they
>> > actually were needed.
>> >
>> > ZooKeeper doesn't know anything at all about Solr and the data it puts
>> > in ZK.  It is just a service that stores cluster data where all nodes
>> > can read it and handles elections when it is asked to.
>> >
>> > Thanks,
>> > Shawn
>> >
>>
>


Re: SolrCoreInitializationException after restart of one solr node

2018-09-20 Thread Schaum Mallik
Hi Eric

I created the collection the way you detailed it in here. The configuration
directory for the collections was copied over from the standalone solr 6.
the core.properties existed in the conf directory and I didn’t realize it
would cause this issue.

Also all the nodes are solr 7.4.

Thank you

On Thursday, September 20, 2018, Erick Erickson 
wrote:

> I agree with Shawn that having
> instanceDir=/opt/solr/server/solr/configsets/articles as where your
> core is located is strange, how are you starting Solr? In particular,
> do those nodes use a "-s' parameter to redefine SOLR_HOME?
>
> The "-d" option (assuming you created the collection with bin/solr)
> just tells Solr where the configset you're using is located on your
> local disk, it does _not_ put the instanceDir there, it just looks for
> the configset to upload to ZK.
>
> Check if there's a "core.properties" file in
> "/opt/solr/server/solr/configsets/articles". The presence of that file
> indicates that that's the home of the replica.
>
> Now to the very weird bit. This sounds an awful lot like
> https://issues.apache.org/jira/browse/SOLR-11503. Especially if your
> old system was Solr 6.6.1 or 6.6.2.. Short form, "core.properties"
> needs to contain a, you guessed it, "coreNodeName" property that
> corresponds to what's in ZooKeeper.
>
> However, I simply cannot reconcile this being the problem with what
> you're reporting. There's code in 7x that repairs this issue
> automatically. Is there any chance for the node in question that it's
> _not_ running 7.4 and still on 6.6.1/2? Maybe Shawn's latest comment
> would reconcile that?
>
> How to fix. Well, fixing should be automatic unless the fixer-upper
> code is no longer in 7.4, but on a quick check the code _is_ in 7.4.
> > Make very, very sure the Solr you're running on the nodes in question is
> 7.4, or at least _NOT_ 6.6.1/2
> > Make very, very sure that you don't specify some strange "-s" parameter
> or have defined SOLR_HOME to point to 
> /opt/solr/server/solr/configsets/articles,
> although that shouldn't really matter.
> > When you DELETEREPLICA, go to the node and manually see if the directory
> /opt/solr/server/solr/configsets/articles exists. It shouldn't (see the
> deleteInstanceDir option in the DELETEREPLICA for why).
>
> If none of that makes any difference, please show us the full output of
> > ps aux | grep solr
> > export (looking for you redefining SOLR_HOME)
> > the "core.properties" file in the indicated directory.
>
> Best,
> Erick
>
> On Thu, Sep 20, 2018 at 7:36 AM Shawn Heisey  wrote:
> >
> > On 9/20/2018 8:22 AM, Schaum Mallik wrote:
> > > Thanks for the response Shawn.
> > >
> > > My follow up question is how would the zookeeper ensemble know that the
> > > location of the indexes has changed? Also do I need to apply the same
> > > changes to the other 2 solr nodes which are working fine?
> >
> > This move is not to change the location, it's to stop Solr from trying
> > to load the indexes completely.  Solr will no longer try to load them.
> > I think these are invalid indexes from when the Solr instance was NOT
> > running in cloud mode, and have nothing at all to do with your working
> > collections.  But just in case I'm wrong, I'm having you move them
> > instead of delete them, so they can be put back if it turns out they
> > actually were needed.
> >
> > ZooKeeper doesn't know anything at all about Solr and the data it puts
> > in ZK.  It is just a service that stores cluster data where all nodes
> > can read it and handles elections when it is asked to.
> >
> > Thanks,
> > Shawn
> >
>


Re: SolrCoreInitializationException after restart of one solr node

2018-09-20 Thread Schaum Mallik
Hi Shawn

Sorry for being repetitive but want to make sure I understand you clearly.

So my real collection data sits just outside the configsets folder. So if I
move the location mentioned under instancedir to some backup location and
then restart solr cloud you think this non working node will get the real
collection data sitting outside the configsets dir.

I really appreciate the time and effort you are taking in answering my
queries.

Thank you
On Thursday, September 20, 2018, Shawn Heisey  wrote:

> On 9/20/2018 8:44 AM, Schaum Mallik wrote:
>
>> Thank you for your detailed responses. I am still kind of confused though.
>> Just to give you some more insight. When I first created the cloud I
>> created to collections and the ‘-d’ option pointed to the directory where
>> the config for the collection was stored which was in
>> /opt/solr/server/solr/configsets/articles/conf
>>
>> so as you are saying if I move the replica data to another location and
>> restart the node and most probably solr might complain then what happens
>> next. This solr node was working fine a week back and the indexes were
>> upgraded when they were moved from solr 6 to solr 7
>>
>
> I really think that those cores that won't initialize are remnants from a
> non-cloud install and probably have *never* worked.  For some reason, you
> didn't notice these errors until now.
>
> If you take a non-cloud Solr instance that has indexes and add ZK_HOST to
> make it a cloud instance, the cores that existed before cloud mode will NOT
> work.
>
> I just completed a test where I did that exact sequence of converting a
> node from non-cloud with existing indexes to cloud, and ran into the exact
> same errors you're seeing.  I can relay exact details of the test if
> required.
>
> I think that moving the non-cloud cores out so Solr doesn't try to load
> them will likely fix this issue.
>
> Thanks,
> Shawn
>
>


Re: SolrCoreInitializationException after restart of one solr node

2018-09-20 Thread Schaum Mallik
Thank you for your detailed responses. I am still kind of confused though.
Just to give you some more insight. When I first created the cloud I
created to collections and the ‘-d’ option pointed to the directory where
the config for the collection was stored which was in
/opt/solr/server/solr/configsets/articles/conf

so as you are saying if I move the replica data to another location and
restart the node and most probably solr might complain then what happens
next. This solr node was working fine a week back and the indexes were
upgraded when they were moved from solr 6 to solr 7

Let me know your thoughts

Thank you

On Thursday, September 20, 2018, Shawn Heisey  wrote:

> On 9/20/2018 8:22 AM, Schaum Mallik wrote:
>
>> Thanks for the response Shawn.
>>
>> My follow up question is how would the zookeeper ensemble know that the
>> location of the indexes has changed? Also do I need to apply the same
>> changes to the other 2 solr nodes which are working fine?
>>
>
> This move is not to change the location, it's to stop Solr from trying to
> load the indexes completely.  Solr will no longer try to load them.  I
> think these are invalid indexes from when the Solr instance was NOT running
> in cloud mode, and have nothing at all to do with your working
> collections.  But just in case I'm wrong, I'm having you move them instead
> of delete them, so they can be put back if it turns out they actually were
> needed.
>
> ZooKeeper doesn't know anything at all about Solr and the data it puts in
> ZK.  It is just a service that stores cluster data where all nodes can read
> it and handles elections when it is asked to.
>
> Thanks,
> Shawn
>
>


Re: SolrCoreInitializationException after restart of one solr node

2018-09-20 Thread Schaum Mallik
Thanks for the response Shawn.

My follow up question is how would the zookeeper ensemble know that the
location of the indexes has changed? Also do I need to apply the same
changes to the other 2 solr nodes which are working fine?
Thanks

On Thursday, September 20, 2018, Shawn Heisey  wrote:

> On 9/20/2018 6:02 AM, Schaum Mallik wrote:
>
>> Yeah my indexes, read and write works fine on the other two solr nodes.
>> Since I have this setup running in prod currently what are the steps you
>> will advice I take to resolve this issue. Starting from scratch is really
>> not an option since it will involve a huge cost. Is there some workaround
>> that you guys can suggest.
>> when you say move the instance dir outside the /op/solr are you
>> suggesting,
>> modifying this value 'SOLR_HOME'?
>>
>
> No, I mean literally moving the directory somewhere else. Something like
> this:
>
> mkdir /tmp/index-keep
> mv /opt/solr/server/solr/configsets/articles /tmp/index-keep/.
> mv /opt/solr/server/solr/configsets/categories /tmp/index-keep/.
> # move any other non-cloud cores out
> # restart Solr here
>
> Thanks,
> Shawn
>
>


Re: SolrCoreInitializationException after restart of one solr node

2018-09-20 Thread Schaum Mallik
Yeah my indexes, read and write works fine on the other two solr nodes.
Since I have this setup running in prod currently what are the steps you
will advice I take to resolve this issue. Starting from scratch is really
not an option since it will involve a huge cost. Is there some workaround
that you guys can suggest.
when you say move the instance dir outside the /op/solr are you suggesting,
modifying this value 'SOLR_HOME'?

Thank you


On Thu, Sep 20, 2018 at 3:00 AM Shawn Heisey  wrote:

> On 9/19/2018 9:20 PM, Schaum Mallik wrote:
> > The data and index get stored under
> > /opt/solr/server/solr/articles_shard1_replica_n1.
> > The config directory when the collection was created, that time the path
> to
> > the config was given as '/opt/solr/server/solr/configsets/articles'. I
> > didn't use the service installer script. The other two solr nodes are
> > working without any issue. Any ideas how I can resolve this.
>
> Do your indexes still work after startup, even with those initialization
> errors?  I think there's a good chance that they do.
>
> This looks to me like what you did was start with a non-cloud install
> that was using the configsets feature, so instanceDir actually WAS in
> configsets, and then converted it to a cloud setup, at which point those
> old cores wouldn't have worked right, so you probably created the
> collections again.  I'm betting that if you moved those instancedirs
> that are under configsets somewhere else (not under /opt/solr at all),
> that everything MIGHT be fine after another restart, based on the fact
> that in the core list I see xxx_shardN_replica_nM cores for articles and
> categories, which would probably be the REAL indexes being used for
> those names.
>
> Converting from cloud to non-cloud is a process that requires
> understanding exactly how Solr deals with cores at startup, and isn't
> something I would recommend anyone ever try do.  Set your cloud up from
> scratch in cloud mode.  If you're using the configsets feature in
> non-cloud mode, that will confuse things even more.
>
> Also, I strongly recommend using the service installer script.  It
> creates a much cleaner install that I find much easier to understand.
> The data is entirely separate from the program install.
>
> Thanks,
> Shawn
>
>


Re: SolrCoreInitializationException after restart of one solr node

2018-09-19 Thread Schaum Mallik
I also want to add one other things. I had moved from a single core solr
instance on solr 6.6 to the solr cloud few months back. I had ran the
indexupgrader tool on the indexes before I moved them to the solr cloud.

On Wed, Sep 19, 2018 at 7:29 PM Shawn Heisey  wrote:

> On 9/19/2018 8:22 PM, Schaum Mallik wrote:
> > I have a 3 zookeeper ensemble and 3 solr nodes running version 7.4.0.
> > Recently I had to restart one node and after I did that it started
> throwing
> > this exception.
> 
> > Caused by: org.apache.solr.common.SolrException: No coreNodeName
> > for
> >
> CoreDescriptor[name=articles;instanceDir=/opt/solr/server/solr/configsets/articles]
>
> It is VERY weird for instanceDir to be under the configsets directory.
> Especially for SolrCloud.  I know you're in cloud mode because the
> ZK-related classes are heavily mentioned in the stacktrace.
>
> Can you share the entire solr.log file after a restart?  If that
> directory really is the instanceDir, there should be a core.properties
> file.  What are its contents?
>
> Did you use the service installer script?  If you did, having an
> instanceDir underneath configsets is even MORE strange.
>
> Thanks,
> Shawn
>
>


Re: SolrCoreInitializationException after restart of one solr node

2018-09-19 Thread Schaum Mallik
ategories
x:categories] o.a.s.c.ZkController

org.apache.solr.common.SolrException: No coreNodeName for
CoreDescriptor[name=categories;instanceDir=/opt/solr/server/solr/configsets/categories]

at
org.apache.solr.cloud.ZkController.checkStateInZk(ZkController.java:1716)
~[solr-core-7.4.0.jar:7.4.0 9060ac689c270b02143f375de0348b7f626adebc -
jpountz - 2018-06-18 16:55:13]

at org.apache.solr.cloud.ZkController.preRegister(ZkController.java:1654)
~[solr-core-7.4.0.jar:7.4.0 9060ac689c270b02143f375de0348b7f626adebc -
jpountz - 2018-06-18 16:55:13]

at
org.apache.solr.core.CoreContainer.createFromDescriptor(CoreContainer.java:1128)
~[solr-core-7.4.0.jar:7.4.0 9060ac689c270b02143f375de0348b7f626adebc -
jpountz - 2018-06-18 16:55:13]

at
org.apache.solr.core.CoreContainer.lambda$load$13(CoreContainer.java:681)
~[solr-core-7.4.0.jar:7.4.0 9060ac689c270b02143f375de0348b7f626adebc -
jpountz - 2018-06-18 16:55:13]

at
com.codahale.metrics.InstrumentedExecutorService$InstrumentedCallable.call(InstrumentedExecutorService.java:197)
~[metrics-core-3.2.6.jar:3.2.6]

at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[?:1.8.0_171]

at
org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:209)
~[solr-solrj-7.4.0.jar:7.4.0 9060ac689c270b02143f375de0348b7f626adebc -
jpountz - 2018-06-18 16:55:14]

at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
[?:1.8.0_171]

at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
[?:1.8.0_171]

at java.lang.Thread.run(Thread.java:748) [?:1.8.0_171]

2018-09-20 03:01:36.002 ERROR
(coreContainerWorkExecutor-2-thread-1-processing-n:solr1:8983_solr) [   ]
o.a.s.c.CoreContainer Error waiting for SolrCore to be loaded on startup

org.apache.solr.common.SolrException: Unable to create core [categories]

at
org.apache.solr.core.CoreContainer.createFromDescriptor(CoreContainer.java:1156)
~[solr-core-7.4.0.jar:7.4.0 9060ac689c270b02143f375de0348b7f626adebc -
jpountz - 2018-06-18 16:55:13]

at
org.apache.solr.core.CoreContainer.lambda$load$13(CoreContainer.java:681)
~[solr-core-7.4.0.jar:7.4.0 9060ac689c270b02143f375de0348b7f626adebc -
jpountz - 2018-06-18 16:55:13]

at
com.codahale.metrics.InstrumentedExecutorService$InstrumentedCallable.call(InstrumentedExecutorService.java:197)
~[metrics-core-3.2.6.jar:3.2.6]

at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[?:1.8.0_171]

at
org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:209)
~[solr-solrj-7.4.0.jar:7.4.0 9060ac689c270b02143f375de0348b7f626adebc -
jpountz - 2018-06-18 16:55:14]

at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
[?:1.8.0_171]

at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
[?:1.8.0_171]

at java.lang.Thread.run(Thread.java:748) [?:1.8.0_171]

Caused by: org.apache.solr.common.SolrException:

at org.apache.solr.cloud.ZkController.preRegister(ZkController.java:1685)
~[solr-core-7.4.0.jar:7.4.0 9060ac689c270b02143f375de0348b7f626adebc -
jpountz - 2018-06-18 16:55:13]

at
org.apache.solr.core.CoreContainer.createFromDescriptor(CoreContainer.java:1128)
~[solr-core-7.4.0.jar:7.4.0 9060ac689c270b02143f375de0348b7f626adebc -
jpountz - 2018-06-18 16:55:13]

... 7 more

Caused by: org.apache.solr.common.SolrException: No coreNodeName for
CoreDescriptor[name=categories;instanceDir=/opt/solr/server/solr/configsets/categories]

at
org.apache.solr.cloud.ZkController.checkStateInZk(ZkController.java:1716)
~[solr-core-7.4.0.jar:7.4.0 9060ac689c270b02143f375de0348b7f626adebc -
jpountz - 2018-06-18 16:55:13]

at org.apache.solr.cloud.ZkController.preRegister(ZkController.java:1654)
~[solr-core-7.4.0.jar:7.4.0 9060ac689c270b02143f375de0348b7f626adebc -
jpountz - 2018-06-18 16:55:13]

at
org.apache.solr.core.CoreContainer.createFromDescriptor(CoreContainer.java:1128)
~[solr-core-7.4.0.jar:7.4.0 9060ac689c270b02143f375de0348b7f626adebc -
jpountz - 2018-06-18 16:55:13]

... 7 more


On Wed, Sep 19, 2018 at 7:29 PM Shawn Heisey  wrote:

> On 9/19/2018 8:22 PM, Schaum Mallik wrote:
> > I have a 3 zookeeper ensemble and 3 solr nodes running version 7.4.0.
> > Recently I had to restart one node and after I did that it started
> throwing
> > this exception.
> 
> > Caused by: org.apache.solr.common.SolrException: No coreNodeName
> > for
> >
> CoreDescriptor[name=articles;instanceDir=/opt/solr/server/solr/configsets/articles]
>
> It is VERY weird for instanceDir to be under the configsets directory.
> Especially for SolrCloud.  I know you're in cloud mode because the
> ZK-related classes are heavily mentioned in the stacktrace.
>
> Can you share the entire solr.log file after a restart?  If that
> directory really is the instanceDir, there should be a core.properties
> file.  What are its contents?
>
> Did you use the service installer 

SolrCoreInitializationException after restart of one solr node

2018-09-19 Thread Schaum Mallik
Hi Guys

I have a 3 zookeeper ensemble and 3 solr nodes running version 7.4.0.
Recently I had to restart one node and after I did that it started throwing
this exception.

{

  "error":{

"metadata":[

  "error-class","org.apache.solr.core.SolrCoreInitializationException",

  "root-error-class","org.apache.solr.common.SolrException"],

"msg":"SolrCore 'articles' is not available due to init failure: ",

"trace":"org.apache.solr.core.SolrCoreInitializationException: SolrCore
'articles' is not available due to init failure: \n\tat
org.apache.solr.core.CoreContainer.getCore(CoreContainer.java:1590)\n\tat
org.apache.solr.servlet.HttpSolrCall.init(HttpSolrCall.java:249)\n\tat
org.apache.solr.servlet.HttpSolrCall.call(HttpSolrCall.java:469)\n\tat
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:377)\n\tat
org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:323)\n\tat
org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1634)\n\tat
org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:533)\n\tat
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:146)\n\tat
org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:548)\n\tat
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:132)\n\tat
org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:257)\n\tat
org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:1595)\n\tat
org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:255)\n\tat
org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1253)\n\tat
org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:203)\n\tat
org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:473)\n\tat
org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:1564)\n\tat
org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:201)\n\tat
org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1155)\n\tat
org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:144)\n\tat
org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:219)\n\tat
org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerCollection.java:126)\n\tat
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:132)\n\tat
org.eclipse.jetty.rewrite.handler.RewriteHandler.handle(RewriteHandler.java:335)\n\tat
org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:132)\n\tat
org.eclipse.jetty.server.Server.handle(Server.java:531)\n\tat
org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:352)\n\tat
org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:260)\n\tat
org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:281)\n\tat
org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:102)\n\tat
org.eclipse.jetty.io.ChannelEndPoint$2.run(ChannelEndPoint.java:118)\n\tat
org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.runTask(EatWhatYouKill.java:333)\n\tat
org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.doProduce(EatWhatYouKill.java:310)\n\tat
org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.tryProduce(EatWhatYouKill.java:168)\n\tat
org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.run(EatWhatYouKill.java:126)\n\tat
org.eclipse.jetty.util.thread.ReservedThreadExecutor$ReservedThread.run(ReservedThreadExecutor.java:366)\n\tat
org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:760)\n\tat
org.eclipse.jetty.util.thread.QueuedThreadPool$2.run(QueuedThreadPool.java:678)\n\tat
java.lang.Thread.run(Thread.java:748)\nCaused by:
org.apache.solr.common.SolrException: \n\tat
org.apache.solr.cloud.ZkController.preRegister(ZkController.java:1685)\n\tat
org.apache.solr.core.CoreContainer.createFromDescriptor(CoreContainer.java:1128)\n\tat
org.apache.solr.core.CoreContainer.lambda$load$13(CoreContainer.java:681)\n\tat
com.codahale.metrics.InstrumentedExecutorService$InstrumentedCallable.call(InstrumentedExecutorService.java:197)\n\tat
java.util.concurrent.FutureTask.run(FutureTask.java:266)\n\tat
org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:209)\n\tat
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)\n\tat
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)\n\t...
1 more\nCaused by: org.apache.solr.common.SolrException: No coreNodeName
for
CoreDescriptor[name=articles;instanceDir=/opt/solr/server/solr/configsets/articles]\n\tat
org.apache.solr.cloud.ZkController.checkStateInZk(ZkController.java:1716)\n\tat
org.apache.solr.cloud.ZkController.preRegister(ZkController.java:1654)\n\t...
8 more\n",

"code":500}}


I tried dropping