We've never run an index this size on anything but HDFS, so I have no comparison.  What we've been doing is keeping two main collections - one with all data, and one with just the last 30 days of data.  Queries are then routed to one or the other based on their date range.  The 30-day index is significantly faster.
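Roughly, the routing looks like this (collection and field names here are just illustrative, not our real ones):

    # pick the small collection when the requested range fits in the last 30 days
    DAYS_BACK=$1
    if [ "$DAYS_BACK" -le 30 ]; then
      COLLECTION=main_last30    # 30-day collection, much faster
    else
      COLLECTION=main_all       # full collection
    fi
    curl -s "http://localhost:8983/solr/$COLLECTION/select" \
      --data-urlencode "q=*:*" \
      --data-urlencode "fq=timestamp:[NOW-${DAYS_BACK}DAYS TO NOW]"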

My main concern right now is that 6 of the 100 shards are not coming back because they have no leader.  I've never seen this error before.  Any ideas?  CLUSTERSTATUS shows all three replicas with state 'down'.

Thanks!

-joe


On 11/21/2017 2:35 PM, Hendrik Haddorp wrote:
We actually also have some performance issues with HDFS at the moment. We are doing lots of soft commits for NRT search. Those seem to be slower than with local storage. The investigation hasn't gotten very far yet, though.

We have a setup with 2000 collections, with one shard each and a replication factor of 2 or 3. When we restart nodes too fast, that causes problems with the overseer queue, which can lead to the queue getting out of control and Solr pretty much dying. We are still on Solr 6.3; 6.6 has some improvements and should handle these actions faster. I would check what you see for "/solr/admin/collections?action=OVERSEERSTATUS&wt=json". The critical part is the "overseer_queue_size" value. If this goes up to about 10,000, it is pretty much game over on our setup. In that case it seems to be best to stop all nodes, clear the queue in ZK, and then restart the nodes one by one with a gap of about 5 minutes. That normally recovers pretty well.
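For example, something along these lines (host, chroot, and paths are examples; jq is only there to pull the value out):

    # check the overseer queue depth
    curl -s "http://localhost:8983/solr/admin/collections?action=OVERSEERSTATUS&wt=json" |
      jq '.overseer_queue_size'

    # if it has run away: stop all nodes first, then clear the queue in ZooKeeper,
    # e.g. with the zkcli.sh shipped with Solr, and restart the nodes one by one
    server/scripts/cloud-scripts/zkcli.sh -zkhost zk1:2181,zk2:2181,zk3:2181/solr \
      -cmd clear /overseer/queue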

regards,
Hendrik

On 21.11.2017 20:12, Joe Obernberger wrote:
We set the hard commit time long because we were having performance issues with HDFS, and thought that since the block size is 128M, having a longer hard commit made sense.  That was our hypothesis anyway.  Happy to switch it back and see what happens.

I don't know what caused the cluster to go into recovery in the first place.  We had a server die over the weekend, but it's just one out of ~50.  Every shard is 3x replicated (and 3x replicated in HDFS...so 9 copies).  It was at that point that we noticed lots of network activity, with most of the shards stuck in this recovery, fail, retry loop.  That is when we decided to shut it down, resulting in zombie lock files.

I tried using the FORCELEADER call, which completed, but it doesn't seem to have had any effect on the shards that have no leader.  Kinda out of ideas for that problem.  If I can get the cluster back up, I'll try a lower hard commit time.  Thanks again Erick!
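For reference, the call I issued was along these lines (collection and shard names here are just placeholders):

    curl -s "http://localhost:8983/solr/admin/collections?action=FORCELEADER&collection=main&shard=shard42&wt=json"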

-Joe


On 11/21/2017 2:00 PM, Erick Erickson wrote:
Frankly with HDFS I'm a bit out of my depth so listen to Hendrik ;)...

I need to back up a bit. Once nodes are in this state it's not
surprising that they need to be forcefully killed. I was thinking more
about how they got into this situation in the first place. _Before_ you
get into the nasty state, how are the Solr nodes shut down? Forcefully?

Your hard commit interval is far longer than it needs to be, resulting in
much larger tlog files etc. I usually set this at 15-60 seconds with local
disks; I'm not quite sure whether longer intervals are helpful on HDFS.
What this means is that you can spend up to 30 minutes when you
restart Solr _replaying the tlogs_! If Solr is killed, it may not have
had a chance to fsync the segments and may have to replay on startup.
If you have openSearcher set to false, the hard commit operation is
not horribly expensive; it just fsyncs the current segments and opens
new ones. It won't be a total cure, but I bet reducing this interval
would help a lot.
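A sketch of what I mean, using the Config API so you don't have to hand-edit
solrconfig.xml in ZooKeeper (host, collection name, and the exact values are
examples to adapt):

    # drop the hard commit interval to 15 seconds and keep openSearcher off
    curl -s "http://localhost:8983/solr/yourCollection/config" \
      -H 'Content-Type: application/json' \
      -d '{"set-property": {
            "updateHandler.autoCommit.maxTime": 15000,
            "updateHandler.autoCommit.openSearcher": false}}'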

Also, if you stop indexing there's no need to wait 30 minutes if you
issue a manual commit, something like
.../collection/update?commit=true. Just reducing the hard commit
interval will make the wait between stopping indexing and restarting
shorter all by itself if you don't want to issue the manual commit.
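The manual commit is just a plain HTTP call, e.g. (host and collection are
placeholders):

    curl -s "http://localhost:8983/solr/yourCollection/update?commit=true"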

Best,
Erick

On Tue, Nov 21, 2017 at 10:34 AM, Hendrik Haddorp
<hendrik.hadd...@gmx.net> wrote:
Hi,

The write.lock issue I see as well when Solr has not been stopped gracefully. The write.lock files are then left behind in HDFS, as they do not get removed
automatically when the client disconnects, the way an ephemeral node in
ZooKeeper would be. Unfortunately, Solr also does not realize that it should be owning the lock, as it is marked as the owner in the state stored in ZooKeeper, and it is not willing to retry, which is why you need to restart the whole Solr instance after the cleanup. I added some logic to my Solr startup script which scans the lock files in HDFS, compares that with the state in ZooKeeper, and then deletes all lock files that belong to the node that I'm starting.
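The gist of it is something like the following (not my actual script; the HDFS layout under solr.hdfs.home, the node name format, and the use of jq are assumptions you would need to adapt):

    SOLR_URL=http://localhost:8983/solr
    SOLR_HDFS_HOME=/solr                      # assumed solr.hdfs.home
    NODE_NAME=$(hostname -f):8983_solr        # how this node shows up in ZooKeeper

    # ask the cluster state which cores are supposed to live on this node
    CORES=$(curl -s "$SOLR_URL/admin/collections?action=CLUSTERSTATUS&wt=json" |
      jq -r --arg n "$NODE_NAME" \
        '.cluster.collections[].shards[].replicas[] | select(.node_name == $n) | .core')

    # remove only the leftover locks belonging to those cores (node must be down)
    for core in $CORES; do
      hdfs dfs -rm -f "$SOLR_HDFS_HOME/$core/data/index/write.lock"
    done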

regards,
Hendrik


On 21.11.2017 14:07, Joe Obernberger wrote:
Hi All - we have a system with 45 physical boxes running Solr 6.6.1 using
HDFS for the index.  The current index size is about 31 TBytes. With 3x
replication that takes up 93 TBytes of disk. Our main collection is split across 100 shards with 3 replicas each.  The issue that we're running into is when restarting the Solr 6 cluster.  The shards go into recovery and start to saturate nearly all of their network interfaces.  If we start too many of the nodes at once, the shards will go into a recovery, fail, and retry loop and never come up.  The errors are related to HDFS not responding fast enough, plus warnings from the DFSClient.  If we stop a node when this is
happening, the script will force a stop (180 second timeout), and upon
restart we have lock files (write.lock) inside of HDFS.

The process at this point is to start one node, find the lock files, wait for it to come up completely (hours), stop it, delete the write.lock files, and restart.  Usually this second restart is faster, but it can still
take 20-60 minutes.
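Finding and clearing the locks is done by hand with the HDFS shell, roughly like this (the path is an example, and we only do it while the owning node is stopped):

    hdfs dfs -ls -R /solr | grep 'write.lock$'
    hdfs dfs -rm /solr/<core_name>/data/index/write.lock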

The smaller indexes recover much faster (less than 5 minutes). Should we not have used so many replicas with HDFS?  Is there a better way we should
have built the Solr 6 cluster?

Thank you for any insight!

-Joe




