Hello all,
I'm running out of space when trying to restart nodes to get a cluster
back to fully operational after a node ran out of space during an optimize.
It appears to be trying to do a full sync from another node, but it doesn't
check available space before starting downloads, and it doesn't delete the
out-of-date segment files before attempting the full sync.
If the segments are out of date and we are pulling from another node
before coming "online", why aren't the old segments deleted first? Is this
something that can be enabled in the master solrconfig.xml file?
It seems to know the size of the segments before they are transferred, so
is there a reason a basic disk space check isn't done on the target
partition, with an immediate abort if the destination's free space would
go negative, before attempting the sync? Could that also be enabled in
solrconfig.xml? This would be a lot more useful (IMHO) than waiting for a
full sync to run out of space after several hundred gigs of data have been
transferred, with automatic cluster recovery failing as a result.
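To illustrate the kind of pre-flight check I mean, here's a rough sketch.
This is not an existing Solr feature; the index directory and the
expected-bytes figure are placeholder arguments standing in for the data
dir and the total segment size the leader reports before the fetch:

```shell
# Sketch only: abort before fetching if the target partition can't
# hold the transfer. Arguments are hypothetical, not Solr-provided.
check_free_space() {
    index_dir="$1"      # directory on the partition holding the index
    expected_bytes="$2" # total bytes the leader says it will send

    # df -P prints POSIX-format output in 1K blocks; column 4 is "Available".
    avail_kb=$(df -P "$index_dir" | awk 'NR==2 {print $4}')
    avail_bytes=$((avail_kb * 1024))

    if [ "$avail_bytes" -lt "$expected_bytes" ]; then
        echo "ABORT: need $expected_bytes bytes, only $avail_bytes free" >&2
        return 1
    fi
    return 0
}

# Example: refuse to start pulling a ~5 GB .cfs file onto a full disk.
check_free_space / 5257809205 || echo "would skip full sync here"
```

Doing this once per file (or once for the whole fileList total) before the
first download starts would fail fast instead of after hundreds of gigs.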
This happens when doing a 'sudo service solr restart'.
(Workaround: shut down the offending node, manually delete the segment
index folders and tlog files, then start the node.)
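For anyone hitting the same thing, the workaround steps can be sketched as
below. The /var/solr/data/<core>/data layout is an assumption from a
default install; check your own solr home before deleting anything:

```shell
# Workaround sketch: drop the stale index and tlog so the node has room
# to full-sync a fresh copy on startup. Paths are assumptions.
clean_stale_index() {
    data_dir="$1"   # e.g. /var/solr/data/mycore/data (hypothetical path)
    rm -rf "$data_dir/index"   # stale/partial segment files
    rm -rf "$data_dir/tlog"    # transaction logs
}

# Intended sequence (run by hand on the offending node):
# sudo service solr stop                          # 1. shut the node down
# clean_stale_index /var/solr/data/mycore/data   # 2. free the space
# sudo service solr start                        # 3. node full-syncs clean
```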
Exception:
WARN - 2016-11-28 16:15:16.291; org.apache.solr.handler.IndexFetcher$FileFetcher; Error in fetching file: _2f6i.cfs (downloaded 2317352960 of 5257809205 bytes)
java.io.IOException: No space left on device
at sun.nio.ch.FileDispatcherImpl.write0(Native Method)
at sun.nio.ch.FileDispatcherImpl.write(FileDispatcherImpl.java:60)
at sun.nio.ch.IOUtil.writeFromNativeBuffer(IOUtil.java:93)
at sun.nio.ch.IOUtil.write(IOUtil.java:65)
at sun.nio.ch.FileChannelImpl.write(FileChannelImpl.java:211)
at java.nio.channels.Channels.writeFullyImpl(Channels.java:78)
at java.nio.channels.Channels.writeFully(Channels.java:101)
at java.nio.channels.Channels.access$000(Channels.java:61)
at java.nio.channels.Channels$1.write(Channels.java:174)
at org.apache.lucene.store.FSDirectory$FSIndexOutput$1.write(FSDirectory.java:419)
at java.util.zip.CheckedOutputStream.write(CheckedOutputStream.java:73)
at java.io.BufferedOutputStream.write(BufferedOutputStream.java:122)
at org.apache.lucene.store.OutputStreamIndexOutput.writeBytes(OutputStreamIndexOutput.java:53)
at org.apache.solr.handler.IndexFetcher$DirectoryFile.write(IndexFetcher.java:1634)
at org.apache.solr.handler.IndexFetcher$FileFetcher.fetchPackets(IndexFetcher.java:1491)
at org.apache.solr.handler.IndexFetcher$FileFetcher.fetchFile(IndexFetcher.java:1429)
at org.apache.solr.handler.IndexFetcher.downloadIndexFiles(IndexFetcher.java:855)
at org.apache.solr.handler.IndexFetcher.fetchLatestIndex(IndexFetcher.java:434)
at org.apache.solr.handler.IndexFetcher.fetchLatestIndex(IndexFetcher.java:251)
at org.apache.solr.handler.ReplicationHandler.doFetch(ReplicationHandler.java:397)
at org.apache.solr.cloud.RecoveryStrategy.replicate(RecoveryStrategy.java:156)
at org.apache.solr.cloud.RecoveryStrategy.doRecovery(RecoveryStrategy.java:408)
at org.apache.solr.cloud.RecoveryStrategy.run(RecoveryStrategy.java:221)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.run(FutureTask.java:266)
at org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:229)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
-Mike