[ https://issues.apache.org/jira/browse/SOLR-13813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17040279#comment-17040279 ]
Yonik Seeley commented on SOLR-13813: ------------------------------------- Attaching test in progress for live shard splitting with a failure on the leader node during the split. It currently fails for both NRT replicas and shared storage replicas. I'm still investigating and haven't opened a new issue yet because it could be a test issue. When it fails for NRT, it can often be a single missing document. {code} 24745 ERROR (TEST-SharedStorageSplitTest.testLiveSplitFail-seed#[2D0A18F897BF536C]) [ ] o.a.s.s.b.SharedStorageSplitTest MISSING DOCUMENTS: {doc_5137=1} 24748 INFO (TEST-SharedStorageSplitTest.testLiveSplitFail-seed#[2D0A18F897BF536C]) [ ] o.a.s.SolrTestCaseJ4 ###Ending testLiveSplitFail NOTE: reproduce with: ant test -Dtestcase=SharedStorageSplitTest -Dtests.method=testLiveSplitFail -Dtests.seed=2D0A18F897BF536C -Dtests.slow=true -Dtests.badapples=true -Dtests.locale=en-DM -Dtests.timezone=America/Barbados -Dtests.asserts=true -Dtests.file.encoding=UTF-8 java.lang.AssertionError: Documents are missing! Expected :5106 Actual :5107 {code} > Shared storage online split support > ----------------------------------- > > Key: SOLR-13813 > URL: https://issues.apache.org/jira/browse/SOLR-13813 > Project: Solr > Issue Type: Sub-task > Reporter: Yonik Seeley > Priority: Major > Attachments: SOLR-13813.patch > > Time Spent: 1h > Remaining Estimate: 0h > > The strategy for online shard splitting is the same as that for normal (non > SHARED shards.) > During a split, the leader will forward updates to sub-shard leaders, those > updates will be buffered by the transaction log while the split is in > progress, and then the buffered updates are replayed. > One change that was added was to push the local index to blob store after > buffered updates are applied (but before it is marked as ACTIVE): > See > https://github.com/apache/lucene-solr/commit/fe17c813f5fe6773c0527f639b9e5c598b98c7d4#diff-081b7c2242d674bb175b41b6afc21663 > This issue is about adding tests and ensuring that online shard splitting > (while updates are flowing) works reliably. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org