[ https://issues.apache.org/jira/browse/SOLR-3180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Yonik Seeley updated SOLR-3180: ------------------------------- Attachment: fail.130101_034142.txt With the logging fixed, I can see more of what the problem is. It appears if two updates for document 50030 are happening simultaneously, which shouldn't happen since the tests don't do this. {code} 2> 65234 T28 C12 P56307 oasu.DirectUpdateHandler2.checkDocument LOCAL_ADD: id=50030 version=1422948614358958080 2> 65235 T28 C12 P56307 /update {wt=javabin&version=2} {add=[50030 (1422948614358958080)]} 0 2 2> 65242 T61 C4 P44328 oasu.DirectUpdateHandler2.checkDocument LOCAL_ADD: id=50030 version=1422948614366298112 2> 65245 T103 C1 P59742 oasu.DirectUpdateHandler2.checkDocument LOCAL_ADD: id=50030 version=1422948614366298112 2> 65246 T160 C7 P51177 oasu.DirectUpdateHandler2.checkDocument LOCAL_ADD: id=50030 version=1422948614366298112 2> 65247 T103 C1 P59742 /update {distrib.from=http://127.0.0.1:44328/fa/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2} {add=[50030 (1422948614366298112)]} 0 2 2> 65247 T160 C7 P51177 /update {distrib.from=http://127.0.0.1:44328/fa/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2} {add=[50030 (1422948614366298112)]} 0 2 2> 80763 T59 C4 P44328 oasu.DirectUpdateHandler2.checkDocument LOCAL_ADD: id=50030 version=1422948630641246208 2> 80773 T158 C7 P51177 oasu.DirectUpdateHandler2.checkDocument LOCAL_ADD: id=50030 version=1422948630641246208 2> 80776 T158 C7 P51177 /update {distrib.from=http://127.0.0.1:44328/fa/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2} {add=[50030 (1422948630641246208)]} 0 5 2> 85279 T61 C4 P44328 /update {distrib.from=http://127.0.0.1:36869/fa/collection1/&update.distrib=TOLEADER&wt=javabin&version=2} {add=[50030 (1422948614366298112)]} 0 20038 2> 87884 T213 C10 P42076 oasu.DirectUpdateHandler2.checkDocument LOCAL_ADD: id=50030 version=1422948630641246208 2> 87885 T213 C10 P42076 /update {distrib.from=http://127.0.0.1:44328/fa/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2} {add=[50030 (1422948630641246208)]} 0 7116 2> 87973 T216 C10 P42076 /update {distrib.from=http://127.0.0.1:44328/fa/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2} {add=[50030 (1422948614366298112)]} 0 22728 2> 95272 T61 C4 P44328 oasu.DirectUpdateHandler2.checkDocument LOCAL_ADD: id=50030 version=1422948645855035392 2> 95277 T158 C7 P51177 oasu.DirectUpdateHandler2.checkDocument LOCAL_ADD: id=50030 version=1422948645855035392 2> 95281 T158 C7 P51177 /update {distrib.from=http://127.0.0.1:44328/fa/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2} {add=[50030 (1422948645855035392)]} 0 5 2> 95283 T61 C4 P44328 /update {wt=javabin&version=2} {add=[50030 (1422948645855035392)]} 0 12 2> 95326 T27 C12 P56307 /update {wt=javabin&version=2} {delete=[50030 (-1422948645912707072)]} 0 0 2> 95336 T159 C7 P51177 /update {distrib.from=http://127.0.0.1:44328/fa/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2} {delete=[50030 (-1422948645918998528)]} 0 1 2> 95338 T61 C4 P44328 /update {wt=javabin&version=2} {delete=[50030 (-1422948645918998528)]} 0 7 2> 96276 T475 C4 P44328 oasu.DirectUpdateHandler2.checkDocument LOCAL_ADD: id=50030 version=1422948646907805696 2> 96286 T156 C7 P51177 oasu.DirectUpdateHandler2.checkDocument LOCAL_ADD: id=50030 version=1422948646907805696 2> 96290 T156 C7 P51177 /update {distrib.from=http://127.0.0.1:44328/fa/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2} {add=[50030 (1422948646907805696)]} 0 5 2> 96292 T475 C4 P44328 /update {distrib.from=http://127.0.0.1:36869/fa/collection1/&update.distrib=TOLEADER&wt=javabin&version=2} {add=[50030 (1422948646907805696)]} 0 18 2> 96293 T77 C6 P36869 /update {wt=javabin&version=2} {add=[50030]} 0 31054 2> 97738 T59 C4 P44328 /update {distrib.from=http://127.0.0.1:36869/fa/collection1/&update.distrib=TOLEADER&wt=javabin&version=2} {add=[50030 (1422948630641246208)]} 0 16977 2> 98396 T103 C1 P59742 oasu.DirectUpdateHandler2.checkDocument LOCAL_ADD: id=50030 version=1422948630641246208 2> 98397 T103 C1 P59742 /update {distrib.from=http://127.0.0.1:44328/fa/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2} {add=[50030 (1422948630641246208)]} 0 17627 2> 105349 T488 C1 P59742 oasu.PeerSync.handleUpdates FINE PeerSync: core=collection1 url=http://127.0.0.1:59742/fa raw update record [1, 1422948645855035392, SolrInputDocument[id=50030, a_si=50, other_tl1=50, a_t=to come to the aid of their country., rnd_b=true, _version_=1422948645855035392]] 2> 105350 T488 C1 P59742 oasu.PeerSync.handleUpdates FINE PeerSync: core=collection1 url=http://127.0.0.1:59742/fa add add{flags=12,_version_=1422948645855035392,id=50030} 2> 105351 T488 C1 P59742 oasu.DirectUpdateHandler2.checkDocument LOCAL_ADD: id=50030 version=1422948645855035392 2> 105356 T488 C1 P59742 oasu.PeerSync.handleUpdates FINE PeerSync: core=collection1 url=http://127.0.0.1:59742/fa raw update record [2, -1422948645918998528, [B@3b76982e] 2> 105356 T488 C1 P59742 oasu.PeerSync.handleUpdates FINE PeerSync: core=collection1 url=http://127.0.0.1:59742/fa delete delete{flags=12,_version_=-1422948645918998528,indexedId=50030,commitWithin=-1} 2> 105356 T488 C1 P59742 oasu.PeerSync.handleUpdates FINE PeerSync: core=collection1 url=http://127.0.0.1:59742/fa raw update record [1, 1422948646907805696, SolrInputDocument[id=50030, a_si=50, other_tl1=50, a_t=to come to the aid of their country., rnd_b=true, _version_=1422948646907805696]] 2> 105356 T488 C1 P59742 oasu.PeerSync.handleUpdates FINE PeerSync: core=collection1 url=http://127.0.0.1:59742/fa add add{flags=12,_version_=1422948646907805696,id=50030} 2> ###### Only in cloudDocList: [{id=50030}] 2> 203443 T10 oasc.AbstractFullDistribZkTestBase.checkShardConsistency SEVERE controlClient :{numFound=0,start=0,docs=[]} 2> cloudClient :{numFound=1,start=0,docs=[SolrDocument{id=50030, _version_=1422948646907805696}]} {code} As usual C12 is the control. You can see an add complete on C4 and on C6 {code} 2> 95283 T61 C4 P44328 /update {wt=javabin&version=2} {add=[50030 (1422948645855035392)]} 0 12 2> 96293 T77 C6 P36869 /update {wt=javabin&version=2} {add=[50030]} 0 31054 {code} > ChaosMonkey test failures > ------------------------- > > Key: SOLR-3180 > URL: https://issues.apache.org/jira/browse/SOLR-3180 > Project: Solr > Issue Type: Bug > Components: SolrCloud > Reporter: Yonik Seeley > Attachments: CMSL_fail1.log, CMSL_hang_2.txt, CMSL_hang.txt, > fail.130101_034142.txt, fail.inconsistent.txt, test_report_1.txt > > > Handle intermittent failures in the ChaosMonkey tests. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org