[ 
https://issues.apache.org/jira/browse/SOLR-3180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yonik Seeley updated SOLR-3180:
-------------------------------

    Attachment: fail.130103_105104.txt

Here's a log with some notes on the first timeout.  Chaos monkey decides to 
cause connection loss to ZK at 48 sec.  At 60 sec, two requests start that 
don't seem to finish until 83 sec into the test.  They seem to be blocked in 
zkCheck().

{code}
  2> 48972 T260 oasc.ChaosMonkey.monkeyLog monkey: chose a victim! 42854
  2> 48973 T260 oasc.ChaosMonkey.monkeyLog monkey: expire session for 42854 !
  2> 48975 T260 oasc.ChaosMonkey.monkeyLog monkey: cause connection loss!

 
  2> 52127 T122 C3 P42854 /update 
{distrib.from=http://127.0.0.1:51342/r_/f/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2}
 {delete=[60058 (-1423156803769729024)]} 0 1
  2> 60518 T120 C3 P42854 oasup.LogUpdateProcessor.processAdd PRE_UPDATE ADD 
add{flags=0,_version_=0,id=10063} 
{distrib.from=http://127.0.0.1:51342/r_/f/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2}
  2> 60786 T124 C3 P42854 oasup.LogUpdateProcessor.processDelete PRE_UPDATE 
DELETE delete{flags=0,_version_=-1423156812849348608,id=60059,commitWithin=-1} 
{distrib.from=http://127.0.0.1:51342/r_/f/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2}

  2> 75667 T74 C6 P51342 oasc.Diagnostics.logThreadDumps SEVERE REQUESTING 
THREAD DUMP DUE TO TIMEOUT: Timeout occured while waiting response from server 
at: http://127.0.0.1:42854/r_/f/collection1
[...]
  2>    "qtp1333272771-124" Id=124 TIMED_WAITING
  2>            at java.lang.Thread.sleep(Native Method)
  2>            at 
org.apache.solr.update.processor.DistributedUpdateProcessor.zkCheck(DistributedUpdateProcessor.java:925)
  2>            at 
org.apache.solr.update.processor.DistributedUpdateProcessor.processDelete(DistributedUpdateProcessor.java:699)
  2>            at 
org.apache.solr.update.processor.LogUpdateProcessor.processDelete(LogUpdateProcessor.java:97)
  2>            at 
org.apache.solr.handler.loader.XMLLoader.processDelete(XMLLoader.java:346)
  2>            at 
org.apache.solr.handler.loader.XMLLoader.processUpdate(XMLLoader.java:277)
  2>            at 
org.apache.solr.handler.loader.XMLLoader.load(XMLLoader.java:173)
  2>            at 
org.apache.solr.handler.UpdateRequestHandler$1.load(UpdateRequestHandler.java:92)
[...]
  2>    "qtp1333272771-120" Id=120 TIMED_WAITING
  2>            at java.lang.Thread.sleep(Native Method)
  2>            at 
org.apache.solr.update.processor.DistributedUpdateProcessor.zkCheck(DistributedUpdateProcessor.java:925)
  2>            at 
org.apache.solr.update.processor.DistributedUpdateProcessor.processAdd(DistributedUpdateProcessor.java:330)
  2>            at 
org.apache.solr.update.processor.LogUpdateProcessor.processAdd(LogUpdateProcessor.java:76)
  2>            at 
org.apache.solr.handler.loader.XMLLoader.processUpdate(XMLLoader.java:246)
  2>            at 
org.apache.solr.handler.loader.XMLLoader.load(XMLLoader.java:173)
  2>            at 
org.apache.solr.handler.UpdateRequestHandler$1.load(UpdateRequestHandler.java:92)
  2>            at 
org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:74)

  2> 83174 T124 C3 P42854 PRE_UPDATE FINISH  
{distrib.from=http://127.0.0.1:51342/r_/f/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2}
  2> 83174 T120 C3 P42854 PRE_UPDATE FINISH  
{distrib.from=http://127.0.0.1:51342/r_/f/collection1/&update.distrib=FROMLEADER&wt=javabin&version=2}

{code}
                
> ChaosMonkey test failures
> -------------------------
>
>                 Key: SOLR-3180
>                 URL: https://issues.apache.org/jira/browse/SOLR-3180
>             Project: Solr
>          Issue Type: Bug
>          Components: SolrCloud
>            Reporter: Yonik Seeley
>         Attachments: CMSL_fail1.log, CMSL_hang_2.txt, CMSL_hang.txt, 
> fail.130101_034142.txt, fail.130102_020942.txt, fail.130103_105104.txt, 
> fail.inconsistent.txt, test_report_1.txt
>
>
> Handle intermittent failures in the ChaosMonkey tests.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to