[ 
https://issues.apache.org/jira/browse/SOLR-13060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16722108#comment-16722108
 ] 

Dawid Weiss edited comment on SOLR-13060 at 12/15/18 11:01 AM:
---------------------------------------------------------------

Hi [~steve_rowe]. Just FYI: the upgrade of randomizedtesting does fix the suite 
timeout problem (I just tested it on by running SOLR-13074 with a suite timeout 
of 10 seconds...). I think one hour is very generous  for the sysout loop in 
SOLR-13074, so it'll be enough to fill the disk anyway. I'll work on truncating 
sysouts up to at most 1 gig, test it on that SOLR-13074, then maybe to fix the 
underlying cause of leaking threads. 

Until this is solved, I don't think it makes sense to run hdfs tests at all -- 
they will hang and fill up disk space on jenkins.


was (Author: dweiss):
Hi [~steve_rowe]. Just FYU: the upgrade of randomizedtesting does fix the suite 
timeout problem (I just tested it on by running SOLR-13074 with a suite timeout 
of 10 seconds...). I think one hour is very generous  for the sysout loop in 
SOLR-13074, so it'll be enough to fill the disk anyway. I'll work on truncating 
sysouts up to at most 1 gig, test it on that SOLR-13074, then maybe to fix the 
underlying cause of leaking threads. 

Until this is solved, I don't think it makes sense to run hdfs tests at all -- 
they will hang and fill up disk space on jenkins.

> Some Nightly HDFS tests never terminate on ASF Jenkins, triggering whole-job 
> timeout, causing Jenkins to kill JVMs, causing dump files to be created that 
> fill all disk space, causing failure of all following jobs on the same node
> -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-13060
>                 URL: https://issues.apache.org/jira/browse/SOLR-13060
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: Tests
>            Reporter: Steve Rowe
>            Priority: Major
>         Attachments: 
> junit4-J0-20181210_065854_4175881849742830327151.spill.part1.gz
>
>
> The 3 tests that are affected: 
> * HdfsAutoAddReplicasIntegrationTest
> * HdfsCollectionsAPIDistributedZkTest
> * MoveReplicaHDFSTest 
> Instances from the dev list:
> 12/1: 
> https://lists.apache.org/thread.html/e04ad0f9113e15f77393ccc26e3505e3090783b1d61bd1c7ff03d33e@%3Cdev.lucene.apache.org%3E
> 12/5: 
> https://lists.apache.org/thread.html/d78c99255abfb5134803c2b77664c1a039d741f92d6e6fcbcc66cd14@%3Cdev.lucene.apache.org%3E
> 12/8: 
> https://lists.apache.org/thread.html/92ad03795ae60b1e94859d49c07740ca303f997ae2532e6f079acfb4@%3Cdev.lucene.apache.org%3E
> 12/8: 
> https://lists.apache.org/thread.html/26aace512bce0b51c4157e67ac3120f93a99905b40040bee26472097@%3Cdev.lucene.apache.org%3E
> 12/11: 
> https://lists.apache.org/thread.html/33558a8dd292fd966a7f476bf345b66905d99f7eb9779a4d17b7ec97@%3Cdev.lucene.apache.org%3E



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to