[ https://issues.apache.org/jira/browse/KUDU-3687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18013757#comment-18013757 ]

ASF subversion and git services commented on KUDU-3687:
-------------------------------------------------------

Commit 33999074d088e974dbfdb3501a6600fddce9a6da in kudu's branch 
refs/heads/master from Alexey Serbin
[ https://gitbox.apache.org/repos/asf?p=kudu.git;h=33999074d ]

KUDU-3687 higher timeout for WAL segment count in heavy workloads

As it turns out, even with the increased default timeout parameter for
WaitForMinFilesInTabletWalDirOnTS() in [1], a couple of new tests added
after that [2] are failing from time to time under TSAN due to heavy
workloads.  This changelist addresses the issue, increasing the timeout
argument for WaitForMinFilesInTabletWalDirOnTS() in a couple of
particular call sites from 60 up to 120 seconds.

[1] http://gerrit.cloudera.org:8080/1793
[2] http://gerrit.cloudera.org:8080/16726

Change-Id: I68f30bc8b1c59843b1485bd0924ddd51d144eff1
Reviewed-on: http://gerrit.cloudera.org:8080/23286
Reviewed-by: Gabriella Lotz <[email protected]>
Tested-by: Alexey Serbin <[email protected]>
Reviewed-by: Abhishek Chennaka <[email protected]>


> TabletCopyITest.TestDownloadWalInParallelWithHeavyUpdate fails from time to time
> --------------------------------------------------------------------------------
>
>                 Key: KUDU-3687
>                 URL: https://issues.apache.org/jira/browse/KUDU-3687
>             Project: Kudu
>          Issue Type: Bug
>          Components: tablet copy, test
>            Reporter: Alexey Serbin
>            Priority: Major
>         Attachments: tablet_copy-itest.0.txt.xz
>
>
> The {{TestDownloadWalInParallelWithHeavyUpdate}} scenario of the 
> {{TabletCopyITest}} sometimes fails, at least in TSAN builds:
> {noformat}
> src/kudu/integration-tests/tablet_copy-itest.cc:2151: Failure
> Failed
> Bad status: Timed out: Timed out waiting for number of WAL segments on tablet 451669bc2c6445f98b2e853a46f14a4e on TS 0 to be 6. Found 5
> I20250811 20:08:03.739187 24196 external_mini_cluster-itest-base.cc:80] Found fatal failure
> {noformat}
> The full log is attached (with useless stack traces clipped).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
