Alexey Serbin has posted comments on this change.

Change subject: [tests] de-flaking catalog_manager_tsk-itest
......................................................................


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/8017/1//COMMIT_MSG
Commit Message:

> Did you try looping this test with the recent failure-detection change (htt
OK, here the result running without and with 21b0f3d5 changelist:

Without the changelist (HEAD is at c8e04077), --stress-cpu-threads=16, 0/1024 
failed:
  http://dist-test.cloudera.org//job?job_id=aserbin.150515
9852.20744

With the changelist (HEAD is at 21b0f3d5), --stress-cpu-threads=16, at least 
7/1024 failed:
  http://dist-test.cloudera.org//job?job_id=aserbin.1505160964.1750


Could you clarify on what do you want to address in this regard?

As I understand, the test was built to induce many re-elections among masters, 
and the parameters were set so the process was converging more or less in the 
specified timeout intervals.  With the new way of sending heartbeats and doing 
master failure detection, it seems the masters sometimes were not fast enough 
to handle Raft HBs as fast as they used to be.  But it's all about 'boundary' 
conditions, as I understand.


-- 
To view, visit http://gerrit.cloudera.org:8080/8017
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-MessageType: comment
Gerrit-Change-Id: I50cee27a579cffa7232137c7039b02a1ad4ab7eb
Gerrit-PatchSet: 1
Gerrit-Project: kudu
Gerrit-Branch: master
Gerrit-Owner: Alexey Serbin <aser...@cloudera.com>
Gerrit-Reviewer: Adar Dembo <a...@cloudera.com>
Gerrit-Reviewer: Alexey Serbin <aser...@cloudera.com>
Gerrit-Reviewer: Andrew Wong <aw...@cloudera.com>
Gerrit-Reviewer: Kudu Jenkins
Gerrit-HasComments: Yes

Reply via email to