[ https://issues.apache.org/jira/browse/IMPALA-9340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sahil Takiar updated IMPALA-9340: --------------------------------- Summary: statestore_max_missed_heartbeats is off by one (was: Statestore statestore_max_missed_heartbeats is off by one) > statestore_max_missed_heartbeats is off by one > ---------------------------------------------- > > Key: IMPALA-9340 > URL: https://issues.apache.org/jira/browse/IMPALA-9340 > Project: IMPALA > Issue Type: Bug > Components: Backend > Reporter: Sahil Takiar > Priority: Minor > Labels: newbie, ramp-up > > The flag {{statestore_max_missed_heartbeats}} says: > {quote}Maximum number of consecutiveĀ heartbeat messages an impalad can miss > before being declared failed by theĀ statestore. > {quote} > However, the implementation actually waits for > {{statestore_max_missed_heartbeats}} + 1 missed heartbeats before considering > the impalad as failed. > Example when {{statestore_max_missed_heartbeats}} is set to 10 (the default > value): > {code:java} > logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 > 10:58:04.214053 29932 failure-detector.cc:90] 1 consecutive heartbeats > failed for > 'impa...@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. > State is OK > logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 > 10:58:04.267143 29937 failure-detector.cc:90] 2 consecutive heartbeats > failed for > 'impa...@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. > State is OK > logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 > 10:58:04.320443 29938 failure-detector.cc:90] 3 consecutive heartbeats > failed for > 'impa...@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. > State is OK > logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 > 10:58:04.373548 29934 failure-detector.cc:90] 4 consecutive heartbeats > failed for > 'impa...@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. > State is OK > logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 > 10:58:04.426955 29929 failure-detector.cc:90] 5 consecutive heartbeats > failed for > 'impa...@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. > State is OK > logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 > 10:58:04.479981 29933 failure-detector.cc:90] 6 consecutive heartbeats > failed for > 'impa...@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. > State is SUSPECTED > logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 > 10:58:04.533097 29930 failure-detector.cc:90] 7 consecutive heartbeats > failed for > 'impa...@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. > State is SUSPECTED > logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 > 10:58:04.586172 29934 failure-detector.cc:90] 8 consecutive heartbeats > failed for > 'impa...@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. > State is SUSPECTED > logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 > 10:58:04.639999 29936 failure-detector.cc:90] 9 consecutive heartbeats > failed for > 'impa...@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. > State is SUSPECTED > logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 > 10:58:04.692075 29929 failure-detector.cc:90] 10 consecutive heartbeats > failed for > 'impa...@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. > State is SUSPECTED > logs/custom_cluster_tests/statestored.impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com.jenkins.log.INFO.20200128-105531.29877:I0128 > 10:58:04.745105 29931 failure-detector.cc:90] 11 consecutive heartbeats > failed for > 'impa...@impala-ec2-centos74-m5-4xlarge-ondemand-09f9.vpc.cloudera.com:22002'. > State is FAILED {code} -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org For additional commands, e-mail: issues-all-h...@impala.apache.org