[jira] [Updated] (HDFS-9911) TestDataNodeLifeline Fails intermittently
[ https://issues.apache.org/jira/browse/HDFS-9911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anu Engineer updated HDFS-9911: --- Resolution: Fixed Hadoop Flags: Reviewed Fix Version/s: (was: 2.8.0) 3.0.0-alpha2 Target Version/s: 3.0.0-alpha2 (was: 2.8.0) Status: Resolved (was: Patch Available) [~cnauroth] Thanks for the code review comments. [~linyiqun] Thank you for the contribution. I have committed this to trunk. > TestDataNodeLifeline Fails intermittently > -- > > Key: HDFS-9911 > URL: https://issues.apache.org/jira/browse/HDFS-9911 > Project: Hadoop HDFS > Issue Type: Bug > Components: datanode >Affects Versions: 2.8.0 >Reporter: Anu Engineer >Assignee: Yiqun Lin > Fix For: 3.0.0-alpha2 > > Attachments: HDFS-9911.001.patch, HDFS-9911.002.patch > > > In HDFS-1312 branch, we have a failure for this test. > {{org.apache.hadoop.hdfs.server.datanode.TestDataNodeLifeline.testNoLifelineSentIfHeartbeatsOnTime}} > {noformat} > Error Message > Expect metrics to count no lifeline calls. expected:<0> but was:<1> > Stacktrace > java.lang.AssertionError: Expect metrics to count no lifeline calls. > expected:<0> but was:<1> > at org.junit.Assert.fail(Assert.java:88) > at org.junit.Assert.failNotEquals(Assert.java:743) > at org.junit.Assert.assertEquals(Assert.java:118) > at org.junit.Assert.assertEquals(Assert.java:555) > at > org.apache.hadoop.hdfs.server.datanode.TestDataNodeLifeline.testNoLifelineSentIfHeartbeatsOnTime(TestDataNodeLifeline.java:256) > {noformat} > Details can be found here. > https://builds.apache.org/job/PreCommit-HDFS-Build/14726/testReport/org.apache.hadoop.hdfs.server.datanode/TestDataNodeLifeline/testNoLifelineSentIfHeartbeatsOnTime/ -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-9911) TestDataNodeLifeline Fails intermittently
[ https://issues.apache.org/jira/browse/HDFS-9911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yiqun Lin updated HDFS-9911: Attachment: HDFS-9911.002.patch Thanks [~anu], [~tasanuma0829] and [~cnauroth] for the review! {quote} This would make it consistent with other points in the code where the lifeline time is scheduled relative to the heartbeat time. {quote} Agree with this. New patch attached. > TestDataNodeLifeline Fails intermittently > -- > > Key: HDFS-9911 > URL: https://issues.apache.org/jira/browse/HDFS-9911 > Project: Hadoop HDFS > Issue Type: Bug > Components: datanode >Affects Versions: 2.8.0 >Reporter: Anu Engineer >Assignee: Yiqun Lin > Fix For: 2.8.0 > > Attachments: HDFS-9911.001.patch, HDFS-9911.002.patch > > > In HDFS-1312 branch, we have a failure for this test. > {{org.apache.hadoop.hdfs.server.datanode.TestDataNodeLifeline.testNoLifelineSentIfHeartbeatsOnTime}} > {noformat} > Error Message > Expect metrics to count no lifeline calls. expected:<0> but was:<1> > Stacktrace > java.lang.AssertionError: Expect metrics to count no lifeline calls. > expected:<0> but was:<1> > at org.junit.Assert.fail(Assert.java:88) > at org.junit.Assert.failNotEquals(Assert.java:743) > at org.junit.Assert.assertEquals(Assert.java:118) > at org.junit.Assert.assertEquals(Assert.java:555) > at > org.apache.hadoop.hdfs.server.datanode.TestDataNodeLifeline.testNoLifelineSentIfHeartbeatsOnTime(TestDataNodeLifeline.java:256) > {noformat} > Details can be found here. > https://builds.apache.org/job/PreCommit-HDFS-Build/14726/testReport/org.apache.hadoop.hdfs.server.datanode/TestDataNodeLifeline/testNoLifelineSentIfHeartbeatsOnTime/ -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-9911) TestDataNodeLifeline Fails intermittently
[ https://issues.apache.org/jira/browse/HDFS-9911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris Nauroth updated HDFS-9911: Assignee: Yiqun Lin (was: Chris Nauroth) [~linyiqun], thank you for the analysis and volunteering to take over the patch. I am reassigning this to you. I suggest changing this to: {code} scheduleNextLifeline(nextHeartbeatTime); {code} This would make it consistent with other points in the code where the lifeline time is scheduled relative to the heartbeat time. It could help avoid confusion if 2 separate calls to {{monotonicNow()}} return 2 different timestamps (one for initialization of {{nextHeartbeatTime}} and the other for {{scheduleNextLifeline}}). > TestDataNodeLifeline Fails intermittently > -- > > Key: HDFS-9911 > URL: https://issues.apache.org/jira/browse/HDFS-9911 > Project: Hadoop HDFS > Issue Type: Bug > Components: datanode >Affects Versions: 2.8.0 >Reporter: Anu Engineer >Assignee: Yiqun Lin > Fix For: 2.8.0 > > Attachments: HDFS-9911.001.patch > > > In HDFS-1312 branch, we have a failure for this test. > {{org.apache.hadoop.hdfs.server.datanode.TestDataNodeLifeline.testNoLifelineSentIfHeartbeatsOnTime}} > {noformat} > Error Message > Expect metrics to count no lifeline calls. expected:<0> but was:<1> > Stacktrace > java.lang.AssertionError: Expect metrics to count no lifeline calls. > expected:<0> but was:<1> > at org.junit.Assert.fail(Assert.java:88) > at org.junit.Assert.failNotEquals(Assert.java:743) > at org.junit.Assert.assertEquals(Assert.java:118) > at org.junit.Assert.assertEquals(Assert.java:555) > at > org.apache.hadoop.hdfs.server.datanode.TestDataNodeLifeline.testNoLifelineSentIfHeartbeatsOnTime(TestDataNodeLifeline.java:256) > {noformat} > Details can be found here. > https://builds.apache.org/job/PreCommit-HDFS-Build/14726/testReport/org.apache.hadoop.hdfs.server.datanode/TestDataNodeLifeline/testNoLifelineSentIfHeartbeatsOnTime/ -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-9911) TestDataNodeLifeline Fails intermittently
[ https://issues.apache.org/jira/browse/HDFS-9911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yiqun Lin updated HDFS-9911: Attachment: HDFS-9911.001.patch > TestDataNodeLifeline Fails intermittently > -- > > Key: HDFS-9911 > URL: https://issues.apache.org/jira/browse/HDFS-9911 > Project: Hadoop HDFS > Issue Type: Bug > Components: datanode >Affects Versions: 2.8.0 >Reporter: Anu Engineer >Assignee: Chris Nauroth > Fix For: 2.8.0 > > Attachments: HDFS-9911.001.patch > > > In HDFS-1312 branch, we have a failure for this test. > {{org.apache.hadoop.hdfs.server.datanode.TestDataNodeLifeline.testNoLifelineSentIfHeartbeatsOnTime}} > {noformat} > Error Message > Expect metrics to count no lifeline calls. expected:<0> but was:<1> > Stacktrace > java.lang.AssertionError: Expect metrics to count no lifeline calls. > expected:<0> but was:<1> > at org.junit.Assert.fail(Assert.java:88) > at org.junit.Assert.failNotEquals(Assert.java:743) > at org.junit.Assert.assertEquals(Assert.java:118) > at org.junit.Assert.assertEquals(Assert.java:555) > at > org.apache.hadoop.hdfs.server.datanode.TestDataNodeLifeline.testNoLifelineSentIfHeartbeatsOnTime(TestDataNodeLifeline.java:256) > {noformat} > Details can be found here. > https://builds.apache.org/job/PreCommit-HDFS-Build/14726/testReport/org.apache.hadoop.hdfs.server.datanode/TestDataNodeLifeline/testNoLifelineSentIfHeartbeatsOnTime/ -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-9911) TestDataNodeLifeline Fails intermittently
[ https://issues.apache.org/jira/browse/HDFS-9911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yiqun Lin updated HDFS-9911: Status: Patch Available (was: Open) Post a simple patch here, I hope Chris will not mind that since I see this JIRA has been opened for a long time. Here I have two ways to fix this: * Delay the {{nextLifelineTime}} time and to let heartbeat sended first. * Use {{DataNode#setHeartbeatsDisabledForTests(true)}} to disabled the heratbeat in {{setup}} since this also makes sense to disabled lifeline send. Here I am prefer to use the first way and I think this is enough to fix this. And that's will make the order of sending messages correct. Softly ping [~anu], could you please have a review? Thanks. > TestDataNodeLifeline Fails intermittently > -- > > Key: HDFS-9911 > URL: https://issues.apache.org/jira/browse/HDFS-9911 > Project: Hadoop HDFS > Issue Type: Bug > Components: datanode >Affects Versions: 2.8.0 >Reporter: Anu Engineer >Assignee: Chris Nauroth > Fix For: 2.8.0 > > > In HDFS-1312 branch, we have a failure for this test. > {{org.apache.hadoop.hdfs.server.datanode.TestDataNodeLifeline.testNoLifelineSentIfHeartbeatsOnTime}} > {noformat} > Error Message > Expect metrics to count no lifeline calls. expected:<0> but was:<1> > Stacktrace > java.lang.AssertionError: Expect metrics to count no lifeline calls. > expected:<0> but was:<1> > at org.junit.Assert.fail(Assert.java:88) > at org.junit.Assert.failNotEquals(Assert.java:743) > at org.junit.Assert.assertEquals(Assert.java:118) > at org.junit.Assert.assertEquals(Assert.java:555) > at > org.apache.hadoop.hdfs.server.datanode.TestDataNodeLifeline.testNoLifelineSentIfHeartbeatsOnTime(TestDataNodeLifeline.java:256) > {noformat} > Details can be found here. > https://builds.apache.org/job/PreCommit-HDFS-Build/14726/testReport/org.apache.hadoop.hdfs.server.datanode/TestDataNodeLifeline/testNoLifelineSentIfHeartbeatsOnTime/ -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org
[jira] [Updated] (HDFS-9911) TestDataNodeLifeline Fails intermittently
[ https://issues.apache.org/jira/browse/HDFS-9911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anu Engineer updated HDFS-9911: --- Summary: TestDataNodeLifeline Fails intermittently (was: TestDataNodeLifeline failure in a branch) > TestDataNodeLifeline Fails intermittently > -- > > Key: HDFS-9911 > URL: https://issues.apache.org/jira/browse/HDFS-9911 > Project: Hadoop HDFS > Issue Type: Bug > Components: datanode >Affects Versions: 2.8.0 >Reporter: Anu Engineer >Assignee: Chris Nauroth > Fix For: 2.8.0 > > > In HDFS-1312 branch, we have a failure for this test. > {{org.apache.hadoop.hdfs.server.datanode.TestDataNodeLifeline.testNoLifelineSentIfHeartbeatsOnTime}} > {noformat} > Error Message > Expect metrics to count no lifeline calls. expected:<0> but was:<1> > Stacktrace > java.lang.AssertionError: Expect metrics to count no lifeline calls. > expected:<0> but was:<1> > at org.junit.Assert.fail(Assert.java:88) > at org.junit.Assert.failNotEquals(Assert.java:743) > at org.junit.Assert.assertEquals(Assert.java:118) > at org.junit.Assert.assertEquals(Assert.java:555) > at > org.apache.hadoop.hdfs.server.datanode.TestDataNodeLifeline.testNoLifelineSentIfHeartbeatsOnTime(TestDataNodeLifeline.java:256) > {noformat} > Details can be found here. > https://builds.apache.org/job/PreCommit-HDFS-Build/14726/testReport/org.apache.hadoop.hdfs.server.datanode/TestDataNodeLifeline/testNoLifelineSentIfHeartbeatsOnTime/ -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org