[jira] Commented: (HDFS-1286) Dry entropy pool on Hudson boxes causing test timeouts
[ https://issues.apache.org/jira/browse/HDFS-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12973424#action_12973424 ] Konstantin Boudnik commented on HDFS-1286: -- We haven't seen failures since I have updated the build machines for Apache Hudson. I am going to resolve it as "fixed" if there's no objections. > Dry entropy pool on Hudson boxes causing test timeouts > -- > > Key: HDFS-1286 > URL: https://issues.apache.org/jira/browse/HDFS-1286 > Project: Hadoop HDFS > Issue Type: Task > Components: test >Affects Versions: 0.22.0 >Reporter: Todd Lipcon > Fix For: 0.22.0 > > Attachments: TestFileAppend4.testCompleteOtherLeaseHoldersFile.log, > TestFileAppend4.testRecoverFinalizedBlock.log > > > Some test runs seem to fail with "already locked" errors, though it passes > locally. For example: > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/423/testReport/ > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/421/testReport/ -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (HDFS-1286) Dry entropy pool on Hudson boxes causing test timeouts
[ https://issues.apache.org/jira/browse/HDFS-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12970027#action_12970027 ] Konstantin Boudnik commented on HDFS-1286: -- I have updated hadoop[2-8] with rng-tools (hadoop8 was like that for a few days now and I haven't seen any issues with this particular tests failing on that machine). It should take care of the problem. > Dry entropy pool on Hudson boxes causing test timeouts > -- > > Key: HDFS-1286 > URL: https://issues.apache.org/jira/browse/HDFS-1286 > Project: Hadoop HDFS > Issue Type: Task > Components: test >Affects Versions: 0.22.0 >Reporter: Todd Lipcon > Fix For: 0.22.0 > > Attachments: TestFileAppend4.testCompleteOtherLeaseHoldersFile.log, > TestFileAppend4.testRecoverFinalizedBlock.log > > > Some test runs seem to fail with "already locked" errors, though it passes > locally. For example: > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/423/testReport/ > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/421/testReport/ -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (HDFS-1286) Dry entropy pool on Hudson boxes causing test timeouts
[ https://issues.apache.org/jira/browse/HDFS-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968542#action_12968542 ] Konstantin Boudnik commented on HDFS-1286: -- Apache's Jaunty mirror (minerva.apache.org is supported by y! Ops, I believe) is still down. I have manually installed rng-tools from Karmic repository. The entropy level is is 6-30 times higher than initially. Let's see if this fix the problem with the test. > Dry entropy pool on Hudson boxes causing test timeouts > -- > > Key: HDFS-1286 > URL: https://issues.apache.org/jira/browse/HDFS-1286 > Project: Hadoop HDFS > Issue Type: Task > Components: test >Affects Versions: 0.22.0 >Reporter: Todd Lipcon > Fix For: 0.22.0 > > Attachments: TestFileAppend4.testCompleteOtherLeaseHoldersFile.log, > TestFileAppend4.testRecoverFinalizedBlock.log > > > Some test runs seem to fail with "already locked" errors, though it passes > locally. For example: > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/423/testReport/ > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/421/testReport/ -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (HDFS-1286) Dry entropy pool on Hudson boxes causing test timeouts
[ https://issues.apache.org/jira/browse/HDFS-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968501#action_12968501 ] Konstantin Boudnik commented on HDFS-1286: -- Answering Konstantin's point above bq. On the other hand we should also avoid using random data in tests. the randomization is coming from jetty, not from our tests, I think. > Dry entropy pool on Hudson boxes causing test timeouts > -- > > Key: HDFS-1286 > URL: https://issues.apache.org/jira/browse/HDFS-1286 > Project: Hadoop HDFS > Issue Type: Task > Components: test >Affects Versions: 0.22.0 >Reporter: Todd Lipcon > Fix For: 0.22.0 > > Attachments: TestFileAppend4.testCompleteOtherLeaseHoldersFile.log, > TestFileAppend4.testRecoverFinalizedBlock.log > > > Some test runs seem to fail with "already locked" errors, though it passes > locally. For example: > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/423/testReport/ > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/421/testReport/ -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (HDFS-1286) Dry entropy pool on Hudson boxes causing test timeouts
[ https://issues.apache.org/jira/browse/HDFS-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968500#action_12968500 ] Konstantin Boudnik commented on HDFS-1286: -- Yes, you are exactly right. > Dry entropy pool on Hudson boxes causing test timeouts > -- > > Key: HDFS-1286 > URL: https://issues.apache.org/jira/browse/HDFS-1286 > Project: Hadoop HDFS > Issue Type: Task > Components: test >Affects Versions: 0.22.0 >Reporter: Todd Lipcon > Fix For: 0.22.0 > > Attachments: TestFileAppend4.testCompleteOtherLeaseHoldersFile.log, > TestFileAppend4.testRecoverFinalizedBlock.log > > > Some test runs seem to fail with "already locked" errors, though it passes > locally. For example: > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/423/testReport/ > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/421/testReport/ -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (HDFS-1286) Dry entropy pool on Hudson boxes causing test timeouts
[ https://issues.apache.org/jira/browse/HDFS-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968453#action_12968453 ] Todd Lipcon commented on HDFS-1286: --- It's tough to catch this interactively, because when you're sshed in, you end up sending lots of network packets, etc, which feed the entropy pool. > Dry entropy pool on Hudson boxes causing test timeouts > -- > > Key: HDFS-1286 > URL: https://issues.apache.org/jira/browse/HDFS-1286 > Project: Hadoop HDFS > Issue Type: Task > Components: test >Affects Versions: 0.22.0 >Reporter: Todd Lipcon > Fix For: 0.22.0 > > Attachments: TestFileAppend4.testCompleteOtherLeaseHoldersFile.log, > TestFileAppend4.testRecoverFinalizedBlock.log > > > Some test runs seem to fail with "already locked" errors, though it passes > locally. For example: > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/423/testReport/ > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/421/testReport/ -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (HDFS-1286) Dry entropy pool on Hudson boxes causing test timeouts
[ https://issues.apache.org/jira/browse/HDFS-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968443#action_12968443 ] Konstantin Boudnik commented on HDFS-1286: -- I have took a look at h8.grid.sp2.yahoo.net (hadoop8) build machine and it doesn't have rng-tools package on it. However, running TestFileAppend4 manually produces successful result every time. I am trying to get rng-tools on that machine to see if it will produce expected result in Hudson build, but apache's Ubuntu mirror is down at the moment. > Dry entropy pool on Hudson boxes causing test timeouts > -- > > Key: HDFS-1286 > URL: https://issues.apache.org/jira/browse/HDFS-1286 > Project: Hadoop HDFS > Issue Type: Task > Components: test >Affects Versions: 0.22.0 >Reporter: Todd Lipcon > Fix For: 0.22.0 > > Attachments: TestFileAppend4.testCompleteOtherLeaseHoldersFile.log, > TestFileAppend4.testRecoverFinalizedBlock.log > > > Some test runs seem to fail with "already locked" errors, though it passes > locally. For example: > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/423/testReport/ > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/421/testReport/ -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (HDFS-1286) Dry entropy pool on Hudson boxes causing test timeouts
[ https://issues.apache.org/jira/browse/HDFS-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12890014#action_12890014 ] Konstantin Shvachko commented on HDFS-1286: --- Interesting catch. We should ask somebody to look at Hudson machines. On the other hand we should also avoid using random data in tests. Tests should be reproducible by definition. Using random bytes does not achieve this goal. So replacing random with sequential sounds like the right direction to me. > Dry entropy pool on Hudson boxes causing test timeouts > -- > > Key: HDFS-1286 > URL: https://issues.apache.org/jira/browse/HDFS-1286 > Project: Hadoop HDFS > Issue Type: Task > Components: test >Affects Versions: 0.22.0 >Reporter: Todd Lipcon > Fix For: 0.22.0 > > Attachments: TestFileAppend4.testCompleteOtherLeaseHoldersFile.log, > TestFileAppend4.testRecoverFinalizedBlock.log > > > Some test runs seem to fail with "already locked" errors, though it passes > locally. For example: > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/423/testReport/ > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/421/testReport/ -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.
[jira] Commented: (HDFS-1286) Dry entropy pool on Hudson boxes causing test timeouts
[ https://issues.apache.org/jira/browse/HDFS-1286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12886508#action_12886508 ] Todd Lipcon commented on HDFS-1286: --- Alternatively, we could probably change the tests to use sequential or other pseudo-random bytes, but the entropy thing caused lots of spurious test timeouts for us in the past, so fixing Hudson's probably worth it. > Dry entropy pool on Hudson boxes causing test timeouts > -- > > Key: HDFS-1286 > URL: https://issues.apache.org/jira/browse/HDFS-1286 > Project: Hadoop HDFS > Issue Type: Task > Components: test >Affects Versions: 0.22.0 >Reporter: Todd Lipcon > Fix For: 0.22.0 > > Attachments: TestFileAppend4.testCompleteOtherLeaseHoldersFile.log, > TestFileAppend4.testRecoverFinalizedBlock.log > > > Some test runs seem to fail with "already locked" errors, though it passes > locally. For example: > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/423/testReport/ > http://hudson.zones.apache.org/hudson/job/Hdfs-Patch-h5.grid.sp2.yahoo.net/421/testReport/ -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.