[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest
[ https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hongchao Deng updated ZOOKEEPER-1870: - Affects Version/s: (was: 3.5.0) 3.4.6 flakey test in StandaloneDisabledTest.startSingleServerTest --- Key: ZOOKEEPER-1870 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870 Project: ZooKeeper Issue Type: Bug Components: tests Affects Versions: 3.4.6 Reporter: Patrick Hunt Assignee: Helen Hastings Priority: Blocker Fix For: 3.5.0 Attachments: ZOOKEEPER-1870.patch, ZOOKEEPER-1870.patch, ZOOKEEPER-1870.patch, test.log I'm seeing lots of the following failure. Seems like a flakey test (passes every so often). {noformat} junit.framework.AssertionFailedError: client could not connect to reestablished quorum: giving up after 30+ seconds. at org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143) at org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75) at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52) {noformat} I've found 3 problems: 1. QuorumCnxManager.Listener.run() leaks the socket depending on when the shutdown flag gets set. 2. QuorumCnxManager.halt() doesn't wait for the listener to terminate. 3. QuorumPeer.shuttingDownLE flag doesn't get reset when restarting the leader election. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest
[ https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Flavio Junqueira updated ZOOKEEPER-1870: Attachment: ZOOKEEPER-1870.patch I've made a minor change to the patch... We are missing the 3.4 patch, that's why I haven't resolved this issue. flakey test in StandaloneDisabledTest.startSingleServerTest --- Key: ZOOKEEPER-1870 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870 Project: ZooKeeper Issue Type: Bug Components: tests Affects Versions: 3.5.0 Reporter: Patrick Hunt Assignee: Helen Hastings Priority: Blocker Fix For: 3.5.0 Attachments: ZOOKEEPER-1870.patch, ZOOKEEPER-1870.patch, ZOOKEEPER-1870.patch, test.log I'm seeing lots of the following failure. Seems like a flakey test (passes every so often). {noformat} junit.framework.AssertionFailedError: client could not connect to reestablished quorum: giving up after 30+ seconds. at org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143) at org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75) at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52) {noformat} I've found 3 problems: 1. QuorumCnxManager.Listener.run() leaks the socket depending on when the shutdown flag gets set. 2. QuorumCnxManager.halt() doesn't wait for the listener to terminate. 3. QuorumPeer.shuttingDownLE flag doesn't get reset when restarting the leader election. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest
[ https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michi Mutsuzaki updated ZOOKEEPER-1870: --- Attachment: ZOOKEEPER-1870.patch 3 additional changes: - Reset proposedLeader to -1 in FastLeaderElection.shutdown(). - Get out of the WorkerReceiver.run() loop after calling self.getElectionAlg().shutdown(). - Make FastLeaderElection.getVote() public for unit test. Let me know if making this method public is ok with you guys. flakey test in StandaloneDisabledTest.startSingleServerTest --- Key: ZOOKEEPER-1870 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870 Project: ZooKeeper Issue Type: Bug Components: tests Affects Versions: 3.5.0 Reporter: Patrick Hunt Assignee: Helen Hastings Priority: Critical Attachments: ZOOKEEPER-1870.patch, ZOOKEEPER-1870.patch, test.log I'm seeing lots of the following failure. Seems like a flakey test (passes every so often). {noformat} junit.framework.AssertionFailedError: client could not connect to reestablished quorum: giving up after 30+ seconds. at org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143) at org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75) at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52) {noformat} I've found 3 problems: 1. QuorumCnxManager.Listener.run() leaks the socket depending on when the shutdown flag gets set. 2. QuorumCnxManager.halt() doesn't wait for the listener to terminate. 3. QuorumPeer.shuttingDownLE flag doesn't get reset when restarting the leader election. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest
[ https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michi Mutsuzaki updated ZOOKEEPER-1870: --- Priority: Blocker (was: Critical) flakey test in StandaloneDisabledTest.startSingleServerTest --- Key: ZOOKEEPER-1870 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870 Project: ZooKeeper Issue Type: Bug Components: tests Affects Versions: 3.5.0 Reporter: Patrick Hunt Assignee: Helen Hastings Priority: Blocker Attachments: ZOOKEEPER-1870.patch, ZOOKEEPER-1870.patch, test.log I'm seeing lots of the following failure. Seems like a flakey test (passes every so often). {noformat} junit.framework.AssertionFailedError: client could not connect to reestablished quorum: giving up after 30+ seconds. at org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143) at org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75) at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52) {noformat} I've found 3 problems: 1. QuorumCnxManager.Listener.run() leaks the socket depending on when the shutdown flag gets set. 2. QuorumCnxManager.halt() doesn't wait for the listener to terminate. 3. QuorumPeer.shuttingDownLE flag doesn't get reset when restarting the leader election. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest
[ https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raul Gutierrez Segales updated ZOOKEEPER-1870: -- Attachment: test.log Hi Michi, Platform is Fedora Linux, with 3.13 Kernel on x86_64: {noformat} $ uname -r 3.13.6-200.fc20.x86_64 $ arch x86_64 $ cat /etc/fedora-release Fedora release 20 (Heisenbug) {noformat} This is mostly out of trunk plus some other patches that I had (but mostly unrelated). I'll run again out of pure trunk. flakey test in StandaloneDisabledTest.startSingleServerTest --- Key: ZOOKEEPER-1870 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870 Project: ZooKeeper Issue Type: Bug Components: tests Affects Versions: 3.5.0 Reporter: Patrick Hunt Assignee: Helen Hastings Priority: Critical Attachments: ZOOKEEPER-1870.patch, test.log I'm seeing lots of the following failure. Seems like a flakey test (passes every so often). {noformat} junit.framework.AssertionFailedError: client could not connect to reestablished quorum: giving up after 30+ seconds. at org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143) at org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75) at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52) {noformat} I've found 3 problems: 1. QuorumCnxManager.Listener.run() leaks the socket depending on when the shutdown flag gets set. 2. QuorumCnxManager.halt() doesn't wait for the listener to terminate. 3. QuorumPeer.shuttingDownLE flag doesn't get reset when restarting the leader election. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest
[ https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michi Mutsuzaki updated ZOOKEEPER-1870: --- Description: I'm seeing lots of the following failure. Seems like a flakey test (passes every so often). {noformat} junit.framework.AssertionFailedError: client could not connect to reestablished quorum: giving up after 30+ seconds. at org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143) at org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75) at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52) {noformat} I've found 3 problems that 1. Prevent socket leak in QuorumCnxManager.Listener.run(). 2. Make QuorumCnxManager.halt() join the listener. 3. Reset QuorumPeer.shuttingDownLE flag when restarting the leader election. was: I'm seeing lots of the following failure. Seems like a flakey test (passes every so often). {noformat} junit.framework.AssertionFailedError: client could not connect to reestablished quorum: giving up after 30+ seconds. at org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143) at org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75) at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52) {noformat} flakey test in StandaloneDisabledTest.startSingleServerTest --- Key: ZOOKEEPER-1870 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870 Project: ZooKeeper Issue Type: Bug Components: tests Affects Versions: 3.5.0 Reporter: Patrick Hunt Assignee: Helen Hastings Priority: Critical Attachments: ZOOKEEPER-1870.patch I'm seeing lots of the following failure. Seems like a flakey test (passes every so often). {noformat} junit.framework.AssertionFailedError: client could not connect to reestablished quorum: giving up after 30+ seconds. at org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143) at org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75) at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52) {noformat} I've found 3 problems that 1. Prevent socket leak in QuorumCnxManager.Listener.run(). 2. Make QuorumCnxManager.halt() join the listener. 3. Reset QuorumPeer.shuttingDownLE flag when restarting the leader election. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest
[ https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michi Mutsuzaki updated ZOOKEEPER-1870: --- Description: I'm seeing lots of the following failure. Seems like a flakey test (passes every so often). {noformat} junit.framework.AssertionFailedError: client could not connect to reestablished quorum: giving up after 30+ seconds. at org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143) at org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75) at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52) {noformat} I've found 3 problems: 1. QuorumCnxManager.Listener.run() leaks the socket depending on when the shutdown flag gets set. 2. QuorumCnxManager.halt() doesn't wait for the listener to terminate. 3. QuorumPeer.shuttingDownLE flag doesn't get reset when restarting the leader election. was: I'm seeing lots of the following failure. Seems like a flakey test (passes every so often). {noformat} junit.framework.AssertionFailedError: client could not connect to reestablished quorum: giving up after 30+ seconds. at org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143) at org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75) at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52) {noformat} I've found 3 problems that 1. Prevent socket leak in QuorumCnxManager.Listener.run(). 2. Make QuorumCnxManager.halt() join the listener. 3. Reset QuorumPeer.shuttingDownLE flag when restarting the leader election. flakey test in StandaloneDisabledTest.startSingleServerTest --- Key: ZOOKEEPER-1870 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870 Project: ZooKeeper Issue Type: Bug Components: tests Affects Versions: 3.5.0 Reporter: Patrick Hunt Assignee: Helen Hastings Priority: Critical Attachments: ZOOKEEPER-1870.patch I'm seeing lots of the following failure. Seems like a flakey test (passes every so often). {noformat} junit.framework.AssertionFailedError: client could not connect to reestablished quorum: giving up after 30+ seconds. at org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143) at org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75) at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52) {noformat} I've found 3 problems: 1. QuorumCnxManager.Listener.run() leaks the socket depending on when the shutdown flag gets set. 2. QuorumCnxManager.halt() doesn't wait for the listener to terminate. 3. QuorumPeer.shuttingDownLE flag doesn't get reset when restarting the leader election. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest
[ https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michi Mutsuzaki updated ZOOKEEPER-1870: --- Attachment: ZOOKEEPER-1870.patch flakey test in StandaloneDisabledTest.startSingleServerTest --- Key: ZOOKEEPER-1870 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870 Project: ZooKeeper Issue Type: Bug Components: tests Affects Versions: 3.5.0 Reporter: Patrick Hunt Assignee: Helen Hastings Priority: Critical Attachments: ZOOKEEPER-1870.patch I'm seeing lots of the following failure. Seems like a flakey test (passes every so often). {noformat} junit.framework.AssertionFailedError: client could not connect to reestablished quorum: giving up after 30+ seconds. at org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143) at org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75) at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52) {noformat} -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest
[ https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michi Mutsuzaki updated ZOOKEEPER-1870: --- Assignee: Helen Hastings (was: Michi Mutsuzaki) flakey test in StandaloneDisabledTest.startSingleServerTest --- Key: ZOOKEEPER-1870 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870 Project: ZooKeeper Issue Type: Bug Components: tests Affects Versions: 3.5.0 Reporter: Patrick Hunt Assignee: Helen Hastings Priority: Critical I'm seeing lots of the following failure. Seems like a flakey test (passes every so often). {noformat} junit.framework.AssertionFailedError: client could not connect to reestablished quorum: giving up after 30+ seconds. at org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143) at org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75) at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52) {noformat} -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest
[ https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michi Mutsuzaki updated ZOOKEEPER-1870: --- Assignee: Helen Hastings (was: Alexander Shraer) flakey test in StandaloneDisabledTest.startSingleServerTest --- Key: ZOOKEEPER-1870 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870 Project: ZooKeeper Issue Type: Bug Components: tests Affects Versions: 3.5.0 Reporter: Patrick Hunt Assignee: Helen Hastings Priority: Critical I'm seeing lots of the following failure. Seems like a flakey test (passes every so often). {noformat} junit.framework.AssertionFailedError: client could not connect to reestablished quorum: giving up after 30+ seconds. at org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143) at org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75) at org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52) {noformat} -- This message was sent by Atlassian JIRA (v6.1.5#6160)