[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest

2014-08-28 Thread Hongchao Deng (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hongchao Deng updated ZOOKEEPER-1870:
-

Affects Version/s: (was: 3.5.0)
   3.4.6

 flakey test in StandaloneDisabledTest.startSingleServerTest
 ---

 Key: ZOOKEEPER-1870
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870
 Project: ZooKeeper
  Issue Type: Bug
  Components: tests
Affects Versions: 3.4.6
Reporter: Patrick Hunt
Assignee: Helen Hastings
Priority: Blocker
 Fix For: 3.5.0

 Attachments: ZOOKEEPER-1870.patch, ZOOKEEPER-1870.patch, 
 ZOOKEEPER-1870.patch, test.log


 I'm seeing lots of the following failure. Seems like a flakey test (passes 
 every so often).
 {noformat}
 junit.framework.AssertionFailedError: client could not connect to 
 reestablished quorum: giving up after 30+ seconds.
   at 
 org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143)
   at 
 org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75)
   at 
 org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
 {noformat}
 I've found 3 problems:
 1. QuorumCnxManager.Listener.run() leaks the socket depending on when the 
 shutdown flag gets set.
 2. QuorumCnxManager.halt() doesn't wait for the listener to terminate.
 3. QuorumPeer.shuttingDownLE flag doesn't get reset when restarting the 
 leader election.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest

2014-06-25 Thread Flavio Junqueira (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Flavio Junqueira updated ZOOKEEPER-1870:


Attachment: ZOOKEEPER-1870.patch

I've made a minor change to the patch... We are missing the 3.4 patch, that's 
why I haven't resolved this issue.

 flakey test in StandaloneDisabledTest.startSingleServerTest
 ---

 Key: ZOOKEEPER-1870
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870
 Project: ZooKeeper
  Issue Type: Bug
  Components: tests
Affects Versions: 3.5.0
Reporter: Patrick Hunt
Assignee: Helen Hastings
Priority: Blocker
 Fix For: 3.5.0

 Attachments: ZOOKEEPER-1870.patch, ZOOKEEPER-1870.patch, 
 ZOOKEEPER-1870.patch, test.log


 I'm seeing lots of the following failure. Seems like a flakey test (passes 
 every so often).
 {noformat}
 junit.framework.AssertionFailedError: client could not connect to 
 reestablished quorum: giving up after 30+ seconds.
   at 
 org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143)
   at 
 org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75)
   at 
 org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
 {noformat}
 I've found 3 problems:
 1. QuorumCnxManager.Listener.run() leaks the socket depending on when the 
 shutdown flag gets set.
 2. QuorumCnxManager.halt() doesn't wait for the listener to terminate.
 3. QuorumPeer.shuttingDownLE flag doesn't get reset when restarting the 
 leader election.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest

2014-03-20 Thread Michi Mutsuzaki (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michi Mutsuzaki updated ZOOKEEPER-1870:
---

Attachment: ZOOKEEPER-1870.patch

3 additional changes:

- Reset proposedLeader to -1 in FastLeaderElection.shutdown().
- Get out of the WorkerReceiver.run() loop after calling 
self.getElectionAlg().shutdown().
- Make FastLeaderElection.getVote() public for unit test. Let me know if making 
this method public is ok with you guys.

 flakey test in StandaloneDisabledTest.startSingleServerTest
 ---

 Key: ZOOKEEPER-1870
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870
 Project: ZooKeeper
  Issue Type: Bug
  Components: tests
Affects Versions: 3.5.0
Reporter: Patrick Hunt
Assignee: Helen Hastings
Priority: Critical
 Attachments: ZOOKEEPER-1870.patch, ZOOKEEPER-1870.patch, test.log


 I'm seeing lots of the following failure. Seems like a flakey test (passes 
 every so often).
 {noformat}
 junit.framework.AssertionFailedError: client could not connect to 
 reestablished quorum: giving up after 30+ seconds.
   at 
 org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143)
   at 
 org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75)
   at 
 org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
 {noformat}
 I've found 3 problems:
 1. QuorumCnxManager.Listener.run() leaks the socket depending on when the 
 shutdown flag gets set.
 2. QuorumCnxManager.halt() doesn't wait for the listener to terminate.
 3. QuorumPeer.shuttingDownLE flag doesn't get reset when restarting the 
 leader election.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest

2014-03-20 Thread Michi Mutsuzaki (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michi Mutsuzaki updated ZOOKEEPER-1870:
---

Priority: Blocker  (was: Critical)

 flakey test in StandaloneDisabledTest.startSingleServerTest
 ---

 Key: ZOOKEEPER-1870
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870
 Project: ZooKeeper
  Issue Type: Bug
  Components: tests
Affects Versions: 3.5.0
Reporter: Patrick Hunt
Assignee: Helen Hastings
Priority: Blocker
 Attachments: ZOOKEEPER-1870.patch, ZOOKEEPER-1870.patch, test.log


 I'm seeing lots of the following failure. Seems like a flakey test (passes 
 every so often).
 {noformat}
 junit.framework.AssertionFailedError: client could not connect to 
 reestablished quorum: giving up after 30+ seconds.
   at 
 org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143)
   at 
 org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75)
   at 
 org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
 {noformat}
 I've found 3 problems:
 1. QuorumCnxManager.Listener.run() leaks the socket depending on when the 
 shutdown flag gets set.
 2. QuorumCnxManager.halt() doesn't wait for the listener to terminate.
 3. QuorumPeer.shuttingDownLE flag doesn't get reset when restarting the 
 leader election.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest

2014-03-19 Thread Raul Gutierrez Segales (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raul Gutierrez Segales updated ZOOKEEPER-1870:
--

Attachment: test.log

Hi Michi,

Platform is Fedora Linux, with 3.13 Kernel on x86_64:

{noformat}
$ uname -r
3.13.6-200.fc20.x86_64
$ arch
x86_64
$ cat /etc/fedora-release 
Fedora release 20 (Heisenbug)
{noformat}

This is mostly out of trunk plus some other patches that I had (but mostly 
unrelated). I'll run again out of pure trunk. 

 flakey test in StandaloneDisabledTest.startSingleServerTest
 ---

 Key: ZOOKEEPER-1870
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870
 Project: ZooKeeper
  Issue Type: Bug
  Components: tests
Affects Versions: 3.5.0
Reporter: Patrick Hunt
Assignee: Helen Hastings
Priority: Critical
 Attachments: ZOOKEEPER-1870.patch, test.log


 I'm seeing lots of the following failure. Seems like a flakey test (passes 
 every so often).
 {noformat}
 junit.framework.AssertionFailedError: client could not connect to 
 reestablished quorum: giving up after 30+ seconds.
   at 
 org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143)
   at 
 org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75)
   at 
 org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
 {noformat}
 I've found 3 problems:
 1. QuorumCnxManager.Listener.run() leaks the socket depending on when the 
 shutdown flag gets set.
 2. QuorumCnxManager.halt() doesn't wait for the listener to terminate.
 3. QuorumPeer.shuttingDownLE flag doesn't get reset when restarting the 
 leader election.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest

2014-03-18 Thread Michi Mutsuzaki (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michi Mutsuzaki updated ZOOKEEPER-1870:
---

Description: 
I'm seeing lots of the following failure. Seems like a flakey test (passes 
every so often).

{noformat}
junit.framework.AssertionFailedError: client could not connect to reestablished 
quorum: giving up after 30+ seconds.
at 
org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143)
at 
org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75)
at 
org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
{noformat}

I've found 3 problems that 

1. Prevent socket leak in QuorumCnxManager.Listener.run().
2. Make QuorumCnxManager.halt() join the listener.
3. Reset QuorumPeer.shuttingDownLE flag when restarting the leader election.


  was:
I'm seeing lots of the following failure. Seems like a flakey test (passes 
every so often).

{noformat}
junit.framework.AssertionFailedError: client could not connect to reestablished 
quorum: giving up after 30+ seconds.
at 
org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143)
at 
org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75)
at 
org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
{noformat}



 flakey test in StandaloneDisabledTest.startSingleServerTest
 ---

 Key: ZOOKEEPER-1870
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870
 Project: ZooKeeper
  Issue Type: Bug
  Components: tests
Affects Versions: 3.5.0
Reporter: Patrick Hunt
Assignee: Helen Hastings
Priority: Critical
 Attachments: ZOOKEEPER-1870.patch


 I'm seeing lots of the following failure. Seems like a flakey test (passes 
 every so often).
 {noformat}
 junit.framework.AssertionFailedError: client could not connect to 
 reestablished quorum: giving up after 30+ seconds.
   at 
 org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143)
   at 
 org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75)
   at 
 org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
 {noformat}
 I've found 3 problems that 
 1. Prevent socket leak in QuorumCnxManager.Listener.run().
 2. Make QuorumCnxManager.halt() join the listener.
 3. Reset QuorumPeer.shuttingDownLE flag when restarting the leader election.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest

2014-03-18 Thread Michi Mutsuzaki (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michi Mutsuzaki updated ZOOKEEPER-1870:
---

Description: 
I'm seeing lots of the following failure. Seems like a flakey test (passes 
every so often).

{noformat}
junit.framework.AssertionFailedError: client could not connect to reestablished 
quorum: giving up after 30+ seconds.
at 
org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143)
at 
org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75)
at 
org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
{noformat}

I've found 3 problems:

1. QuorumCnxManager.Listener.run() leaks the socket depending on when the 
shutdown flag gets set.
2. QuorumCnxManager.halt() doesn't wait for the listener to terminate.
3. QuorumPeer.shuttingDownLE flag doesn't get reset when restarting the leader 
election.


  was:
I'm seeing lots of the following failure. Seems like a flakey test (passes 
every so often).

{noformat}
junit.framework.AssertionFailedError: client could not connect to reestablished 
quorum: giving up after 30+ seconds.
at 
org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143)
at 
org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75)
at 
org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
{noformat}

I've found 3 problems that 

1. Prevent socket leak in QuorumCnxManager.Listener.run().
2. Make QuorumCnxManager.halt() join the listener.
3. Reset QuorumPeer.shuttingDownLE flag when restarting the leader election.



 flakey test in StandaloneDisabledTest.startSingleServerTest
 ---

 Key: ZOOKEEPER-1870
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870
 Project: ZooKeeper
  Issue Type: Bug
  Components: tests
Affects Versions: 3.5.0
Reporter: Patrick Hunt
Assignee: Helen Hastings
Priority: Critical
 Attachments: ZOOKEEPER-1870.patch


 I'm seeing lots of the following failure. Seems like a flakey test (passes 
 every so often).
 {noformat}
 junit.framework.AssertionFailedError: client could not connect to 
 reestablished quorum: giving up after 30+ seconds.
   at 
 org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143)
   at 
 org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75)
   at 
 org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
 {noformat}
 I've found 3 problems:
 1. QuorumCnxManager.Listener.run() leaks the socket depending on when the 
 shutdown flag gets set.
 2. QuorumCnxManager.halt() doesn't wait for the listener to terminate.
 3. QuorumPeer.shuttingDownLE flag doesn't get reset when restarting the 
 leader election.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest

2014-03-16 Thread Michi Mutsuzaki (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michi Mutsuzaki updated ZOOKEEPER-1870:
---

Attachment: ZOOKEEPER-1870.patch

 flakey test in StandaloneDisabledTest.startSingleServerTest
 ---

 Key: ZOOKEEPER-1870
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870
 Project: ZooKeeper
  Issue Type: Bug
  Components: tests
Affects Versions: 3.5.0
Reporter: Patrick Hunt
Assignee: Helen Hastings
Priority: Critical
 Attachments: ZOOKEEPER-1870.patch


 I'm seeing lots of the following failure. Seems like a flakey test (passes 
 every so often).
 {noformat}
 junit.framework.AssertionFailedError: client could not connect to 
 reestablished quorum: giving up after 30+ seconds.
   at 
 org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143)
   at 
 org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75)
   at 
 org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest

2014-03-12 Thread Michi Mutsuzaki (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michi Mutsuzaki updated ZOOKEEPER-1870:
---

Assignee: Helen Hastings  (was: Michi Mutsuzaki)

 flakey test in StandaloneDisabledTest.startSingleServerTest
 ---

 Key: ZOOKEEPER-1870
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870
 Project: ZooKeeper
  Issue Type: Bug
  Components: tests
Affects Versions: 3.5.0
Reporter: Patrick Hunt
Assignee: Helen Hastings
Priority: Critical

 I'm seeing lots of the following failure. Seems like a flakey test (passes 
 every so often).
 {noformat}
 junit.framework.AssertionFailedError: client could not connect to 
 reestablished quorum: giving up after 30+ seconds.
   at 
 org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143)
   at 
 org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75)
   at 
 org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (ZOOKEEPER-1870) flakey test in StandaloneDisabledTest.startSingleServerTest

2014-01-28 Thread Michi Mutsuzaki (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michi Mutsuzaki updated ZOOKEEPER-1870:
---

Assignee: Helen Hastings  (was: Alexander Shraer)

 flakey test in StandaloneDisabledTest.startSingleServerTest
 ---

 Key: ZOOKEEPER-1870
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1870
 Project: ZooKeeper
  Issue Type: Bug
  Components: tests
Affects Versions: 3.5.0
Reporter: Patrick Hunt
Assignee: Helen Hastings
Priority: Critical

 I'm seeing lots of the following failure. Seems like a flakey test (passes 
 every so often).
 {noformat}
 junit.framework.AssertionFailedError: client could not connect to 
 reestablished quorum: giving up after 30+ seconds.
   at 
 org.apache.zookeeper.test.ReconfigTest.testNormalOperation(ReconfigTest.java:143)
   at 
 org.apache.zookeeper.server.quorum.StandaloneDisabledTest.startSingleServerTest(StandaloneDisabledTest.java:75)
   at 
 org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)