Intermittent java core test failures
------------------------------------

                 Key: ZOOKEEPER-1125
                 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1125
             Project: ZooKeeper
          Issue Type: Bug
            Reporter: Vishal K
            Priority: Blocker


Some of the tests are consistently failing for me and intermittently on hudson.

Posting discussion from mailing list below.

Vishal,
 Can you please open a jira for this and mark it as a blocker for 3.4
release? Looks like its transient:

https://builds.apache.org/job/ZooKeeper-trunk/

The latest build is passing.

thanks
mahadev
- Hide quoted text -

On Mon, Jul 11, 2011 at 12:49 PM, Vishal Kher <vishalm...@gmail.com> wrote:
> Hi,
>
> ant test-core-java is consistently failing for me.
>
> The error seems to be either:
>
> Testcase: testFollowersStartAfterLeader took 35.577 sec
>    Caused an ERROR
> Did not connect
> java.util.concurrent.TimeoutException: Did not connect
>    at
> org.apache.zookeeper.test.ClientBase$CountdownWatcher.waitForConnected(ClientBase.java:124)
>    at
> org.apache.zookeeper.test.QuorumTest.testFollowersStartAfterLeader(QuorumTest.java:308)
>    at
> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
>
> or
>
> Testcase: testNoLogBeforeLeaderEstablishment took 8.831 sec
>    Caused an ERROR
> KeeperErrorCode = ConnectionLoss for /blah
> org.apache.zookeeper.KeeperException$ConnectionLossException:
> KeeperErrorCode = ConnectionLoss for /blah
>    at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
>    at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
>    at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:761)
>    at
> org.apache.zookeeper.test.QuorumTest.testNoLogBeforeLeaderEstablishment(QuorumTest.java:385)
>    at
> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
>
> Looks like the reason why the tests are failing for me is similar to why the
> tests failed on hudson:
>
> 2011-07-11 14:47:26,219 [myid:] - INFO  [QuorumPeer[myid=2]/0.0.0.0:11379
> :Leader@425] - Shutdown called
> java.lang.Exception: shutdown Leader! reason: Only 0 followers, need 1
>    at org.apache.zookeeper.server.quorum.Leader.shutdown(Leader.java:425)
>    at org.apache.zookeeper.server.quorum.Leader.lead(Leader.java:400)
>    at
> org.apache.zookeeper.server.quorum.QuorumPeer.run(QuorumPeer.java:729)
> 2011-07-11 14:47:26,220 [myid:] - INFO  [QuorumPeer[myid=2]/0.0.0.0:11379
> :ZooKeeperServer@416] - shutting down
>
> The leader is not able to ping the followers. Has anyone seen this before?
>
> Thanks.
> -Vishal
>
> On Sun, Jul 10, 2011 at 6:52 AM, Apache Jenkins Server <
> jenk...@builds.apache.org> wrote:
>
>> See https://builds.apache.org/job/ZooKeeper-trunk/1239/
>>
>>
>> ###################################################################################
>> ########################## LAST 60 LINES OF THE CONSOLE
>> ###########################
>> [...truncated 242795 lines...]
>>    [junit] 2011-07-10 10:57:16,673 [myid:] - INFO
>>  [main:SessionTrackerImpl@206] - Shutting down
>>    [junit] 2011-07-10 10:57:16,673 [myid:] - INFO
>>  [main:PrepRequestProcessor@702] - Shutting down
>>    [junit] 2011-07-10 10:57:16,674 [myid:] - INFO
>>  [main:SyncRequestProcessor@170] - Shutting down
>>    [junit] 2011-07-10 10:57:16,674 [myid:] - INFO
>>  [SyncThread:0:SyncRequestProcessor@152] - SyncRequestProcessor exited!
>>    [junit] 2011-07-10 10:57:16,675 [myid:] - INFO
>>  [main:FinalRequestProcessor@423] - shutdown of request processor complete
>>    [junit] 2011-07-10 10:57:16,674 [myid:] - INFO  [ProcessThread(sid:0
>> cport:-1)::PrepRequestProcessor@133] - PrepRequestProcessor exited loop!
>>    [junit] 2011-07-10 10:57:16,676 [myid:] - INFO  [main:ClientBase@227] -
>> connecting to 127.0.0.1 11221
>>    [junit] ensureOnly:[]
>>    [junit] 2011-07-10 10:57:16,677 [myid:] - INFO  [main:ClientBase@428] -
>> STARTING server
>>    [junit] 2011-07-10 10:57:16,678 [myid:] - INFO
>>  [main:ZooKeeperServer@164] - Created server with tickTime 3000
>> minSessionTimeout 6000 maxSessionTimeout 60000 datadir
>> /grid/0/hudson/hudson-slave/workspace/ZooKeeper-trunk/trunk/build/test/tmp/test1139867753736175617.junit.dir/version-2
>> snapdir
>> /grid/0/hudson/hudson-slave/workspace/ZooKeeper-trunk/trunk/build/test/tmp/test1139867753736175617.junit.dir/version-2
>>    [junit] 2011-07-10 10:57:16,679 [myid:] - INFO
>>  [main:NIOServerCnxnFactory@94] - binding to port 0.0.0.0/0.0.0.0:11221
>>    [junit] 2011-07-10 10:57:16,680 [myid:] - INFO  [main:FileSnap@83] -
>> Reading snapshot
>> /grid/0/hudson/hudson-slave/workspace/ZooKeeper-trunk/trunk/build/test/tmp/test1139867753736175617.junit.dir/version-2/snapshot.b
>>    [junit] 2011-07-10 10:57:16,683 [myid:] - INFO  [main:FileTxnSnapLog@256]
>> - Snapshotting: b
>>    [junit] 2011-07-10 10:57:16,684 [myid:] - INFO  [main:ClientBase@227] -
>> connecting to 127.0.0.1 11221
>>    [junit] 2011-07-10 10:57:16,685 [myid:] - INFO  [NIOServerCxn.Factory:
>> 0.0.0.0/0.0.0.0:11221:NIOServerCnxnFactory@197] - Accepted socket
>> connection from /127.0.0.1:45122
>>    [junit] 2011-07-10 10:57:16,686 [myid:] - INFO  [NIOServerCxn.Factory:
>> 0.0.0.0/0.0.0.0:11221:NIOServerCnxn@815] - Processing stat command from /
>> 127.0.0.1:45122
>>    [junit] 2011-07-10 10:57:16,686 [myid:] - INFO
>>  [Thread-5:NIOServerCnxn$StatCommand@652] - Stat command output
>>    [junit] 2011-07-10 10:57:16,688 [myid:] - INFO
>>  [Thread-5:NIOServerCnxn@995] - Closed socket connection for client /
>> 127.0.0.1:45122 (no session established for client)
>>    [junit] ensureOnly:[InMemoryDataTree, StandaloneServer_port]
>>    [junit] expect:InMemoryDataTree
>>    [junit] found:InMemoryDataTree
>> org.apache.ZooKeeperService:name0=StandaloneServer_port-1,name1=InMemoryDataTree
>>    [junit] expect:StandaloneServer_port
>>    [junit] found:StandaloneServer_port
>> org.apache.ZooKeeperService:name0=StandaloneServer_port-1
>>    [junit] 2011-07-10 10:57:16,690 [myid:] - INFO
>>  [main:JUnit4ZKTestRunner$LoggedInvokeMethod@57] - FINISHED TEST METHOD
>> testQuota
>>    [junit] 2011-07-10 10:57:16,690 [myid:] - INFO  [main:ClientBase@465] -
>> tearDown starting
>>    [junit] 2011-07-10 10:57:16,754 [myid:] - INFO  [main:ZooKeeper@662] -
>> Session: 0x13113b1aca50000 closed
>>    [junit] 2011-07-10 10:57:16,754 [myid:] - INFO
>>  [main-EventThread:ClientCnxn$EventThread@495] - EventThread shut down
>>    [junit] 2011-07-10 10:57:16,754 [myid:] - INFO  [main:ClientBase@435] -
>> STOPPING server
>>    [junit] 2011-07-10 10:57:16,755 [myid:] - INFO  [NIOServerCxn.Factory:
>> 0.0.0.0/0.0.0.0:11221:NIOServerCnxnFactory@224] - NIOServerCnxn factory
>> exited run method
>>    [junit] 2011-07-10 10:57:16,755 [myid:] - INFO
>>  [main:ZooKeeperServer@416] - shutting down
>>    [junit] 2011-07-10 10:57:16,756 [myid:] - INFO
>>  [main:SessionTrackerImpl@206] - Shutting down
>>    [junit] 2011-07-10 10:57:16,756 [myid:] - INFO
>>  [main:PrepRequestProcessor@702] - Shutting down
>>    [junit] 2011-07-10 10:57:16,757 [myid:] - INFO
>>  [main:SyncRequestProcessor@170] - Shutting down
>>    [junit] 2011-07-10 10:57:16,760 [myid:] - INFO  [ProcessThread(sid:0
>> cport:-1)::PrepRequestProcessor@133] - PrepRequestProcessor exited loop!
>>    [junit] 2011-07-10 10:57:16,762 [myid:] - INFO
>>  [SyncThread:0:SyncRequestProcessor@152] - SyncRequestProcessor exited!
>>    [junit] 2011-07-10 10:57:16,762 [myid:] - INFO
>>  [main:FinalRequestProcessor@423] - shutdown of request processor complete
>>    [junit] 2011-07-10 10:57:16,763 [myid:] - INFO  [main:ClientBase@227] -
>> connecting to 127.0.0.1 11221
>>    [junit] ensureOnly:[]
>>    [junit] 2011-07-10 10:57:16,767 [myid:] - INFO  [main:ClientBase@493] -
>> fdcount after test is: 35 at start it was 24
>>    [junit] 2011-07-10 10:57:16,767 [myid:] - INFO  [main:ClientBase@495] -
>> sleeping for 20 secs
>>    [junit] 2011-07-10 10:57:16,768 [myid:] - INFO  [main:ZKTestCase$1@60]
>> - SUCCEEDED testQuota
>>    [junit] 2011-07-10 10:57:16,768 [myid:] - INFO  [main:ZKTestCase$1@55]
>> - FINISHED testQuota
>>    [junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0.691 sec
>>
>> BUILD FAILED
>> /grid/0/hudson/hudson-slave/workspace/ZooKeeper-trunk/trunk/build.xml:959:
>> The following error occurred while executing this line:
>> /grid/0/hudson/hudson-slave/workspace/ZooKeeper-trunk/trunk/build.xml:870:
>> Tests failed!
>>
>> Total time: 19 minutes 0 seconds
>> [FINDBUGS] Skipping publisher since build result is FAILURE
>> [WARNINGS] Skipping publisher since build result is FAILURE
>> Recording fingerprints
>> Archiving artifacts
>> Recording test results
>> Publishing Javadoc
>> Publishing Clover coverage report...
>> No Clover report will be published due to a Build Failure
>> Email was triggered for: Failure
>> Sending email for trigger: Failure
>>
>>
>>
>>
>> ###################################################################################
>> ############################## FAILED TESTS (if any)
>> ##############################
>> 2 tests failed.
>> REGRESSION:  org.apache.zookeeper.test.ObserverTest.testObserver
>>
>> Error Message:
>> KeeperErrorCode = ConnectionLoss for /obstest
>>
>> Stack Trace:
>> org.apache.zookeeper.KeeperException$ConnectionLossException:
>> KeeperErrorCode = ConnectionLoss for /obstest
>>        at
>> org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
>>        at
>> org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
>>        at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:761)
>>        at
>> org.apache.zookeeper.test.ObserverTest.testObserver(ObserverTest.java:101)
>>        at
>> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
>>
>>
>> REGRESSION:  org.apache.zookeeper.test.ReadOnlyModeTest.testSeekForRwServer
>>
>> Error Message:
>> KeeperErrorCode = ConnectionLoss for /test
>>
>> Stack Trace:
>> org.apache.zookeeper.KeeperException$ConnectionLossException:
>> KeeperErrorCode = ConnectionLoss for /test
>>        at
>> org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
>>        at
>> org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
>>        at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:761)
>>        at
>> org.apache.zookeeper.test.ReadOnlyModeTest.testSeekForRwServer(ReadOnlyModeTest.java:213)
>>        at
>> org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to