[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-18 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13973973#comment-13973973
 ] 

Hudson commented on YARN-1281:
--

FAILURE: Integrated in Hadoop-Yarn-trunk #544 (See 
[https://builds.apache.org/job/Hadoop-Yarn-trunk/544/])
YARN-1281. Fixed TestZKRMStateStoreZKClientConnections to not fail 
intermittently due to ZK-client timeouts. Contributed by Tsuyoshi Ozawa. 
(vinodkv: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1588369)
* /hadoop/common/trunk/hadoop-yarn-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/recovery/TestZKRMStateStoreZKClientConnections.java


 TestZKRMStateStoreZKClientConnections fails intermittently
 --

 Key: YARN-1281
 URL: https://issues.apache.org/jira/browse/YARN-1281
 Project: Hadoop YARN
  Issue Type: Test
  Components: resourcemanager
Reporter: Karthik Kambatla
Assignee: Tsuyoshi OZAWA
 Fix For: 2.4.1

 Attachments: YARN-1281.1.patch, YARN-1281.2.patch, output.txt


 The test fails intermittently - haven't been able to reproduce the failure 
 deterministically. 



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-18 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13974144#comment-13974144
 ] 

Hudson commented on YARN-1281:
--

SUCCESS: Integrated in Hadoop-Mapreduce-trunk #1761 (See 
[https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1761/])
YARN-1281. Fixed TestZKRMStateStoreZKClientConnections to not fail 
intermittently due to ZK-client timeouts. Contributed by Tsuyoshi Ozawa. 
(vinodkv: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1588369)
* /hadoop/common/trunk/hadoop-yarn-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/recovery/TestZKRMStateStoreZKClientConnections.java




[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-17 Thread Mit Desai (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13972964#comment-13972964
 ] 

Mit Desai commented on YARN-1281:
-

+1 (non binding)

 TestZKRMStateStoreZKClientConnections fails intermittently
 --

 Key: YARN-1281
 URL: https://issues.apache.org/jira/browse/YARN-1281
 Project: Hadoop YARN
  Issue Type: Test
  Components: resourcemanager
Reporter: Karthik Kambatla
Assignee: Tsuyoshi OZAWA
 Attachments: YARN-1281.1.patch, YARN-1281.2.patch, output.txt


 The test fails intermittently - haven't been able to reproduce the failure 
 deterministically. 





[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-17 Thread Vinod Kumar Vavilapalli (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13973377#comment-13973377
 ] 

Vinod Kumar Vavilapalli commented on YARN-1281:
---

This looks good. Tx for the stress test of the test itself! Checking this in.



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-17 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13973432#comment-13973432
 ] 

Hudson commented on YARN-1281:
--

SUCCESS: Integrated in Hadoop-trunk-Commit #5535 (See 
[https://builds.apache.org/job/Hadoop-trunk-Commit/5535/])
YARN-1281. Fixed TestZKRMStateStoreZKClientConnections to not fail 
intermittently due to ZK-client timeouts. Contributed by Tsuyoshi Ozawa. 
(vinodkv: 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1588369)
* /hadoop/common/trunk/hadoop-yarn-project/CHANGES.txt
* 
/hadoop/common/trunk/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/recovery/TestZKRMStateStoreZKClientConnections.java




[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-16 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13971751#comment-13971751
 ] 

Hadoop QA commented on YARN-1281:
-

{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12640495/YARN-1281.1.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 1 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 eclipse:eclipse{color}.  The patch built with 
eclipse:eclipse.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in 
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager.

{color:green}+1 contrib tests{color}.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-YARN-Build/3577//testReport/
Console output: https://builds.apache.org/job/PreCommit-YARN-Build/3577//console

This message is automatically generated.

 TestZKRMStateStoreZKClientConnections fails intermittently
 --

 Key: YARN-1281
 URL: https://issues.apache.org/jira/browse/YARN-1281
 Project: Hadoop YARN
  Issue Type: Bug
  Components: resourcemanager
Reporter: Karthik Kambatla
Assignee: Tsuyoshi OZAWA
 Attachments: YARN-1281.1.patch, output.txt


 The test fails intermittently - haven't been able to reproduce the failure 
 deterministically. 





[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-16 Thread Mit Desai (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13971758#comment-13971758
 ] 

Mit Desai commented on YARN-1281:
-

I'll take a look at the patch and post comments by the end of the day.



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-16 Thread Tsuyoshi OZAWA (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13971775#comment-13971775
 ] 

Tsuyoshi OZAWA commented on YARN-1281:
--

Thanks, Mit. I found some mistakes in the first patch and updated it. Please 
check the latest one.

 TestZKRMStateStoreZKClientConnections fails intermittently
 --

 Key: YARN-1281
 URL: https://issues.apache.org/jira/browse/YARN-1281
 Project: Hadoop YARN
  Issue Type: Bug
  Components: resourcemanager
Reporter: Karthik Kambatla
Assignee: Tsuyoshi OZAWA
 Attachments: YARN-1281.1.patch, YARN-1281.2.patch, output.txt


 The test fails intermittently - haven't been able to reproduce the failure 
 deterministically. 





[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-16 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13971836#comment-13971836
 ] 

Hadoop QA commented on YARN-1281:
-

{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12640514/YARN-1281.2.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 1 new 
or modified test files.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  There were no new javadoc warning messages.

{color:green}+1 eclipse:eclipse{color}.  The patch built with 
eclipse:eclipse.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in 
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager.

{color:green}+1 contrib tests{color}.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-YARN-Build/3578//testReport/
Console output: https://builds.apache.org/job/PreCommit-YARN-Build/3578//console

This message is automatically generated.



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-16 Thread Mit Desai (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13971866#comment-13971866
 ] 

Mit Desai commented on YARN-1281:
-

LGTM.
Just one question about the following change in patch-2:
{code}
@@ -136,7 +137,7 @@ public void testZKClientRetry() throws Exception {
 TestZKClient zkClientTester = new TestZKClient();
 final String path = "/test";
 YarnConfiguration conf = new YarnConfiguration();
-conf.setInt(YarnConfiguration.RM_ZK_TIMEOUT_MS, 1000);
+conf.setInt(YarnConfiguration.RM_ZK_TIMEOUT_MS, ZK_TIMEOUT_MS);
{code}
The timeout was 1000 ms, and ZK_TIMEOUT_MS is also set to 1000. Are you 
confident this will not cause the timeout issue again in the future? If unsure, 
I think setting ZK_TIMEOUT_MS = 1500 would be a good idea.



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-16 Thread Tsuyoshi OZAWA (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13971955#comment-13971955
 ] 

Tsuyoshi OZAWA commented on YARN-1281:
--

Thanks for the review, Mit. 

RM_ZK_TIMEOUT_MS is the ZK session timeout. IMO, 1000 msec is safe and long 
enough to keep the session alive. The test failure is caused by the 100 msec 
timeout in TestZKRMStateStore#getNewZooKeeper(), which is called by 
ZKRMStateStore#createConnection(). 
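
To illustrate why a 100 msec wait can fail intermittently while a longer one does not, here is a minimal sketch in plain Java. It does not use the ZooKeeper API or the actual test code: the class name ConnectWaitSketch, the method awaitConnected, and the simulated latency values are all hypothetical. A background thread stands in for the connection handshake, and the caller waits on a latch with a bounded timeout.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

public class ConnectWaitSketch {
    /**
     * Simulates a client whose "connected" event arrives after connectDelayMs
     * while the caller gives up after waitTimeoutMs. Returns true only if the
     * event arrived within the wait budget. All names here are hypothetical.
     */
    public static boolean awaitConnected(long connectDelayMs, long waitTimeoutMs) {
        CountDownLatch connected = new CountDownLatch(1);
        Thread handshake = new Thread(() -> {
            try {
                Thread.sleep(connectDelayMs);   // stand-in for connection setup latency
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                return;
            }
            connected.countDown();              // stand-in for a "connected" notification
        });
        handshake.setDaemon(true);
        handshake.start();
        try {
            return connected.await(waitTimeoutMs, TimeUnit.MILLISECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }

    public static void main(String[] args) {
        // A slow handshake (assumed 400 ms) against a short and a long wait budget.
        System.out.println("100 ms budget:  " + awaitConnected(400, 100));
        System.out.println("2000 ms budget: " + awaitConnected(400, 2000));
    }
}
```

On a loaded build machine the handshake latency varies from run to run, so a short wait budget turns that variance into occasional failures.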

Good news: I ran the test 300 times with patch-2 and confirmed that every run 
succeeded. Without the patch, the test fails about once every 50 runs.
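
As a rough sanity check on these numbers, assume that runs are independent and that the unpatched failure rate is 1 in 50 per run (both are assumptions for illustration, not measurements from this issue). The chance of 300 clean runs in a row without a fix would then be (49/50)^300, roughly 0.2%. The sketch below computes it; FlakyOdds and allPass are hypothetical names.

```java
public class FlakyOdds {
    /** Probability that `runs` independent runs all pass, given a per-run failure rate. */
    public static double allPass(double failureRate, int runs) {
        return Math.pow(1.0 - failureRate, runs);
    }

    public static void main(String[] args) {
        // Assumption: a 1-in-50 failure rate per run for the unpatched test.
        double p = allPass(1.0 / 50.0, 300);
        System.out.printf("P(300 clean runs by chance) = %.4f%n", p);
    }
}
```

Under those assumptions, 300 consecutive passes are unlikely to be luck, which supports the claim that the patch addresses the failure.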



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-15 Thread Tsuyoshi OZAWA (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969261#comment-13969261
 ] 

Tsuyoshi OZAWA commented on YARN-1281:
--

Reproduced the bug.

{quote}
---
Test set: 
org.apache.hadoop.yarn.server.resourcemanager.recovery.TestZKRMStateStoreZKClientConnections
---
Tests run: 6, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 3.73 sec  
FAILURE! - in 
org.apache.hadoop.yarn.server.resourcemanager.recovery.TestZKRMStateStoreZKClientConnections
testZKClientDisconnectAndReconnect(org.apache.hadoop.yarn.server.resourcemanager.recovery.TestZKRMStateStoreZKClientConnections)
  Time elapsed: 0.174 sec   FAILURE!
java.lang.AssertionError: Unable to connect to server
at org.junit.Assert.fail(Assert.java:93)
at 
org.apache.hadoop.ha.ClientBaseWithFixes.createClient(ClientBaseWithFixes.java:180)
at 
org.apache.hadoop.yarn.server.resourcemanager.recovery.TestZKRMStateStoreZKClientConnections.access$100(TestZKRMStateStoreZKClientConnections.java:47)
{quote}

{quote}
2014-04-15 04:11:27,000 INFO  [SessionTracker] server.SessionTrackerImpl 
(SessionTrackerImpl.java:run(162)) - SessionTrackerImpl exited loop!
2014-04-15 04:11:27,660 INFO  [Thread-2-SendThread(localhost:11221)] 
zookeeper.ClientCnxn (ClientCnxn.java:logStartConnect(966)) - Opening socket 
connection to server localhost/127.0.0.1:11221. Will not attempt to 
authenticate using SASL (unknown error)
2014-04-15 04:11:27,661 WARN  [Thread-2-SendThread(localhost:11221)] 
zookeeper.ClientCnxn (ClientCnxn.java:run(1089)) - Session 0x145639514f7 
for server null, unexpected error, closing socket connection and attempting 
reconnect
java.net.ConnectException: Connection refused
at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
at 
sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739)
at 
org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350)
at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068)
{quote}

I'll take a deeper look.

 TestZKRMStateStoreZKClientConnections fails intermittently
 --

 Key: YARN-1281
 URL: https://issues.apache.org/jira/browse/YARN-1281
 Project: Hadoop YARN
  Issue Type: Bug
  Components: resourcemanager
Reporter: Karthik Kambatla

 The test fails intermittently - haven't been able to reproduce the failure 
 deterministically. 





[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-15 Thread Tsuyoshi OZAWA (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969336#comment-13969336
 ] 

Tsuyoshi OZAWA commented on YARN-1281:
--

Sorry, the log I posted last time shows correct behavior. Please ignore 
it.

 TestZKRMStateStoreZKClientConnections fails intermittently
 --

 Key: YARN-1281
 URL: https://issues.apache.org/jira/browse/YARN-1281
 Project: Hadoop YARN
  Issue Type: Bug
  Components: resourcemanager
Reporter: Karthik Kambatla
 Attachments: output.txt


 The test fails intermittently - haven't been able to reproduce the failure 
 deterministically. 





[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-15 Thread Tsuyoshi OZAWA (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969338#comment-13969338
 ] 

Tsuyoshi OZAWA commented on YARN-1281:
--

Maybe this is a log related to the test failure:

{quote}
2014-04-15 08:13:04,905 INFO  [Thread-16] zookeeper.JUnit4ZKTestRunner 
(JUnit4ZKTestRunner.java:evaluate(54)) - TEST METHOD FAILED testZKClientRetry
java.lang.AssertionError: Unable to connect to server
at org.junit.Assert.fail(Assert.java:93)
at 
org.apache.hadoop.ha.ClientBaseWithFixes.createClient(ClientBaseWithFixes.java:180)
at 
org.apache.hadoop.yarn.server.resourcemanager.recovery.TestZKRMStateStoreZKClientConnections.access$100(TestZKRMStateStoreZKClientConnections.java:47)
at 
org.apache.hadoop.yarn.server.resourcemanager.recovery.TestZKRMStateStoreZKClientConnections$TestZKClient$TestZKRMStateStore.getNewZooKeeper(TestZKRMStateStoreZKClientConnections.java:87)
at 
org.apache.hadoop.yarn.server.resourcemanager.recovery.ZKRMStateStore.createConnection(ZKRMStateStore.java:993)
at 
org.apache.hadoop.yarn.server.resourcemanager.recovery.ZKRMStateStore.startInternal(ZKRMStateStore.java:248)
at 
org.apache.hadoop.yarn.server.resourcemanager.recovery.RMStateStore.serviceStart(RMStateStore.java:282)
at 
org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
at 
org.apache.hadoop.yarn.server.resourcemanager.recovery.TestZKRMStateStoreZKClientConnections$TestZKClient$TestZKRMStateStore.init(TestZKRMStateStoreZKClientConnections.java:80)
at 
org.apache.hadoop.yarn.server.resourcemanager.recovery.TestZKRMStateStoreZKClientConnections$TestZKClient.getRMStateStore(TestZKRMStateStoreZKClientConnections.java:129)
at 
org.apache.hadoop.yarn.server.resourcemanager.recovery.TestZKRMStateStoreZKClientConnections.testZKClientRetry(TestZKRMStateStoreZKClientConnections.java:141)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at 
org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:45)
at 
org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:15)
at 
org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:42)
at 
org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:20)
at 
org.apache.zookeeper.JUnit4ZKTestRunner$LoggedInvokeMethod.evaluate(JUnit4ZKTestRunner.java:52)
at 
org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:62)
{quote}



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-15 Thread Mit Desai (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969640#comment-13969640
 ] 

Mit Desai commented on YARN-1281:
-

[~ozawa], If you are working on the issue, can you assign it to yourself?



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-14 Thread Mit Desai (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13968480#comment-13968480
 ] 

Mit Desai commented on YARN-1281:
-

[~kasha], are you still seeing this test failing?

 TestZKRMStateStoreZKClientConnections fails intermittently
 --

 Key: YARN-1281
 URL: https://issues.apache.org/jira/browse/YARN-1281
 Project: Hadoop YARN
  Issue Type: Bug
  Components: resourcemanager
Reporter: Karthik Kambatla
Assignee: Karthik Kambatla

 The test fails intermittently - haven't been able to reproduce the failure 
 deterministically. 





[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-04-14 Thread Karthik Kambatla (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13968501#comment-13968501
 ] 

Karthik Kambatla commented on YARN-1281:


Yes, in almost every nightly run. I have been caught up with other things and 
haven't been able to look into this. I temporarily marked it Unassigned so 
someone else can pick it up. I will take it back when I get a chance to fix it. 



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-02-19 Thread Mit Desai (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13905799#comment-13905799
 ] 

Mit Desai commented on YARN-1281:
-

Is this failure just related to the test or is there some bug in hadoop?



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-02-19 Thread Karthik Kambatla (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13905833#comment-13905833
 ] 

Karthik Kambatla commented on YARN-1281:


I believe it is just related to the test, as other testing didn't reveal 
anything. Haven't been able to reliably reproduce it either. 



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-02-19 Thread Mit Desai (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13906020#comment-13906020
 ] 

Mit Desai commented on YARN-1281:
-

I tried it on my machine and it passed there too. I just wanted to make sure 
it is a test issue and not a real bug.



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-02-19 Thread Karthik Kambatla (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13906125#comment-13906125
 ] 

Karthik Kambatla commented on YARN-1281:


I actually see this failing in our nightly builds every so often. I just 
haven't figured out a way to reliably reproduce it. 



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2014-02-07 Thread Chen He (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13894800#comment-13894800
 ] 

Chen He commented on YARN-1281:
---

Agree with Ozawa, I tested it on my laptop. It passed. 



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2013-11-14 Thread Tsuyoshi OZAWA (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13823360#comment-13823360
 ] 

Tsuyoshi OZAWA commented on YARN-1281:
--

The test also passes on my local machine. Sometimes tests in Hadoop fail 
because the DNS settings are wrong and sockets cannot connect to the local 
server. Is this one of those cases?
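
A quick way to rule this out on a given machine is to check that localhost resolves to a loopback address and that a local socket connection succeeds. The following is a minimal sketch using only java.net; LoopbackCheck and its method names are hypothetical and not part of Hadoop.

```java
import java.io.IOException;
import java.net.InetAddress;
import java.net.InetSocketAddress;
import java.net.ServerSocket;
import java.net.Socket;

public class LoopbackCheck {
    /** Returns true if the name "localhost" resolves to a loopback address. */
    public static boolean localhostIsLoopback() {
        try {
            return InetAddress.getByName("localhost").isLoopbackAddress();
        } catch (IOException e) {   // UnknownHostException is a subclass of IOException
            return false;
        }
    }

    /** Opens a server socket on an ephemeral port and connects to it via "localhost". */
    public static boolean canConnectLocally(int timeoutMs) {
        try (ServerSocket server = new ServerSocket(0);
             Socket client = new Socket()) {
            client.connect(new InetSocketAddress("localhost", server.getLocalPort()), timeoutMs);
            return client.isConnected();
        } catch (IOException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println("localhost is loopback: " + localhostIsLoopback());
        System.out.println("local connect works:   " + canConnectLocally(2000));
    }
}
```

If either check reports false, the machine's name resolution or firewall setup is a more likely culprit than the test itself.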



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2013-10-28 Thread Karthik Kambatla (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13807472#comment-13807472
 ] 

Karthik Kambatla commented on YARN-1281:


Looking into the stack trace of a sample failure, it looks like a timeout 
waiting for connection to ZooKeeper.

{noformat}
java.lang.AssertionError: Unable to connect to server
at org.junit.Assert.fail(Assert.java:93)
at 
org.apache.hadoop.ha.ClientBaseWithFixes.createClient(ClientBaseWithFixes.java:180)
at 
org.apache.hadoop.yarn.server.resourcemanager.recovery.TestZKRMStateStoreZKClientConnections.access$100(TestZKRMStateStoreZKClientConnections.java:44)
{noformat}

Not sure if we can do anything but just bump up the default value for the 
timeout? 



[jira] [Commented] (YARN-1281) TestZKRMStateStoreZKClientConnections fails intermittently

2013-10-28 Thread Karthik Kambatla (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13807478#comment-13807478
 ] 

Karthik Kambatla commented on YARN-1281:


The timeout is currently 30 seconds. On my local machine, the test passes even 
with a timeout of 3 seconds. Thoughts? 
