[jira] [Updated] (ZOOKEEPER-938) Support Kerberos authentication of clients.

2011-08-09 Thread Eugene Koontz (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eugene Koontz updated ZOOKEEPER-938:


Attachment: ZOOKEEPER-938.patch

Address Benjamin Reed's feedback:

-Remove OpCode.sasl from Request.java since it's not needed there.
-in ZooKeeperServer, move SASL-related server-side code into separate method 
processSasl()

 Support Kerberos authentication of clients.
 ---

 Key: ZOOKEEPER-938
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-938
 Project: ZooKeeper
  Issue Type: New Feature
  Components: java client, server
Reporter: Eugene Koontz
Assignee: Eugene Koontz
 Fix For: 3.4.0, 3.5.0

 Attachments: NIOServerCnxn.patch, ZOOKEEPER-938.patch, 
 ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, 
 ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, 
 ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, 
 ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, jaas.conf, 
 sasl.patch


 Support Kerberos authentication of clients. 
 The following usage would let an admin use Kerberos authentication to assign 
 ACLs to authenticated clients.
 1. Admin logs into zookeeper (not necessarily through Kerberos however). 
 2. Admin decides that a new node called '/mynode' should be owned by the user 
 'zkclient' and have full permissions on this.
 3. Admin does: zk create /mynode content sasl:zkcli...@foofers.org:cdrwa
 4. User 'zkclient' logins to kerberos using the command line utility 'kinit'.
 5. User connects to zookeeper server using a Kerberos-enabled version of 
 zkClient (ZookeeperMain).
 6. Behind the scenes, the client and server exchange authentication 
 information. User is now authenticated as 'zkclient'.
 7. User accesses /mynode with permissions 'cdrwa'.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (ZOOKEEPER-938) Support Kerberos authentication of clients.

2011-08-09 Thread Eugene Koontz (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081866#comment-13081866
 ] 

Eugene Koontz commented on ZOOKEEPER-938:
-

Hi Benjamin, 
Thanks a lot for your feedback! I addressed the two cleanup issues in a new 
patch. 

Regarding the sun.* classes, these are only needed to determine the default 
Kerberos realm from the system's Kerberos conf file (usually in 
/etc/krb5.conf). 

I think we can use the java.security.* functionality to replace sun.security.*: 
 
 
http://download.oracle.com/javase/1.4.2/docs/guide/security/jgss/tutorials/KerberosReq.html

I will try this out and update the patch if this will work.

 Support Kerberos authentication of clients.
 ---

 Key: ZOOKEEPER-938
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-938
 Project: ZooKeeper
  Issue Type: New Feature
  Components: java client, server
Reporter: Eugene Koontz
Assignee: Eugene Koontz
 Fix For: 3.4.0, 3.5.0

 Attachments: NIOServerCnxn.patch, ZOOKEEPER-938.patch, 
 ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, 
 ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, 
 ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, 
 ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, jaas.conf, 
 sasl.patch


 Support Kerberos authentication of clients. 
 The following usage would let an admin use Kerberos authentication to assign 
 ACLs to authenticated clients.
 1. Admin logs into zookeeper (not necessarily through Kerberos however). 
 2. Admin decides that a new node called '/mynode' should be owned by the user 
 'zkclient' and have full permissions on this.
 3. Admin does: zk create /mynode content sasl:zkcli...@foofers.org:cdrwa
 4. User 'zkclient' logins to kerberos using the command line utility 'kinit'.
 5. User connects to zookeeper server using a Kerberos-enabled version of 
 zkClient (ZookeeperMain).
 6. Behind the scenes, the client and server exchange authentication 
 information. User is now authenticated as 'zkclient'.
 7. User accesses /mynode with permissions 'cdrwa'.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (ZOOKEEPER-938) Support Kerberos authentication of clients.

2011-08-09 Thread Eugene Koontz (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13081939#comment-13081939
 ] 

Eugene Koontz commented on ZOOKEEPER-938:
-

Hi Benjamin,
 
Unfortunately I couldn't find support for obtaining the default Kerberos realm 
in java.security or javax.security. 

(In fact, it seems that javax.security uses sun.security internally, for 
example: 
http://www.java2s.com/Open-Source/Java-Document/6.0-JDK-Core/security/javax/security/auth/kerberos/KerberosPrincipal.java.htm.
  (see import sun.security.krb5.Asn1Exception; in the source code)).

So I don't see any way around using the sun.* classes.

-Eugene


 Support Kerberos authentication of clients.
 ---

 Key: ZOOKEEPER-938
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-938
 Project: ZooKeeper
  Issue Type: New Feature
  Components: java client, server
Reporter: Eugene Koontz
Assignee: Eugene Koontz
 Fix For: 3.4.0, 3.5.0

 Attachments: NIOServerCnxn.patch, ZOOKEEPER-938.patch, 
 ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, 
 ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, 
 ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, 
 ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, ZOOKEEPER-938.patch, jaas.conf, 
 sasl.patch


 Support Kerberos authentication of clients. 
 The following usage would let an admin use Kerberos authentication to assign 
 ACLs to authenticated clients.
 1. Admin logs into zookeeper (not necessarily through Kerberos however). 
 2. Admin decides that a new node called '/mynode' should be owned by the user 
 'zkclient' and have full permissions on this.
 3. Admin does: zk create /mynode content sasl:zkcli...@foofers.org:cdrwa
 4. User 'zkclient' logins to kerberos using the command line utility 'kinit'.
 5. User connects to zookeeper server using a Kerberos-enabled version of 
 zkClient (ZookeeperMain).
 6. Behind the scenes, the client and server exchange authentication 
 information. User is now authenticated as 'zkclient'.
 7. User accesses /mynode with permissions 'cdrwa'.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (ZOOKEEPER-1145) ObserverTest.testObserver fails at particular point after several runs of ant junt.run -Dtestcase=ObserverTest

2011-08-09 Thread Eugene Koontz (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13082015#comment-13082015
 ] 

Eugene Koontz commented on ZOOKEEPER-1145:
--

Hi Vishal, Good news. I tried trunk today (commit e9373536) with your patch on 
a EC-2 m2.xlarge. I ran ObserverTest 300 times and the whole ant test 3 
times. No errors.

 ObserverTest.testObserver fails at particular point after several runs of ant 
 junt.run -Dtestcase=ObserverTest
 --

 Key: ZOOKEEPER-1145
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1145
 Project: ZooKeeper
  Issue Type: Bug
Affects Versions: 3.4.0
Reporter: Eugene Koontz
Assignee: Vishal Kher
Priority: Blocker
 Fix For: 3.4.0

 Attachments: out.txt, repeat.sh


 Use the attached repeat.sh to run ObserverTest repeatedly by doing: 
 src/repeat.sh ObserverTest
 The test will will fail eventually after a few iterations; should be only a 
 few minutes.
 The line that fails in the test is: 
 zk = new ZooKeeper(127.0.0.1: + CLIENT_PORT_OBS,
 ClientBase.CONNECTION_TIMEOUT, this);
 Attached as out.txt is the output showing a successful run, for comparison, 
 followed by a failed run.
 Note that in the seconds before the test fails, in the following lines, that 
 there is a 24 second gap in time (between 22:13:02 and 22:13:26):
 bq.
 [junit] 2011-08-03 22:13:02,167 [myid:3] - INFO  
 [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11229:ZooKeeperServer@833] - Client 
 attempting to establish new session at /127.0.0.1:46929
 [junit] 2011-08-03 22:13:26,003 [myid:2] - INFO  
 [QuorumPeer[myid=2]/0:0:0:0:0:0:0:0:11228:Leader@419] - Shutting down
 [junit] 2011-08-03 22:13:26,003 [myid:2] - INFO  
 [QuorumPeer[myid=2]/0:0:0:0:0:0:0:0:11228:Leader@425] - Shutdown called
 [junit] java.lang.Exception: shutdown Leader! reason: Only 0 followers, need 1

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




ZooKeeper-trunk - Build # 1258 - Failure

2011-08-09 Thread Apache Jenkins Server
See https://builds.apache.org/job/ZooKeeper-trunk/1258/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 1122 lines...]
AUhudsonPatchQueueAdmin.sh
AUprocessHadoopPatchEmail.sh
A README.txt
A test-patch
A test-patch/test-patch.sh
At revision 1155988
No emails were triggered.
[ZooKeeper-trunk] $ /bin/bash /tmp/hudson3188972585120407500.sh
1024


==
==
CLEAN: cleaning workspace
==
==


Error: JAVA_HOME is not defined correctly.
  We cannot execute /home/hudson/tools/java/latest1.6/bin/java


==
==
BUILD: ant hudson-test-trunk -Dtest.junit.output.format=xml -Dtest.output=yes 
-Dfindbugs.home=${FINDBUGS_HOME} -Dforrest.home=${FORREST_HOME} 
-Djava5.home=${JAVA5_HOME}
==
==


Error: JAVA_HOME is not defined correctly.
  We cannot execute /home/hudson/tools/java/latest1.6/bin/java


==
==
STORE: saving artifacts
==
==


mv: cannot stat `build/*.tar.gz': No such file or directory
mv: cannot stat `build/*.jar': No such file or directory
mv: cannot stat `build/test/findbugs': No such file or directory
mv: cannot stat `build/docs/api': No such file or directory
Build Failed
[FINDBUGS] Skipping publisher since build result is FAILURE
[WARNINGS] Skipping publisher since build result is FAILURE
Recording fingerprints
Archiving artifacts
Recording test results
Publishing Javadoc
Updating ZOOKEEPER-1090
Updating ZOOKEEPER-1138
Updating ZOOKEEPER-1139
Updating ZOOKEEPER-1076
Publishing Clover coverage report...
No Clover report will be published due to a Build Failure
Email was triggered for: Failure
Sending email for trigger: Failure



###
## FAILED TESTS (if any) 
##
No tests ran.


[jira] [Commented] (ZOOKEEPER-1138) release audit failing for a number of new files

2011-08-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13082062#comment-13082062
 ] 

Hudson commented on ZOOKEEPER-1138:
---

Integrated in ZooKeeper-trunk #1258 (See 
[https://builds.apache.org/job/ZooKeeper-trunk/1258/])
ZOOKEEPER-1138. release audit failing for a number of new files. (phunt via 
mahadev) - forgot file
ZOOKEEPER-1138. release audit failing for a number of new files. (phunt via 
mahadev)

mahadev : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1152033
Files : 
* /zookeeper/trunk/src/c/NOTICE.txt

mahadev : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1152018
Files : 
* 
/zookeeper/trunk/src/java/test/org/apache/zookeeper/server/quorum/LearnerTest.java
* /zookeeper/trunk/src/contrib/huebrowser/zkui/src/zkui/static/js/package.yml
* /zookeeper/trunk/src/contrib/zooinspector/config/defaultNodeVeiwers.cfg
* /zookeeper/trunk/NOTICE.txt
* 
/zookeeper/trunk/src/java/test/org/apache/zookeeper/MultiTransactionRecordTest.java
* /zookeeper/trunk/CHANGES.txt
* /zookeeper/trunk/build.xml
* /zookeeper/trunk/src/contrib/huebrowser/zkui/src/zkui/static/help/index.html
* /zookeeper/trunk/src/contrib/zooinspector/config/defaultConnectionSettings.cfg


 release audit failing for a number of new files
 ---

 Key: ZOOKEEPER-1138
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1138
 Project: ZooKeeper
  Issue Type: Bug
Affects Versions: 3.4.0
Reporter: Patrick Hunt
Assignee: Patrick Hunt
Priority: Blocker
 Fix For: 3.4.0

 Attachments: ZOOKEEPER-1138.patch


 I'm seeing a number of problems in the release audit output for 3.4.0, these 
 must be fixed before 3.4.0 release:
 {noformat}
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/contrib/ZooInspector/config/defaultConnectionSettings.cfg
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/contrib/ZooInspector/config/defaultNodeVeiwers.cfg
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/contrib/ZooInspector/licences/epl-v10.html
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/c/Cli.vcproj
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/c/include/winconfig.h
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/c/include/winstdint.h
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/c/zookeeper.sln
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/c/zookeeper.vcproj
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/contrib/huebrowser/zkui/src/zkui/static/help/index.html
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/contrib/huebrowser/zkui/src/zkui/static/js/package.yml
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/contrib/loggraph/web/org/apache/zookeeper/graph/log4j.properties
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/contrib/loggraph/web/org/apache/zookeeper/graph/resources/date.format.js
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/contrib/loggraph/web/org/apache/zookeeper/graph/resources/g.bar.js
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/contrib/loggraph/web/org/apache/zookeeper/graph/resources/g.dot.js
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/contrib/loggraph/web/org/apache/zookeeper/graph/resources/g.line.js
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/contrib/loggraph/web/org/apache/zookeeper/graph/resources/g.pie.js
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/contrib/loggraph/web/org/apache/zookeeper/graph/resources/g.raphael.js
 [rat:report]  !? 
 /grid/0/hudson/hudson-slave/workspace/PreCommit-ZOOKEEPER-Build/trunk/build/zookeeper-3.4.0/src/contrib/loggraph/web/org/apache/zookeeper/graph/resources/raphael.js
 [rat:report]  

[jira] [Commented] (ZOOKEEPER-1076) some quorum tests are unnecessarily extending QuorumBase

2011-08-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1076?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13082064#comment-13082064
 ] 

Hudson commented on ZOOKEEPER-1076:
---

Integrated in ZooKeeper-trunk #1258 (See 
[https://builds.apache.org/job/ZooKeeper-trunk/1258/])
ZOOKEEPER-1076. some quorum tests are unnecessarily extending QuorumBase 
(phunt via mahadev)

mahadev : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1152141
Files : 
* 
/zookeeper/trunk/src/java/test/org/apache/zookeeper/test/FollowerResyncConcurrencyTest.java
* /zookeeper/trunk/src/java/test/org/apache/zookeeper/test/QuorumTest.java
* /zookeeper/trunk/CHANGES.txt
* 
/zookeeper/trunk/src/java/test/org/apache/zookeeper/test/ZkDatabaseCorruptionTest.java
* /zookeeper/trunk/src/java/test/org/apache/zookeeper/test/QuorumHammerTest.java
* /zookeeper/trunk/src/java/test/org/apache/zookeeper/test/ReadOnlyModeTest.java


 some quorum tests are unnecessarily extending QuorumBase
 

 Key: ZOOKEEPER-1076
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1076
 Project: ZooKeeper
  Issue Type: Bug
  Components: tests
Affects Versions: 3.4.0
Reporter: Patrick Hunt
Assignee: Patrick Hunt
Priority: Minor
 Fix For: 3.4.0

 Attachments: ZOOKEEPER-1076.patch, ZOOKEEPER-1076.patch


 Some tests are unnecessarily extending QuorumBase. Typically this is not a 
 big issue, but it may cause more servers than necessary to be started (harder 
 to debug a failing test in particular).

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (ZOOKEEPER-1139) jenkins is reporting two warnings, fix these

2011-08-09 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1139?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13082063#comment-13082063
 ] 

Hudson commented on ZOOKEEPER-1139:
---

Integrated in ZooKeeper-trunk #1258 (See 
[https://builds.apache.org/job/ZooKeeper-trunk/1258/])
ZOOKEEPER-1139. jenkins is reporting two warnings, fix these (phunt via 
mahadev)

mahadev : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVNview=revrev=1152036
Files : 
* /zookeeper/trunk/src/java/main/org/apache/zookeeper/Op.java
* /zookeeper/trunk/CHANGES.txt
* /zookeeper/trunk/src/java/main/org/apache/zookeeper/ClientWatchManager.java


 jenkins is reporting two warnings, fix these
 

 Key: ZOOKEEPER-1139
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1139
 Project: ZooKeeper
  Issue Type: Bug
Affects Versions: 3.4.0
Reporter: Patrick Hunt
Assignee: Patrick Hunt
Priority: Minor
 Fix For: 3.4.0

 Attachments: ZOOKEEPER-1139.patch


 cleanup jenkins report, currently 2 compiler warnings being reported.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (ZOOKEEPER-1145) ObserverTest.testObserver fails at particular point after several runs of ant junt.run -Dtestcase=ObserverTest

2011-08-09 Thread Mahadev konar (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13082109#comment-13082109
 ] 

Mahadev konar commented on ZOOKEEPER-1145:
--

thats great news!

 ObserverTest.testObserver fails at particular point after several runs of ant 
 junt.run -Dtestcase=ObserverTest
 --

 Key: ZOOKEEPER-1145
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1145
 Project: ZooKeeper
  Issue Type: Bug
Affects Versions: 3.4.0
Reporter: Eugene Koontz
Assignee: Vishal Kher
Priority: Blocker
 Fix For: 3.4.0

 Attachments: out.txt, repeat.sh


 Use the attached repeat.sh to run ObserverTest repeatedly by doing: 
 src/repeat.sh ObserverTest
 The test will will fail eventually after a few iterations; should be only a 
 few minutes.
 The line that fails in the test is: 
 zk = new ZooKeeper(127.0.0.1: + CLIENT_PORT_OBS,
 ClientBase.CONNECTION_TIMEOUT, this);
 Attached as out.txt is the output showing a successful run, for comparison, 
 followed by a failed run.
 Note that in the seconds before the test fails, in the following lines, that 
 there is a 24 second gap in time (between 22:13:02 and 22:13:26):
 bq.
 [junit] 2011-08-03 22:13:02,167 [myid:3] - INFO  
 [NIOServerCxn.Factory:0.0.0.0/0.0.0.0:11229:ZooKeeperServer@833] - Client 
 attempting to establish new session at /127.0.0.1:46929
 [junit] 2011-08-03 22:13:26,003 [myid:2] - INFO  
 [QuorumPeer[myid=2]/0:0:0:0:0:0:0:0:11228:Leader@419] - Shutting down
 [junit] 2011-08-03 22:13:26,003 [myid:2] - INFO  
 [QuorumPeer[myid=2]/0:0:0:0:0:0:0:0:11228:Leader@425] - Shutdown called
 [junit] java.lang.Exception: shutdown Leader! reason: Only 0 followers, need 1

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira




Re: Hadoop build machine update

2011-08-09 Thread Mahadev Konar
Thanks a lot Giri.

I see something for ZK builds. Hopefully they'll be functional tomm.

thanks
mahadev



On Tue, Aug 9, 2011 at 7:35 PM, Giridharan Kesavan gkesa...@hortonworks.com
 wrote:

 All,

 All the hudson slaves are back online. I was working on getting the build
 tools installed on all the new slaves and it looks like Its going to take
 some more time to change the build job configs before I could enable the
 pre-commit jobs.

 Will get the precommit and nightly build jobs back online tomorrow.

 -Giri


 On Tue, Aug 2, 2011 at 11:45 PM, Nigel Daley nda...@mac.com wrote:

  Great!  Thanks for getting the info.
 
  Cheers,
  n.
 
  On Aug 2, 2011, at 8:05 PM, Eric Baldeschwieler wrote:
 
   Hi Folks,
  
   I've talked to the folks at Yahoo about the build machines and am happy
  to report that an end to the blackout is in sight.
  
   1) They need to reimage the machines, which is in progress and machines
  should be restored within a week (pessimistically).
  
   2) They plan to reach out to the apache infrastructure team and change
  the admin of these machines to remove yahoo from the admin loop.  This
  should avoid future outages and give apache more flexibility in managing
 the
  machines.
  
   I'm told they will be posting more details today or tomorrow.  Everyone
  there is committed to maintaining quality support for Apache and is
  concerned about the outage and making sure it does not repeat.
  
   Thanks,
  
   E14
 
 



RE: Hadoop build machine update

2011-08-09 Thread Laxman
Hi Mahadev,

Do we need to resubmit the patches which were uploaded last week?
Or Hudson will pick them up automatically?


-Original Message-
From: Mahadev Konar [mailto:maha...@hortonworks.com] 
Sent: Wednesday, August 10, 2011 8:14 AM
To: dev@zookeeper.apache.org; Giridharan Kesavan
Subject: Re: Hadoop build machine update

Thanks a lot Giri.

I see something for ZK builds. Hopefully they'll be functional tomm.

thanks
mahadev



On Tue, Aug 9, 2011 at 7:35 PM, Giridharan Kesavan gkesa...@hortonworks.com
 wrote:

 All,

 All the hudson slaves are back online. I was working on getting the build
 tools installed on all the new slaves and it looks like Its going to take
 some more time to change the build job configs before I could enable the
 pre-commit jobs.

 Will get the precommit and nightly build jobs back online tomorrow.

 -Giri


 On Tue, Aug 2, 2011 at 11:45 PM, Nigel Daley nda...@mac.com wrote:

  Great!  Thanks for getting the info.
 
  Cheers,
  n.
 
  On Aug 2, 2011, at 8:05 PM, Eric Baldeschwieler wrote:
 
   Hi Folks,
  
   I've talked to the folks at Yahoo about the build machines and am
happy
  to report that an end to the blackout is in sight.
  
   1) They need to reimage the machines, which is in progress and
machines
  should be restored within a week (pessimistically).
  
   2) They plan to reach out to the apache infrastructure team and change
  the admin of these machines to remove yahoo from the admin loop.  This
  should avoid future outages and give apache more flexibility in managing
 the
  machines.
  
   I'm told they will be posting more details today or tomorrow.
Everyone
  there is committed to maintaining quality support for Apache and is
  concerned about the outage and making sure it does not repeat.
  
   Thanks,
  
   E14
 
 




[jira] [Commented] (ZOOKEEPER-1140) server shutdown is not stopping threads

2011-08-09 Thread Laxman (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13082146#comment-13082146
 ] 

Laxman commented on ZOOKEEPER-1140:
---

Thread [Sender-/127.0.0.1:3084] (Running)

When I observed QuorumZxidSyncTest, the above thread is also leaking in a 
different scenario apart from ReadOnlyZKServer. Above mentioned thread will be 
shutdown only if it receives proposalOfDeath.

We are queuing proposalOfDeath in finally block of LearnerHandler.run().

To summarize, proposalOfDeath may not queued up when LearnerHandler receives 
IOException and other thread calling LearnerHandler.shutdown(). This leads to 
failure of queuing the proposalOfDeath.

To solve this, can we add the proposalOfDeath in shutdown() rather in finally 
block.

Anyways, this finding will solve one of the leaks. We still need to fix other 
leaks caused by ROZK.

{quote}Just to see if I understand now, are you saying that the test is 
spawning over one hundred servers because we are shutting before the server 
actually starts? If so, then it is certainly a problem.{quote}

Yes. I will add more analysis and possibly patch today.

 server shutdown is not stopping threads
 ---

 Key: ZOOKEEPER-1140
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1140
 Project: ZooKeeper
  Issue Type: Bug
  Components: server, tests
Affects Versions: 3.4.0
Reporter: Patrick Hunt
Priority: Blocker
 Fix For: 3.4.0


 Near the end of QuorumZxidSyncTest there are tons of threads running - 115 
 ProcessThread threads, similar numbers of SessionTracker.
 Also I see ~100 ReadOnlyRequestProcessor - why is this running as a separate 
 thread? (henry/flavio?)

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira