[jira] [Commented] (ZOOKEEPER-1412) java client watches inconsistently triggered on reconnect

2012-03-13 Thread Benjamin Reed (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13228483#comment-13228483
 ] 

Benjamin Reed commented on ZOOKEEPER-1412:
--

i agree with botond that we should also fix the java client to update lastzxid 
on every message. it might be worth doing in a separate jira.

 java client watches inconsistently triggered on reconnect
 -

 Key: ZOOKEEPER-1412
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1412
 Project: ZooKeeper
  Issue Type: Bug
  Components: server
Affects Versions: 3.3.3, 3.3.4, 3.4.0, 3.4.1, 3.4.2, 3.4.3
Reporter: Botond Hejj
Assignee: Patrick Hunt
Priority: Blocker
 Fix For: 3.3.5, 3.4.4, 3.5.0

 Attachments: ZOOKEEPER-1412_br33.patch, ZOOKEEPER-1412_br33.patch, 
 ZOOKEEPER-1412_br34.patch, ZOOKEEPER-1412_br34.patch, 
 ZOOKEEPER-1412_trunk.patch, ZOOKEEPER-1412_trunk.patch


 I've observed an inconsistent behavior in java client watches. The 
 inconsistency relates to the behavior after the client reconnects to the 
 zookeeper ensemble.
 After the client reconnects to the ensemble only those watches should trigger 
 which should have been triggered also if the connections was not lost. This 
 means if I watch for changes in node /foo and there is no change there than 
 my watch should not be triggered on reconnecting to the ensemble.
 This is not always the case in the java client.
 I've debugged the issues and I could locate the case when the watch is always 
 triggered on reconnect. This is consistently happening if I connect to a 
 follower in the ensemble and I don't do any operation which goes through the 
 leader.
 Looking at the code I see that the client stores the lastzxid and sends that 
 with its request. This is 0 on startup and will be updated everytime from the 
 server replies. This lastzxid is also sent to the server after reconnect 
 together with watches. The server decides which watch to trigger based on 
 this lastzxid probably because that should mean the last known state of the 
 client. If this lastzxid is 0 than all the watches are triggered.
 I've checked why is this lastzxid 0. I thought it shouldn't be since there 
 was already a request to the server to set the watch and in the reply the 
 server could have sent back the zxid but it turns out that it sends just 0. 
 Looking at the server code I see that for requests which doesn't go through 
 the leader the follower server just sends back the same zxid that the client 
 sent.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (ZOOKEEPER-1344) ZooKeeper client multi-update command is not considering the Chroot request

2012-03-13 Thread Hadoop QA (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13228485#comment-13228485
 ] 

Hadoop QA commented on ZOOKEEPER-1344:
--

+1 overall.  Here are the results of testing the latest attachment 
  
http://issues.apache.org/jira/secure/attachment/12518195/ZOOKEEPER-1344_trunk.patch
  against trunk revision 1297740.

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

+1 findbugs.  The patch does not introduce any new Findbugs (version 1.3.9) 
warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed core unit tests.

+1 contrib tests.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/992//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/992//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
Console output: 
https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/992//console

This message is automatically generated.

 ZooKeeper client multi-update command is not considering the Chroot request
 ---

 Key: ZOOKEEPER-1344
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1344
 Project: ZooKeeper
  Issue Type: Bug
  Components: java client
Affects Versions: 3.4.0
Reporter: Rakesh R
Assignee: Rakesh R
Priority: Critical
 Fix For: 3.5.0

 Attachments: ZOOKEEPER-1344-onlytestcase.patch, 
 ZOOKEEPER-1344.1.patch, ZOOKEEPER-1344.patch, 
 ZOOKEEPER-1344_branch_3.4.patch, ZOOKEEPER-1344_trunk.patch


 For example: 
 I have created a ZooKeeper client with subtree as 10.18.52.144:2179/apps/X. 
 Now just generated OP command for the creation of zNode /myId. When the 
 client creates the path /myid, the ZooKeeper server is actually be creating 
 the path as /myid instead of creating as /apps/X/myid
 Expected output: zNode has to be created as /apps/X/myid

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




Success: ZOOKEEPER-1344 PreCommit Build #992

2012-03-13 Thread Apache Jenkins Server
Jira: https://issues.apache.org/jira/browse/ZOOKEEPER-1344
Build: https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/992/

###
## LAST 60 LINES OF THE CONSOLE 
###
[...truncated 168250 lines...]
 [exec] BUILD SUCCESSFUL
 [exec] Total time: 0 seconds
 [exec] 
 [exec] 
 [exec] 
 [exec] 
 [exec] +1 overall.  Here are the results of testing the latest attachment 
 [exec]   
http://issues.apache.org/jira/secure/attachment/12518195/ZOOKEEPER-1344_trunk.patch
 [exec]   against trunk revision 1297740.
 [exec] 
 [exec] +1 @author.  The patch does not contain any @author tags.
 [exec] 
 [exec] +1 tests included.  The patch appears to include 3 new or 
modified tests.
 [exec] 
 [exec] +1 javadoc.  The javadoc tool did not generate any warning 
messages.
 [exec] 
 [exec] +1 javac.  The applied patch does not increase the total number 
of javac compiler warnings.
 [exec] 
 [exec] +1 findbugs.  The patch does not introduce any new Findbugs 
(version 1.3.9) warnings.
 [exec] 
 [exec] +1 release audit.  The applied patch does not increase the 
total number of release audit warnings.
 [exec] 
 [exec] +1 core tests.  The patch passed core unit tests.
 [exec] 
 [exec] +1 contrib tests.  The patch passed contrib unit tests.
 [exec] 
 [exec] Test results: 
https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/992//testReport/
 [exec] Findbugs warnings: 
https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/992//artifact/trunk/build/test/findbugs/newPatchFindbugsWarnings.html
 [exec] Console output: 
https://builds.apache.org/job/PreCommit-ZOOKEEPER-Build/992//console
 [exec] 
 [exec] This message is automatically generated.
 [exec] 
 [exec] 
 [exec] 
==
 [exec] 
==
 [exec] Adding comment to Jira.
 [exec] 
==
 [exec] 
==
 [exec] 
 [exec] 
 [exec] Comment added.
 [exec] 8D4ghhhRDE logged out
 [exec] 
 [exec] 
 [exec] 
==
 [exec] 
==
 [exec] Finished build.
 [exec] 
==
 [exec] 
==
 [exec] 
 [exec] 

BUILD SUCCESSFUL
Total time: 24 minutes 43 seconds
Archiving artifacts
Recording test results
Description set: ZOOKEEPER-1344
Email was triggered for: Success
Sending email for trigger: Success



###
## FAILED TESTS (if any) 
##
All tests passed

[jira] [Created] (ZOOKEEPER-1413) Use on-disk transaction log for learner sync up

2012-03-13 Thread Thawan Kooburat (Created) (JIRA)
Use on-disk transaction log for learner sync up
---

 Key: ZOOKEEPER-1413
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1413
 Project: ZooKeeper
  Issue Type: Improvement
  Components: server
Affects Versions: 3.4.3
Reporter: Thawan Kooburat
Priority: Minor


Motivation:
The learner syncs up with leader by retrieving committed log from the leader. 
Currently, the leader only keeps 500 entries of recently committed log in 
memory. If the learner falls behind more than 500 updates, the leader will send 
the entire snapshot to the learner. 

With the size of the snapshot for some of our Zookeeper deployments (~10G), it 
is prohibitively expensive to send the entire snapshot over network. 
Additionally, our Zookeeper may serve more than 4K updates per seconds. As a 
result, a network hiccups for less than a second will cause the learner to use 
snapshot transfer.

Design:
Instead of looking only at committed log in memory, the leader will also look 
at transaction log on disk. The amount of transaction log kept on disk is 
configurable and the current default is 100k. This will allow Zookeeper to 
tolerate longer temporal network failure before initiating the snapshot 
transfer.  

Implementation:
We plan to add interface to the persistence layer will can be use to retrieve 
proposals from on-disk transaction log. These proposals can then be used to 
send to the learner using existing protocol. 


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (ZOOKEEPER-1355) Add zk.updateServerList(newServerList)

2012-03-13 Thread Marshall McMullen (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-1355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marshall McMullen updated ZOOKEEPER-1355:
-

Attachment: ZOOKEEPER-1355-ver10-4.patch

Uploading a new version that fixes the C code to properly force a disconnect if 
the list of servers is modified and the host we were connected to is not in the 
list anymore.

 Add zk.updateServerList(newServerList) 
 ---

 Key: ZOOKEEPER-1355
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-1355
 Project: ZooKeeper
  Issue Type: New Feature
  Components: java client
Reporter: Alexander Shraer
Assignee: Alexander Shraer
 Fix For: 3.5.0

 Attachments: ZOOKEEPER-1355-ver10-1.patch, 
 ZOOKEEPER-1355-ver10-2.patch, ZOOKEEPER-1355-ver10-3.patch, 
 ZOOKEEPER-1355-ver10-4.patch, ZOOKEEPER-1355-ver10-4.patch, 
 ZOOKEEPER-1355-ver10.patch, ZOOKEEPER-1355-ver2.patch, 
 ZOOKEEPER-1355-ver4.patch, ZOOKEEPER-1355-ver5.patch, 
 ZOOKEEPER-1355-ver6.patch, ZOOKEEPER-1355-ver7.patch, 
 ZOOKEEPER-1355-ver8.patch, ZOOKEEPER-1355-ver9-1.patch, 
 ZOOKEEPER-1355-ver9.patch, ZOOKEEPER=1355-ver3.patch, 
 ZOOOKEEPER-1355-test.patch, ZOOOKEEPER-1355-ver1.patch, 
 ZOOOKEEPER-1355.patch, loadbalancing-more-details.pdf, loadbalancing.pdf


 When the set of servers changes, we would like to update the server list 
 stored by clients without restarting the clients.
 Moreover, assuming that the number of clients per server is the same (in 
 expectation) in the old configuration (as guaranteed by the current list 
 shuffling for example), we would like to re-balance client connections across 
 the new set of servers in a way that a) the number of clients per server is 
 the same for all servers (in expectation) and b) there is no 
 excessive/unnecessary client migration.
 It is simple to achieve (a) without (b) - just re-shuffle the new list of 
 servers at every client. But this would create unnecessary migration, which 
 we'd like to avoid.
 We propose a simple probabilistic migration scheme that achieves (a) and (b) 
 - each client locally decides whether and where to migrate when the list of 
 servers changes. The attached document describes the scheme and shows an 
 evaluation of it in Zookeeper. We also implemented re-balancing through a 
 consistent-hashing scheme and show a comparison. We derived the probabilistic 
 migration rules from a simple formula that we can also provide, if someone's 
 interested in the proof.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (BOOKKEEPER-163) Prevent incorrect NoSuchLedgerException for readLastConfirmed.

2012-03-13 Thread Sijie Guo (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/BOOKKEEPER-163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13228247#comment-13228247
 ] 

Sijie Guo commented on BOOKKEEPER-163:
--

committed as r1299984. Great work, Thanks Ivan.

 Prevent incorrect NoSuchLedgerException for readLastConfirmed.
 --

 Key: BOOKKEEPER-163
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-163
 Project: Bookkeeper
  Issue Type: Bug
  Components: bookkeeper-client, bookkeeper-server
Affects Versions: 4.0.0
Reporter: Sijie Guo
Assignee: Ivan Kelly
 Fix For: 4.1.0

 Attachments: BOOKKEEPER-163.diff, BOOKKEEPER-163.diff, 
 BOOKKEEPER-163.diff, BOOKKEEPER-163.diff


 bookkeeper client treats NoSuchLedgerException as valid response when reading 
 last confirmed. If NoSuchLedgerException is caused due to an empty directory 
 in following cases, it is an incorrect response. 
 1) A disk is replaced or ledger index is removed by a sloppy admin.
 2) A disk is not mounted when a bookie machine is restarted.
 We need a mechanism to prevent such incorrect responses.
 Ivan suggested to generate a instance key for each bookie and write it into 
 the ledger directories. If a directory doesn't have the key, and other 
 directories do, then it shouldn't start. This would also resolve the issue 
 that someone starting a new bookie with the same IP as a bookie which has 
 previously died.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Created] (BOOKKEEPER-184) CompactionTest failing on Jenkins

2012-03-13 Thread Sijie Guo (Created) (JIRA)
CompactionTest failing on Jenkins
-

 Key: BOOKKEEPER-184
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-184
 Project: Bookkeeper
  Issue Type: Bug
  Components: bookkeeper-server
Reporter: Sijie Guo
Assignee: Sijie Guo
 Fix For: 4.1.0


the compaction test hanging on Jenkins. 

Ivan did some investigation on this issue, and found that it wasn't hanging, it 
was just taking a really long time. He suggested that it is because it does a 
lot of I/O, theres a couple of ways we could reduce this.
1. Only create 1 bookie
2. Not inherit from BaseTestCase
3. Reduce the number of entries

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (BOOKKEEPER-25) Implementation of the JMS specification and API

2012-03-13 Thread Flavio Junqueira (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/BOOKKEEPER-25?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Flavio Junqueira updated BOOKKEEPER-25:
---

Fix Version/s: (was: 4.1.0)
   4.2.0

 Implementation of the JMS specification and API
 ---

 Key: BOOKKEEPER-25
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-25
 Project: Bookkeeper
  Issue Type: Task
Reporter: Matthieu Morel
 Fix For: 4.2.0

 Attachments: JMS implementation.pdf, JMS implementation.pdf, 
 hedwigjms-milestone1preview.pdf


 Here is a proposal for implementing JMS on top of Hedwig.
 Overview:
 -
 After comparing the JMS spec and Hedwig's current implementation, it appears 
 that Hedwig provides most concepts of the JMS spec. The work will therefore 
 mostly be implementing wrappers around Hedwig's primitives.
 There is no 1 to 1 mapping most of the time though, and there are some 
 subtleties in the JMS spec that we must be cautious about. There are also 
 things that don't exist in Hedwig and that we'll have to add. 
 For instance, administered objects, the different kinds of standard 
 messages, or a parser for the SQL-like selector queries. These are marked 
 with a triangular flag on the attached mindmap.
 The JMS spec also specifies support for distributed transactions (XA) but 
 this part is optional. I would suggest to leave that for later, if there is 
 any request for it.
 Design:
 ---
 I suggest to separate all JMS-related code in a separate project (as 
 hedwig-client and hedwig-protocol). 
 We'll therefore make sure that Hedwig-server is JMS agnostic, and it will 
 later be possible to use the same approach for implementing AMQP or JMS2, 
 without mixing all concepts.
 Development:
 ---
 An iterative approach is well suited. We could start with fundamentals, and 
 progressively add some meat: 
 1. messages, connections and sessions
 2. basic messaging with selectors
 3. all cases of messaging models
 4. validation tests
 5. performance tests (I checked and unfortunately, there does not seem to be 
 any free standard for benchmarking jms implementation...)
 Each iteration would involve adding regression tests and doing a review.
 About the time this would take:
 My rough estimation (not being very familiar with Hedwig's codebase): in the 
 order 1 or 2 weeks for each of these iterations.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (BOOKKEEPER-56) Race condition of message handler in connection recovery in Hedwig client

2012-03-13 Thread Flavio Junqueira (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/BOOKKEEPER-56?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13228308#comment-13228308
 ] 

Flavio Junqueira commented on BOOKKEEPER-56:


Should we move this one to 4.2.0?

 Race condition of message handler in connection recovery in Hedwig client
 -

 Key: BOOKKEEPER-56
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-56
 Project: Bookkeeper
  Issue Type: Bug
  Components: hedwig-client
Affects Versions: 4.0.0
Reporter: Gavin Li
Assignee: Gavin Li
 Fix For: 4.1.0

 Attachments: patch_56


 There's a race condition in the connection recovery logic in Hedwig client. 
 The message handler user set might be overwritten incorrectly. 
 When handling channelDisconnected event, we try to reconnect to Hedwig 
 server. After the connection is created and subscribed, we'll call 
 StartDelivery() to recover the message handler to the original one of the 
 disconnected connection. But if during this process, user calls 
 StartDelivery() to set a new message handler, it will get overwritten to the 
 original one.
 The process can be demonstrated as below:
 || main thread || netty worker thread ||
 | StartDelivery(messageHandlerA) | |
 | (connection Broken here, and recovered later...) |
 | | 
 ResponseHandler::channelDisconnected()   (connection disconnected event 
 received) |
 | | new 
 SubscribeReconnectCallback(subHandler.getMessageHandler()) (store 
 messageHandlerA in SubscribeReconnectCallback to recover later) |
 | | client.doConnect() (try 
 reconnect)  |
 | | doSubUnsub() (resubscribe) |
 | | 
 SubscriberResponseHandler::handleSubscribeResponse()  (subscription succeeds) 
 |
 | StartDelivery(messageHandlderB) |   
 |
 | | 
 SubscribeReconnectCallback::operationFinished()   
 |
 | | StartDelvery(messageHandlerA) 
   (messageHandler get overwritten)  |   
 I can stably reproduce this by simulating this race condition by put some 
 sleep in ResponseHandler.
 I think essentially speaking we should not store messageHandler in 
 ResponseHandler, since the message handler is supposed to be bound to 
 connection. Instead, no matter which connection is in use, we should use the 
 same messageHandler, the one user set last time. So I think we should change 
 to store messageHandler in the HedwigSubscriber, in this way we don't need to 
 recover the handler in connection recovery and thus won't face this race 
 condition.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (BOOKKEEPER-168) Message bounding on subscriptions

2012-03-13 Thread Ivan Kelly (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/BOOKKEEPER-168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ivan Kelly updated BOOKKEEPER-168:
--

Attachment: BOOKKEEPER-168.diff

Added new subscribe method, which takes SubscriptionOptions. This can be 
extended in the future to take more options if this is ever needed.

 Message bounding on subscriptions
 -

 Key: BOOKKEEPER-168
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-168
 Project: Bookkeeper
  Issue Type: New Feature
Reporter: Ivan Kelly
Assignee: Ivan Kelly
 Fix For: 4.1.0

 Attachments: BOOKKEEPER-168.diff, BOOKKEEPER-168.diff, 
 BOOKKEEPER-168.diff


 In hedwig, messages for a subscription will queue up forever if the 
 subscriber is offline. In some usecases, this is undesirable, as it will 
 eventually mean resource exhaustion. In this JIRA we propose an optional 
 change to the subscription contract, which allows the user to set a bound on 
 the number of messages which will be queued for its subscription while it is 
 offline.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (BOOKKEEPER-168) Message bounding on subscriptions

2012-03-13 Thread jirapos...@reviews.apache.org (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/BOOKKEEPER-168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13228320#comment-13228320
 ] 

jirapos...@reviews.apache.org commented on BOOKKEEPER-168:
--


---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/3824/
---

(Updated 2012-03-13 11:16:09.577984)


Review request for bookkeeper.


Summary
---

In hedwig, messages for a subscription will queue up forever if the subscriber 
is offline. In some usecases, this is undesirable, as it will eventually mean 
resource exhaustion. In this JIRA we propose an optional change to the 
subscription contract, which allows the user to set a bound on the number of 
messages which will be queued for its subscription while it is offline.


This addresses bug BOOKKEEPER-168.
https://issues.apache.org/jira/browse/BOOKKEEPER-168


Diffs (updated)
-

  hedwig-client/src/main/cpp/inc/hedwig/client.h f37ef98 
  hedwig-client/src/main/cpp/inc/hedwig/subscribe.h 775a32c 
  hedwig-client/src/main/cpp/lib/client.cpp 6d70ad9 
  hedwig-client/src/main/cpp/lib/data.h b4e2c15 
  hedwig-client/src/main/cpp/lib/data.cpp a223120 
  hedwig-client/src/main/cpp/lib/subscriberimpl.h 1cad984 
  hedwig-client/src/main/cpp/lib/subscriberimpl.cpp 32075c4 
  hedwig-client/src/main/cpp/log4cpp.conf 5651e25 
  hedwig-client/src/main/cpp/scripts/log4cxx.conf fc1084c 
  hedwig-client/src/main/cpp/scripts/tester.sh 5df613a 
  hedwig-client/src/main/cpp/test/Makefile.am 62902e6 
  hedwig-client/src/main/cpp/test/main.cpp 3290af3 
  hedwig-client/src/main/cpp/test/messageboundtest.cpp PRE-CREATION 
  hedwig-client/src/main/java/org/apache/hedwig/client/api/Subscriber.java 
6910e7d 
  
hedwig-client/src/main/java/org/apache/hedwig/client/conf/ClientConfiguration.java
 66e049f 
  hedwig-client/src/main/java/org/apache/hedwig/client/data/PubSubData.java 
1b33c2f 
  
hedwig-client/src/main/java/org/apache/hedwig/client/netty/HedwigSubscriber.java
 5fb1d88 
  hedwig-protocol/src/main/java/org/apache/hedwig/protocol/PubSubProtocol.java 
f46868b 
  hedwig-protocol/src/main/protobuf/PubSubProtocol.proto e44d981 
  
hedwig-server/src/main/java/org/apache/hedwig/server/delivery/FIFODeliveryManager.java
 2c9af4d 
  
hedwig-server/src/main/java/org/apache/hedwig/server/persistence/BookkeeperPersistenceManager.java
 f1fd965 
  
hedwig-server/src/main/java/org/apache/hedwig/server/persistence/LocalDBPersistenceManager.java
 02ec607 
  
hedwig-server/src/main/java/org/apache/hedwig/server/persistence/PersistenceManager.java
 5c38ad9 
  
hedwig-server/src/main/java/org/apache/hedwig/server/persistence/ReadAheadCache.java
 0261521 
  
hedwig-server/src/main/java/org/apache/hedwig/server/regions/HedwigHubSubscriber.java
 40aa338 
  
hedwig-server/src/main/java/org/apache/hedwig/server/subscriptions/AbstractSubscriptionManager.java
 714a631 
  
hedwig-server/src/test/java/org/apache/hedwig/server/persistence/MessageBoundedPersistenceTest.java
 PRE-CREATION 
  
hedwig-server/src/test/java/org/apache/hedwig/server/persistence/StubPersistenceManager.java
 84b866d 

Diff: https://reviews.apache.org/r/3824/diff


Testing
---


Thanks,

Ivan



 Message bounding on subscriptions
 -

 Key: BOOKKEEPER-168
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-168
 Project: Bookkeeper
  Issue Type: New Feature
Reporter: Ivan Kelly
Assignee: Ivan Kelly
 Fix For: 4.1.0

 Attachments: BOOKKEEPER-168.diff, BOOKKEEPER-168.diff, 
 BOOKKEEPER-168.diff


 In hedwig, messages for a subscription will queue up forever if the 
 subscriber is offline. In some usecases, this is undesirable, as it will 
 eventually mean resource exhaustion. In this JIRA we propose an optional 
 change to the subscription contract, which allows the user to set a bound on 
 the number of messages which will be queued for its subscription while it is 
 offline.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (BOOKKEEPER-143) Add SSL support for hedwig cpp client

2012-03-13 Thread Flavio Junqueira (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/BOOKKEEPER-143?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Flavio Junqueira updated BOOKKEEPER-143:


Fix Version/s: (was: 4.1.0)
   4.2.0

Moving it to 4.2.0.

 Add SSL support for hedwig cpp client
 -

 Key: BOOKKEEPER-143
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-143
 Project: Bookkeeper
  Issue Type: Improvement
  Components: bookkeeper-client
Affects Versions: 4.0.0
Reporter: Sijie Guo
 Fix For: 4.2.0


 Currently Hedwig CPP client doesn't support SSL. 
 Example for boost::asio using SSL:
 http://www.boost.org/doc/libs/1_46_1/doc/html/boost_asio/example/ssl/client.cpp

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (BOOKKEEPER-184) CompactionTest failing on Jenkins

2012-03-13 Thread Ivan Kelly (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/BOOKKEEPER-184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13228324#comment-13228324
 ] 

Ivan Kelly commented on BOOKKEEPER-184:
---

Did you figure out why the test cases were not failing? Before this change, the 
disabling of compaction in #testDisableCompaction had no affect. However, the 
test still passed happily, which indicates that the test wasn't correct.

 CompactionTest failing on Jenkins
 -

 Key: BOOKKEEPER-184
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-184
 Project: Bookkeeper
  Issue Type: Bug
  Components: bookkeeper-server
Reporter: Sijie Guo
Assignee: Sijie Guo
 Fix For: 4.1.0

 Attachments: BK-184.diff


 the compaction test hanging on Jenkins. 
 Ivan did some investigation on this issue, and found that it wasn't hanging, 
 it was just taking a really long time. He suggested that it is because it 
 does a lot of I/O, theres a couple of ways we could reduce this.
 1. Only create 1 bookie
 2. Not inherit from BaseTestCase
 3. Reduce the number of entries

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




Review Request: BOOKKEEPER-96: extends zookeeper JMX to monitor and manage hedwig server

2012-03-13 Thread Sijie Guo

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/4307/
---

Review request for bookkeeper.


Summary
---

we can extends / reuses zookeeper JMX utils to monitor and manage hedwig server


This addresses bug BOOKKEEPER-96.
https://issues.apache.org/jira/browse/BOOKKEEPER-96


Diffs
-

  
hedwig-server/src/main/java/org/apache/hedwig/server/jmx/HedwigJMXService.java 
PRE-CREATION 
  hedwig-server/src/main/java/org/apache/hedwig/server/jmx/HedwigMBeanInfo.java 
PRE-CREATION 
  
hedwig-server/src/main/java/org/apache/hedwig/server/jmx/HedwigMBeanRegistry.java
 PRE-CREATION 

Diff: https://reviews.apache.org/r/4307/diff


Testing
---


Thanks,

Sijie



Review Request: BOOKKEEPER-97: collect pub/sub/consume statistics on hub server

2012-03-13 Thread Sijie Guo

---
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/4308/
---

Review request for bookkeeper.


Summary
---

collecting pub/sub/consume statistics on hub server and expose them thru JMX.

the patch is based on BOOKKEEPER-96, includes BOOKKEEPER-99, exposing 
statistics thru JMX. 


This addresses bug BOOKKEEPER-97.
https://issues.apache.org/jira/browse/BOOKKEEPER-97


Diffs
-

  
hedwig-server/src/main/java/org/apache/hedwig/server/delivery/FIFODeliveryManager.java
 2c9af4d 
  
hedwig-server/src/main/java/org/apache/hedwig/server/handlers/BaseHandler.java 
2573df0 
  
hedwig-server/src/main/java/org/apache/hedwig/server/handlers/ConsumeHandler.java
 1712027 
  
hedwig-server/src/main/java/org/apache/hedwig/server/handlers/NettyHandlerBean.java
 PRE-CREATION 
  
hedwig-server/src/main/java/org/apache/hedwig/server/handlers/NettyHandlerMXBean.java
 PRE-CREATION 
  
hedwig-server/src/main/java/org/apache/hedwig/server/handlers/PublishHandler.java
 7a785c4 
  
hedwig-server/src/main/java/org/apache/hedwig/server/handlers/SubscribeHandler.java
 f3b108a 
  
hedwig-server/src/main/java/org/apache/hedwig/server/handlers/UnsubscribeHandler.java
 09e8b92 
  hedwig-server/src/main/java/org/apache/hedwig/server/netty/PubSubServer.java 
27e3104 
  
hedwig-server/src/main/java/org/apache/hedwig/server/netty/PubSubServerBean.java
 PRE-CREATION 
  
hedwig-server/src/main/java/org/apache/hedwig/server/netty/PubSubServerMXBean.java
 PRE-CREATION 
  hedwig-server/src/main/java/org/apache/hedwig/server/netty/ServerStats.java 
PRE-CREATION 
  
hedwig-server/src/main/java/org/apache/hedwig/server/netty/UmbrellaHandler.java 
2f896d2 
  
hedwig-server/src/main/java/org/apache/hedwig/server/persistence/ReadAheadCache.java
 0261521 
  
hedwig-server/src/main/java/org/apache/hedwig/server/persistence/ReadAheadCacheBean.java
 PRE-CREATION 
  
hedwig-server/src/main/java/org/apache/hedwig/server/persistence/ReadAheadCacheMXBean.java
 PRE-CREATION 

Diff: https://reviews.apache.org/r/4308/diff


Testing
---


Thanks,

Sijie



[jira] [Updated] (BOOKKEEPER-55) SubscribeReconnectRetryTask might retry subscription endlessly when another subscription is already successfully created previously

2012-03-13 Thread Flavio Junqueira (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/BOOKKEEPER-55?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Flavio Junqueira updated BOOKKEEPER-55:
---

Fix Version/s: (was: 4.1.0)
   4.2.0

Moving it to 4.2.0.

 SubscribeReconnectRetryTask might retry subscription endlessly when another 
 subscription is already successfully created previously
 ---

 Key: BOOKKEEPER-55
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-55
 Project: Bookkeeper
  Issue Type: Bug
  Components: bookkeeper-client
Affects Versions: 4.0.0
Reporter: Gavin Li
Assignee: Gavin Li
 Fix For: 4.2.0

 Attachments: patch


 For channelDisconnected envent, we try to automatically recover the 
 connection and subscription. But when users call HedwigSubscriber.subscribe() 
 at the same time, it might succeed before the auto recovery. Then the auto 
 recovery can never succeed as the server will report topic busy failure. Then 
 the SubscribeReconnectRetryTask will retry again and again endlessly. We 
 found this in our auto test.
 Fix is easy, we just need to firstly check if the channel for this topic and 
 subscribe id is null, if not it means some subscription is already created 
 before, we don't need to bother recover.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




Question about jira

2012-03-13 Thread Flavio Junqueira
I have created a new release version (4.2.0) on jira and marked a bunch of 
issues for this version. When I go to issues, 4.2.0 does not show up and if I 
write a filter for it, it says that the there is no issue with fix version 
4.2.0. 

Does anyone know if this is a bug or if I need to change something else?

-Flavio

[jira] [Created] (BOOKKEEPER-185) Remove bookkeeper-server dependency on hadoop-common

2012-03-13 Thread Ivan Kelly (Created) (JIRA)
Remove bookkeeper-server dependency on hadoop-common


 Key: BOOKKEEPER-185
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-185
 Project: Bookkeeper
  Issue Type: Improvement
Reporter: Ivan Kelly
Assignee: Ivan Kelly
 Fix For: 4.1.0


bookkeeper-server depends on hadoop-common simply for the use of 1 class, 
HardLink. This clutters the pom.xml as a lot of stuff needs to be excluded from 
the transactive dependencies. It would be better just to copy HardLink.java 
into bookkeeper-server.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Updated] (BOOKKEEPER-185) Remove bookkeeper-server dependency on hadoop-common

2012-03-13 Thread Ivan Kelly (Updated) (JIRA)

 [ 
https://issues.apache.org/jira/browse/BOOKKEEPER-185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ivan Kelly updated BOOKKEEPER-185:
--

Attachment: BOOKKEEPER-185.diff

Patch copies the HardLink from hadoop-common 0.23.1. A small modification had 
to be made for makeShellPath.

 Remove bookkeeper-server dependency on hadoop-common
 

 Key: BOOKKEEPER-185
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-185
 Project: Bookkeeper
  Issue Type: Improvement
Reporter: Ivan Kelly
Assignee: Ivan Kelly
 Fix For: 4.1.0

 Attachments: BOOKKEEPER-185.diff


 bookkeeper-server depends on hadoop-common simply for the use of 1 class, 
 HardLink. This clutters the pom.xml as a lot of stuff needs to be excluded 
 from the transactive dependencies. It would be better just to copy 
 HardLink.java into bookkeeper-server.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (BOOKKEEPER-164) Add checksumming for ledger index files

2012-03-13 Thread Sijie Guo (Commented) (JIRA)

[ 
https://issues.apache.org/jira/browse/BOOKKEEPER-164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13228852#comment-13228852
 ] 

Sijie Guo commented on BOOKKEEPER-164:
--

sorry, I haven't finished this one. since it's an improvement to make system 
more robust, not a blocking issue. I think we can move it to 4.2.0.

 Add checksumming for ledger index files
 ---

 Key: BOOKKEEPER-164
 URL: https://issues.apache.org/jira/browse/BOOKKEEPER-164
 Project: Bookkeeper
  Issue Type: Improvement
  Components: bookkeeper-server
Affects Versions: 4.0.0
Reporter: Sijie Guo
 Fix For: 4.1.0


 now bookie ledger index files lacks checksumming to prevent 
 truncation/corruption. if a ledger index file is truncated, the ledger index 
 file still works but responds wrong response when reading last confirmed.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira