[jira] Commented: (ZOOKEEPER-792) zkpython memory leak

2010-08-19 Thread Lei Zhang (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12900563#action_12900563
 ] 

Lei Zhang commented on ZOOKEEPER-792:
-

Thanks a lot.


> zkpython memory leak
> 
>
> Key: ZOOKEEPER-792
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-792
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.3.1
> Environment: vmware workstation - guest OS:Linux python:2.4.3
>Reporter: Lei Zhang
>Assignee: Lei Zhang
> Fix For: 3.3.2, 3.4.0
>
> Attachments: ZOOKEEPER-792.patch, ZOOKEEPER-792.patch, 
> ZOOKEEPER-792.patch
>
>
> We recently upgraded zookeeper from 3.2.1 to 3.3.1, now we are seeing less 
> client deadlock on session expiration, which is a definite plus!
> Unfortunately we are seeing memory leak that requires our zk clients to be 
> restarted every half-day. Valgrind result:
> ==8804== 25 (12 direct, 13 indirect) bytes in 1 blocks are definitely lost in 
> loss record 255 of 670
> ==8804==at 0x4021C42: calloc (vg_replace_malloc.c:418)
> ==8804==by 0x5047B42: parse_acls (zookeeper.c:369)
> ==8804==by 0x5047EF6: pyzoo_create (zookeeper.c:1009)
> ==8804==by 0x40786CC: PyCFunction_Call (in /usr/lib/libpython2.4.so.1.0)
> ==8804==by 0x40B31DC: PyEval_EvalFrame (in /usr/lib/libpython2.4.so.1.0)
> ==8804==by 0x40B4485: PyEval_EvalCodeEx (in /usr/lib/libpython2.4.so.1.0)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (ZOOKEEPER-732) Improper translation of error into Python exception

2010-08-13 Thread Lei Zhang (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12898156#action_12898156
 ] 

Lei Zhang commented on ZOOKEEPER-732:
-

Thanks Andrei. Patch looks good to me.


> Improper translation of error into Python exception
> ---
>
> Key: ZOOKEEPER-732
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-732
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.3.0
>Reporter: Gustavo Niemeyer
>Assignee: Lei Zhang
>Priority: Minor
> Fix For: 3.3.2
>
> Attachments: ZOOKEEPER-732.patch, ZOOKEEPER-732.patch
>
>
> Apparently errors returned by the C library are not being correctly converted 
> into a Python exception in some cases: 
> >>> zookeeper.get_children(0, "/", None)
> Traceback (most recent call last):
>   File "", line 1, in 
> SystemError: error return without exception set

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-732) Improper translation of error into Python exception

2010-08-13 Thread Lei Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lei Zhang updated ZOOKEEPER-732:


   Status: Resolved  (was: Patch Available)
 Hadoop Flags: [Reviewed]
Fix Version/s: 3.3.2
   Resolution: Fixed

> Improper translation of error into Python exception
> ---
>
> Key: ZOOKEEPER-732
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-732
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.3.0
>Reporter: Gustavo Niemeyer
>Assignee: Lei Zhang
>Priority: Minor
> Fix For: 3.3.2
>
> Attachments: ZOOKEEPER-732.patch, ZOOKEEPER-732.patch
>
>
> Apparently errors returned by the C library are not being correctly converted 
> into a Python exception in some cases: 
> >>> zookeeper.get_children(0, "/", None)
> Traceback (most recent call last):
>   File "", line 1, in 
> SystemError: error return without exception set

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-732) Improper translation of error into Python exception

2010-08-13 Thread Lei Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lei Zhang updated ZOOKEEPER-732:


   Status: Patch Available  (was: Open)
 Release Note: Client that uses python binding may receive SystemError 
on session expiration.
Affects Version/s: 3.3.0
   (was: 3.2.2)

> Improper translation of error into Python exception
> ---
>
> Key: ZOOKEEPER-732
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-732
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.3.0
>Reporter: Gustavo Niemeyer
>Assignee: Lei Zhang
>Priority: Minor
> Attachments: ZOOKEEPER-732.patch, ZOOKEEPER-732.patch
>
>
> Apparently errors returned by the C library are not being correctly converted 
> into a Python exception in some cases: 
> >>> zookeeper.get_children(0, "/", None)
> Traceback (most recent call last):
>   File "", line 1, in 
> SystemError: error return without exception set

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-732) Improper translation of error into Python exception

2010-08-12 Thread Lei Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lei Zhang updated ZOOKEEPER-732:


Attachment: (was: ZOOKEEPER-732.patch)

> Improper translation of error into Python exception
> ---
>
> Key: ZOOKEEPER-732
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-732
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.2.2
>Reporter: Gustavo Niemeyer
>Assignee: Lei Zhang
>Priority: Minor
> Attachments: ZOOKEEPER-732.patch
>
>
> Apparently errors returned by the C library are not being correctly converted 
> into a Python exception in some cases: 
> >>> zookeeper.get_children(0, "/", None)
> Traceback (most recent call last):
>   File "", line 1, in 
> SystemError: error return without exception set

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-732) Improper translation of error into Python exception

2010-08-12 Thread Lei Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lei Zhang updated ZOOKEEPER-732:


Attachment: ZOOKEEPER-732.patch

> Improper translation of error into Python exception
> ---
>
> Key: ZOOKEEPER-732
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-732
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.2.2
>Reporter: Gustavo Niemeyer
>Assignee: Lei Zhang
>Priority: Minor
> Attachments: ZOOKEEPER-732.patch
>
>
> Apparently errors returned by the C library are not being correctly converted 
> into a Python exception in some cases: 
> >>> zookeeper.get_children(0, "/", None)
> Traceback (most recent call last):
>   File "", line 1, in 
> SystemError: error return without exception set

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (ZOOKEEPER-792) zkpython memory leak

2010-08-12 Thread Lei Zhang (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12897986#action_12897986
 ] 

Lei Zhang commented on ZOOKEEPER-792:
-

We've been using this patch in production (16-node cluster) for over a month. 
I'd like to have it go into never release. Can somebody please code review?

> zkpython memory leak
> 
>
> Key: ZOOKEEPER-792
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-792
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.3.1
> Environment: vmware workstation - guest OS:Linux python:2.4.3
>Reporter: Lei Zhang
>Assignee: Lei Zhang
> Fix For: 3.3.2, 3.4.0
>
> Attachments: ZOOKEEPER-792.patch
>
>
> We recently upgraded zookeeper from 3.2.1 to 3.3.1, now we are seeing less 
> client deadlock on session expiration, which is a definite plus!
> Unfortunately we are seeing memory leak that requires our zk clients to be 
> restarted every half-day. Valgrind result:
> ==8804== 25 (12 direct, 13 indirect) bytes in 1 blocks are definitely lost in 
> loss record 255 of 670
> ==8804==at 0x4021C42: calloc (vg_replace_malloc.c:418)
> ==8804==by 0x5047B42: parse_acls (zookeeper.c:369)
> ==8804==by 0x5047EF6: pyzoo_create (zookeeper.c:1009)
> ==8804==by 0x40786CC: PyCFunction_Call (in /usr/lib/libpython2.4.so.1.0)
> ==8804==by 0x40B31DC: PyEval_EvalFrame (in /usr/lib/libpython2.4.so.1.0)
> ==8804==by 0x40B4485: PyEval_EvalCodeEx (in /usr/lib/libpython2.4.so.1.0)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-732) Improper translation of error into Python exception

2010-08-12 Thread Lei Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lei Zhang updated ZOOKEEPER-732:


Attachment: ZOOKEEPER-732.patch

Attached is a patch that fixes issue 732. Can somebody please review?

> Improper translation of error into Python exception
> ---
>
> Key: ZOOKEEPER-732
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-732
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.2.2
>Reporter: Gustavo Niemeyer
>Assignee: Lei Zhang
>Priority: Minor
> Attachments: ZOOKEEPER-732.patch
>
>
> Apparently errors returned by the C library are not being correctly converted 
> into a Python exception in some cases: 
> >>> zookeeper.get_children(0, "/", None)
> Traceback (most recent call last):
>   File "", line 1, in 
> SystemError: error return without exception set

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Assigned: (ZOOKEEPER-732) Improper translation of error into Python exception

2010-08-12 Thread Lei Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lei Zhang reassigned ZOOKEEPER-732:
---

Assignee: Lei Zhang

> Improper translation of error into Python exception
> ---
>
> Key: ZOOKEEPER-732
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-732
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.2.2
>Reporter: Gustavo Niemeyer
>Assignee: Lei Zhang
>Priority: Minor
>
> Apparently errors returned by the C library are not being correctly converted 
> into a Python exception in some cases: 
> >>> zookeeper.get_children(0, "/", None)
> Traceback (most recent call last):
>   File "", line 1, in 
> SystemError: error return without exception set

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Resolved: (ZOOKEEPER-603) zkpython should do a better job of freeing memory under error conditions

2010-08-12 Thread Lei Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lei Zhang resolved ZOOKEEPER-603.
-

Resolution: Duplicate

> zkpython should do a better job of freeing memory under error conditions
> 
>
> Key: ZOOKEEPER-603
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-603
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.2.1
>Reporter: Henry Robinson
>Assignee: Lei Zhang
> Fix For: 3.4.0
>
>
> The general pattern is that the construction of a collection might fail, but 
> the module is not freeing the memory that it has already allocated. 
> Exceptions that are raised during this process aren't always propagated back 
> to the Python side either. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Assigned: (ZOOKEEPER-603) zkpython should do a better job of freeing memory under error conditions

2010-08-12 Thread Lei Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lei Zhang reassigned ZOOKEEPER-603:
---

Assignee: Lei Zhang  (was: Henry Robinson)

> zkpython should do a better job of freeing memory under error conditions
> 
>
> Key: ZOOKEEPER-603
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-603
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.2.1
>Reporter: Henry Robinson
>Assignee: Lei Zhang
> Fix For: 3.4.0
>
>
> The general pattern is that the construction of a collection might fail, but 
> the module is not freeing the memory that it has already allocated. 
> Exceptions that are raised during this process aren't always propagated back 
> to the Python side either. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-559) valgrind warnings running zkpython bindings

2010-08-12 Thread Lei Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lei Zhang updated ZOOKEEPER-559:


Assignee: Lei Zhang  (was: Henry Robinson)

Can we make this a duplicate of 792?

> valgrind warnings running zkpython bindings
> ---
>
> Key: ZOOKEEPER-559
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-559
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.3.0
>Reporter: Patrick Hunt
>Assignee: Lei Zhang
> Fix For: 3.4.0
>
> Attachments: valgrind-zk.tar.gz
>
>
> I'm seeing some weird behavior running zk-latencies.py
> http://github.com/phunt/zk-smoketest
> don't know if it's related to zkbindings itself, but I ran valgrind to see if 
> it noticed any issues. see attached.
> afaict these issues are related to zkpython binding, however I'm not sure. I 
> did run valgrind against the
> zookeeper c library tests and these issues were not highlighted. So I'm 
> thinking this is zkpython errors, however
> I'm not 100% sure. 
> Henry can you take a look?
>  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-792) zkpython memory leak

2010-06-24 Thread Lei Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lei Zhang updated ZOOKEEPER-792:


Status: Patch Available  (was: Open)

> zkpython memory leak
> 
>
> Key: ZOOKEEPER-792
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-792
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.3.1
> Environment: vmware workstation - guest OS:Linux python:2.4.3
>Reporter: Lei Zhang
> Fix For: 3.3.2, 3.4.0
>
> Attachments: ZOOKEEPER-792.patch
>
>
> We recently upgraded zookeeper from 3.2.1 to 3.3.1, now we are seeing less 
> client deadlock on session expiration, which is a definite plus!
> Unfortunately we are seeing memory leak that requires our zk clients to be 
> restarted every half-day. Valgrind result:
> ==8804== 25 (12 direct, 13 indirect) bytes in 1 blocks are definitely lost in 
> loss record 255 of 670
> ==8804==at 0x4021C42: calloc (vg_replace_malloc.c:418)
> ==8804==by 0x5047B42: parse_acls (zookeeper.c:369)
> ==8804==by 0x5047EF6: pyzoo_create (zookeeper.c:1009)
> ==8804==by 0x40786CC: PyCFunction_Call (in /usr/lib/libpython2.4.so.1.0)
> ==8804==by 0x40B31DC: PyEval_EvalFrame (in /usr/lib/libpython2.4.so.1.0)
> ==8804==by 0x40B4485: PyEval_EvalCodeEx (in /usr/lib/libpython2.4.so.1.0)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-792) zkpython memory leak

2010-06-24 Thread Lei Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lei Zhang updated ZOOKEEPER-792:


Attachment: ZOOKEEPER-792.patch

Plug memory leak in pyzoo_get() and pyzoo_create().

> zkpython memory leak
> 
>
> Key: ZOOKEEPER-792
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-792
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.3.1
> Environment: vmware workstation - guest OS:Linux python:2.4.3
>Reporter: Lei Zhang
> Fix For: 3.3.2, 3.4.0
>
> Attachments: ZOOKEEPER-792.patch
>
>
> We recently upgraded zookeeper from 3.2.1 to 3.3.1, now we are seeing less 
> client deadlock on session expiration, which is a definite plus!
> Unfortunately we are seeing memory leak that requires our zk clients to be 
> restarted every half-day. Valgrind result:
> ==8804== 25 (12 direct, 13 indirect) bytes in 1 blocks are definitely lost in 
> loss record 255 of 670
> ==8804==at 0x4021C42: calloc (vg_replace_malloc.c:418)
> ==8804==by 0x5047B42: parse_acls (zookeeper.c:369)
> ==8804==by 0x5047EF6: pyzoo_create (zookeeper.c:1009)
> ==8804==by 0x40786CC: PyCFunction_Call (in /usr/lib/libpython2.4.so.1.0)
> ==8804==by 0x40B31DC: PyEval_EvalFrame (in /usr/lib/libpython2.4.so.1.0)
> ==8804==by 0x40B4485: PyEval_EvalCodeEx (in /usr/lib/libpython2.4.so.1.0)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (ZOOKEEPER-792) zkpython memory leak

2010-06-24 Thread Lei Zhang (JIRA)
zkpython memory leak


 Key: ZOOKEEPER-792
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-792
 Project: Zookeeper
  Issue Type: Bug
  Components: contrib-bindings
Affects Versions: 3.3.1
 Environment: vmware workstation - guest OS:Linux python:2.4.3
Reporter: Lei Zhang


We recently upgraded zookeeper from 3.2.1 to 3.3.1, now we are seeing less 
client deadlock on session expiration, which is a definite plus!

Unfortunately we are seeing memory leak that requires our zk clients to be 
restarted every half-day. Valgrind result:

==8804== 25 (12 direct, 13 indirect) bytes in 1 blocks are definitely lost in 
loss record 255 of 670
==8804==at 0x4021C42: calloc (vg_replace_malloc.c:418)
==8804==by 0x5047B42: parse_acls (zookeeper.c:369)
==8804==by 0x5047EF6: pyzoo_create (zookeeper.c:1009)
==8804==by 0x40786CC: PyCFunction_Call (in /usr/lib/libpython2.4.so.1.0)
==8804==by 0x40B31DC: PyEval_EvalFrame (in /usr/lib/libpython2.4.so.1.0)
==8804==by 0x40B4485: PyEval_EvalCodeEx (in /usr/lib/libpython2.4.so.1.0)


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (ZOOKEEPER-732) Improper translation of error into Python exception

2010-06-24 Thread Lei Zhang (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12882333#action_12882333
 ] 

Lei Zhang commented on ZOOKEEPER-732:
-

Seems fixed in 3.3.1:

>>> zookeeper.get_children(0, "/", None)
Traceback (most recent call last):
  File "", line 1, in ?
zookeeper.ZooKeeperException: zhandle out of range

Issue can be closed.

> Improper translation of error into Python exception
> ---
>
> Key: ZOOKEEPER-732
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-732
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.2.2
>Reporter: Gustavo Niemeyer
>Priority: Minor
>
> Apparently errors returned by the C library are not being correctly converted 
> into a Python exception in some cases: 
> >>> zookeeper.get_children(0, "/", None)
> Traceback (most recent call last):
>   File "", line 1, in 
> SystemError: error return without exception set

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (ZOOKEEPER-642) "exceeded deadline by N ms" floods logs

2010-02-23 Thread Lei Zhang (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837506#action_12837506
 ] 

Lei Zhang commented on ZOOKEEPER-642:
-



Maybe I had misinterpreted what you meant by "timeout" - I was referring to
http://mail-archives.apache.org/mod_mbox/hadoop-zookeeper-user/200908.mbox/%253c4a8d7b4b.5020...@apache.org%253e
.

"typically we suggest timeouts in the 20-30 second range"




Same. Still seeing same message every 10 seconds.


> "exceeded deadline by N ms" floods logs
> ---
>
> Key: ZOOKEEPER-642
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-642
> Project: Zookeeper
>  Issue Type: Bug
>  Components: c client
>Affects Versions: 3.2.1
> Environment: virtualized linux - ec2 - ubuntu
>Reporter: Dale Johnson
> Fix For: 3.4.0
>
>
> More important zookeeper warnings are drown out by the following several 
> times per minute:
> 2010-01-12 17:39:57,227:22317(0x4147eb90):zoo_w...@zookeeper_interest@1335: 
> Exceeded deadline by 13ms
> Perhaps this is an issue with the way virtualized systems manage gettimeofday 
> results?
> Maybe the current 10ms threshold could be pushed up a bit.  I notice that 95% 
> of the messages are below 50ms.
> Is there an obvious configuration change that I can make to fix this?
> config file below:
> # The number of milliseconds of each tick
> tickTime=2000
> # The number of ticks that the initial
> # synchronization phase can take
> initLimit=10
> # The number of ticks that can pass between
> # sending a request and getting an acknowledgement
> syncLimit=5
> # the directory where the snapshot is stored.
> dataDir=/mnt/zookeeper
> # the port at which the clients will connect
> clientPort=2181
> server.1=hbase.1:2888:3888
> server.2=hbase.2:2888:3888
> server.3=hbase.3:2888:3888
> server.4=hbase.4:2888:3888
> server.5=hbase.5:2888:3888

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (ZOOKEEPER-642) "exceeded deadline by N ms" floods logs

2010-02-23 Thread Lei Zhang (JIRA)

[ 
https://issues.apache.org/jira/browse/ZOOKEEPER-642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12837457#action_12837457
 ] 

Lei Zhang commented on ZOOKEEPER-642:
-

I have taken over Dale's responsibility of zookeeper. We have bumped up 
tickTime to 2 per Patrick's suggestion in another thread. Now we see these 
"Exceeded deadline by 769ms" message every 10 seconds - I'm testing using 
'cli_st localhost:', on a VMware Linux machine that is mostly idle . I 
echo Dale's comment:

The message as it is has a fairly low diagnostic value.

Since this message is at WARN level, I feel we need to do something. But what:
  o bump up priority of zookeeper daemon
  o check bug in client library
  o check bug in zookeeper server

Somehow this doesn't smell like a real "Exceeded timeline" issue to me.

> "exceeded deadline by N ms" floods logs
> ---
>
> Key: ZOOKEEPER-642
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-642
> Project: Zookeeper
>  Issue Type: Bug
>  Components: c client
>Affects Versions: 3.2.1
> Environment: virtualized linux - ec2 - ubuntu
>Reporter: Dale Johnson
> Fix For: 3.4.0
>
>
> More important zookeeper warnings are drown out by the following several 
> times per minute:
> 2010-01-12 17:39:57,227:22317(0x4147eb90):zoo_w...@zookeeper_interest@1335: 
> Exceeded deadline by 13ms
> Perhaps this is an issue with the way virtualized systems manage gettimeofday 
> results?
> Maybe the current 10ms threshold could be pushed up a bit.  I notice that 95% 
> of the messages are below 50ms.
> Is there an obvious configuration change that I can make to fix this?
> config file below:
> # The number of milliseconds of each tick
> tickTime=2000
> # The number of ticks that the initial
> # synchronization phase can take
> initLimit=10
> # The number of ticks that can pass between
> # sending a request and getting an acknowledgement
> syncLimit=5
> # the directory where the snapshot is stored.
> dataDir=/mnt/zookeeper
> # the port at which the clients will connect
> clientPort=2181
> server.1=hbase.1:2888:3888
> server.2=hbase.2:2888:3888
> server.3=hbase.3:2888:3888
> server.4=hbase.4:2888:3888
> server.5=hbase.5:2888:3888

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (ZOOKEEPER-670) zkpython leading to segfault on zookeeper server restart

2010-02-12 Thread Lei Zhang (JIRA)

 [ 
https://issues.apache.org/jira/browse/ZOOKEEPER-670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lei Zhang updated ZOOKEEPER-670:


Attachment: voyager.patch
zk.py

> zkpython leading to segfault on zookeeper server restart
> 
>
> Key: ZOOKEEPER-670
> URL: https://issues.apache.org/jira/browse/ZOOKEEPER-670
> Project: Zookeeper
>  Issue Type: Bug
>  Components: contrib-bindings
>Affects Versions: 3.2.1, 3.2.2
> Environment: CentOS w/ Python 2.4
>Reporter: Lei Zhang
> Attachments: voyager.patch, zk.py
>
>
> Zookeeper client using zkpython segfaults on zookeeper server restart. It is 
> reliably reproducible using the attached script zk.py.
> I'm able to stop segfault using the attached patch voyager.patch, but 
> zkpython seems to have deeper issue on its use of watcher_dispatch - on 
> zookeeper server restart, I see up to 6 invocation of watcher_dispatch while 
> my script is simply sleeping in the main thread. This can't be right.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Created: (ZOOKEEPER-670) zkpython leading to segfault on zookeeper server restart

2010-02-12 Thread Lei Zhang (JIRA)
zkpython leading to segfault on zookeeper server restart


 Key: ZOOKEEPER-670
 URL: https://issues.apache.org/jira/browse/ZOOKEEPER-670
 Project: Zookeeper
  Issue Type: Bug
  Components: contrib-bindings
Affects Versions: 3.2.2, 3.2.1
 Environment: CentOS w/ Python 2.4
Reporter: Lei Zhang


Zookeeper client using zkpython segfaults on zookeeper server restart. It is 
reliably reproducible using the attached script zk.py.
I'm able to stop segfault using the attached patch voyager.patch, but zkpython 
seems to have deeper issue on its use of watcher_dispatch - on zookeeper server 
restart, I see up to 6 invocation of watcher_dispatch while my script is simply 
sleeping in the main thread. This can't be right.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.