[Openais] recover from corosync daemon restart and cpg_finalize timing

2010-06-25 Thread dan clark
Dear Gentle Reader Attached is a small test program to stress initializing and finalizing communication between a corosync cpg client and the corosync daemon. The test was run under version 1.2.4. Initial testing was with a single node, subsequent testing occurred on a system consisting of 3

Re: [Openais] recover from corosync daemon restart and cpg_finalize timing

2010-06-24 Thread Andrew Beekhof
On Thu, Jun 24, 2010 at 1:50 AM, dan clark 2cla...@gmail.com wrote: Dear Gentle Reader Attached is a small test program to stress initializing and finalizing communication between a corosync cpg client and the corosync daemon. The test was run under version 1.2.4.  Initial testing was

Re: [Openais] recover from corosync daemon restart and cpg_finalize timing

2010-06-24 Thread Steven Dake
On 06/23/2010 11:35 PM, Andrew Beekhof wrote: On Thu, Jun 24, 2010 at 1:50 AM, dan clark2cla...@gmail.com wrote: Dear Gentle Reader Attached is a small test program to stress initializing and finalizing communication between a corosync cpg client and the corosync daemon. The test was

Re: [Openais] recover from corosync daemon restart and cpg_finalize timing

2010-06-24 Thread Andrew Beekhof
On Thu, Jun 24, 2010 at 9:16 AM, Steven Dake sd...@redhat.com wrote: On 06/23/2010 11:35 PM, Andrew Beekhof wrote: On Thu, Jun 24, 2010 at 1:50 AM, dan clark2cla...@gmail.com  wrote: Dear Gentle Reader Attached is a small test program to stress initializing and finalizing communication

Re: [Openais] recover from corosync daemon restart and cpg_finalize timing

2010-06-24 Thread Steven Dake
Dan, Thanks for the test case responses inline On 06/23/2010 04:50 PM, dan clark wrote: Dear Gentle Reader Attached is a small test program to stress initializing and finalizing communication between a corosync cpg client and the corosync daemon. The test was run under version 1.2.4.

Re: [Openais] recover from corosync daemon restart and cpg_finalize timing

2010-06-24 Thread dan clark
Thank you for trying out this test. I have upgraded to release 1.2.5 and applied the fix posted for the leak to /dev/shm. Unfortunately when I run the test application (slightly modified to fix a couple of bugs I found) I still find /dev/shm filling up with large files control_buffer-xxx,

Re: [Openais] recover from corosync daemon restart and cpg_finalize timing

2010-06-24 Thread dan clark
Hi Steven! I really appreciate the consideration you have given to this scenario and I am thankful that the test case has warranted a bug submission and your work to move through the hoops required to do the e-work. Please note, the stress test only reflects the nature of the problem. A primary