Re: [Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse

2010-05-11 Thread Vadym Chepkov
There is nothing to kill. crmd has finished (I can see it in the log) and it's a ghost in defunct state at this point. On Tue, May 11, 2010 at 8:42 AM, Dejan Muhamedagic wrote: > Hi, > > On Tue, May 11, 2010 at 07:40:39AM -0400, Vadym Chepkov wrote: > > By the way, reboot is too drastic, I do kil

Re: [Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse

2010-05-11 Thread Dejan Muhamedagic
Hi, On Tue, May 11, 2010 at 07:40:39AM -0400, Vadym Chepkov wrote: > By the way, reboot is too drastic, I do kill -9 of the corosync I guess that corosync is waiting for crmd to stop. Did you try to kill crmd? Thanks, Dejan > On May 11, 2010, at 7:37 AM, Alain.Moulle wrote: > > > Hi Steven ,

Re: [Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse

2010-05-11 Thread Vadym Chepkov
By the way, reboot is too drastic, I do kill -9 of the corosync On May 11, 2010, at 7:37 AM, Alain.Moulle wrote: > Hi Steven , > Vadym, just to know: did you execute crm_mon on another window when the > corosync > shutdown was stalled , just to see if there was some "failed" items ? > On my side

Re: [Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse

2010-05-11 Thread Vadym Chepkov
To not screw up my resources, I stop all of them first to make sure nothing is running: crm configure property stop-all-resources=true Then I issue service corosync stop and it never ends. If I run ps, I can see that all of corosync's children have stopped and are in the <defunct> state. Vadym On May 11, 2010, at 7:37 AM, A
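A minimal sketch of the ps check described in this message, assuming the process list comes from `ps -eo pid,ppid,stat,comm` on Linux. The function name, the sample text, and the PIDs are all hypothetical and only illustrate how defunct (zombie) children of a stalled corosync could be picked out.

```python
def find_defunct_children(ps_output, parent_pid):
    """Return (pid, command) pairs for zombie processes whose parent is parent_pid.

    ps_output is assumed to be the text printed by `ps -eo pid,ppid,stat,comm`.
    """
    defunct = []
    for line in ps_output.splitlines()[1:]:  # skip the header row
        fields = line.split(None, 3)
        if len(fields) < 4:
            continue
        pid, ppid, stat, comm = fields
        # A process state starting with "Z" marks a zombie (defunct) process.
        if int(ppid) == parent_pid and stat.startswith("Z"):
            defunct.append((int(pid), comm.strip()))
    return defunct


# Hypothetical sample output; the PIDs are made up for illustration.
SAMPLE = """\
  PID  PPID STAT COMMAND
 2101     1 Ssl  corosync
 2110  2101 Z    crmd <defunct>
 2111  2101 Z    attrd <defunct>
 2200     1 Ss   sshd
"""

print(find_defunct_children(SAMPLE, 2101))
```

A zombie has already exited and is only waiting for its parent to reap it, so signals sent to it have no effect. That matches the later remark in the thread that there is nothing left to kill.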

Re: [Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse

2010-05-11 Thread Alain.Moulle
Hi Steven, Vadym, just to know: did you execute crm_mon in another window when the corosync shutdown was stalled, just to see if there were some "failed" items? On my side: I've set debug off and the news (bad or good) is that it did not occur again, but it was also the case since yesterday wi

Re: [Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse

2010-05-11 Thread Vadym Chepkov
The bad news - it didn't help, still observing the same issue. The good news - it's 100% reproducible. Vadym On May 10, 2010, at 7:19 PM, Steven Dake wrote: > On Mon, 2010-05-10 at 19:02 -0400, Vadym Chepkov wrote: >> Yes, I am >> > try without > >> >> On May 10, 2010, at 6:59 PM, Steven Dake

Re: [Openais] plan for resolving corosync services unloading, problem blocking shutdown on opensuse

2010-05-10 Thread Andrew Beekhof
On Tue, May 11, 2010 at 7:52 AM, Steven Dake wrote: > On Tue, 2010-05-11 at 07:48 +0200, Alain.Moulle wrote: >> Hi, >> FYI : me too, I have debug : on and I faced the problem on RHEL5 as well >> as on fc12. >> Alain > > I have found the root cause I believe is related to your issues. > Basically w

Re: [Openais] plan for resolving corosync services unloading, problem blocking shutdown on opensuse

2010-05-10 Thread Steven Dake
On Tue, 2010-05-11 at 07:48 +0200, Alain.Moulle wrote: > Hi, > FYI : me too, I have debug : on and I faced the problem on RHEL5 as well > as on fc12. > Alain I have found the root cause I believe is related to your issues. Basically with debug:on the internal buffers inside logsys are overflowed,
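Since the suspected trigger is `debug: on`, here is a rough sketch of a check for that directive in the text of a corosync.conf. It is a simplification: it scans every line for a `debug` key and does not track which section (for example `logging { ... }`) the key belongs to. The function name and the sample config are hypothetical.

```python
def debug_enabled(conf_text):
    """Return True if an uncommented `debug: on` directive appears in conf_text."""
    for raw in conf_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue  # section openers/closers such as "logging {" or "}"
        key, value = line.split(":", 1)
        if key.strip() == "debug" and value.strip() == "on":
            return True
    return False


# Hypothetical config fragment for illustration only.
SAMPLE_CONF = """\
logging {
    to_syslog: yes
    # debug: off
    debug: on
}
"""

print(debug_enabled(SAMPLE_CONF))
```

To run it against a real node, read the config file (commonly /etc/corosync/corosync.conf, though the path is an assumption here) and pass its contents to the function.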

Re: [Openais] plan for resolving corosync services unloading, problem blocking shutdown on opensuse

2010-05-10 Thread Alain.Moulle
Hi, FYI : me too, I have debug : on and I faced the problem on RHEL5 as well as on fc12. Alain > Hi, > > I experienced the same issue on Redhat 5.5 PPC. > I compiled all packages myself, since there are no ppc packages available in > the clusterlabs repository. > If Andrew will post his SRPM some

Re: [Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse

2010-05-10 Thread Steven Dake
On Mon, 2010-05-10 at 23:58 +0200, Andreas Mock wrote: > -Original Message- > From: Steven Dake > Sent: 10.05.2010 23:38:01 > To: "Alain.Moulle" > Subject: [Openais] plan for resolving corosync services unloading problem > blocking shutdown on

Re: [Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse

2010-05-10 Thread Steven Dake
On Mon, 2010-05-10 at 19:02 -0400, Vadym Chepkov wrote: > Yes, I am > try without > > On May 10, 2010, at 6:59 PM, Steven Dake wrote: > > > Do you have debug: on in your config file? > > > > Regards > > -steve > > > > On Mon, 2010-05-10 at 18:24 -0400, Vadym Chepkov wrote: > >> Hi, > >> > >>

Re: [Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse

2010-05-10 Thread Vadym Chepkov
Yes, I am On May 10, 2010, at 6:59 PM, Steven Dake wrote: > Do you have debug: on in your config file? > > Regards > -steve > > On Mon, 2010-05-10 at 18:24 -0400, Vadym Chepkov wrote: >> Hi, >> >> I experienced the same issue on Redhat 5.5 PPC. >> I compiled all packages myself, since there a

Re: [Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse

2010-05-10 Thread Steven Dake
Do you have debug: on in your config file? Regards -steve On Mon, 2010-05-10 at 18:24 -0400, Vadym Chepkov wrote: > Hi, > > I experienced the same issue on Redhat 5.5 PPC. > I compiled all packages myself, since there are no ppc packages available in > the clusterlabs repository. > If Andrew wi

Re: [Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse

2010-05-10 Thread Vadym Chepkov
Hi, I experienced the same issue on Redhat 5.5 PPC. I compiled all packages myself, since there are no ppc packages available in the clusterlabs repository. If Andrew will post his SRPM somewhere or maybe instructions how to compile it, I would be happy to contribute. Vadym On May 10, 2010, at

Re: [Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse

2010-05-10 Thread Steven Dake
Bug analysis that we are undertaking can be found here: https://bugzilla.redhat.com/show_bug.cgi?id=590898 Please feel free to add any extra data you may have beyond the backtrace. Thanks -steve On Mon, 2010-05-10 at 14:38 -0700, Steven Dake wrote: > It seems pretty clear from the mailing list

Re: [Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse

2010-05-10 Thread Andreas Mock
-Original Message- From: Steven Dake Sent: 10.05.2010 23:38:01 To: "Alain.Moulle" Subject: [Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse >We will begin analysis of the instrumentation results once we have a >t

[Openais] plan for resolving corosync services unloading problem blocking shutdown on opensuse

2010-05-10 Thread Steven Dake
It seems pretty clear from the recent mailing list traffic that there is a critical flaw in the shutdown, related in some way to Pacemaker and Corosync, that happens on a few people's opensuse systems. It seems to reproduce only on opensuse; however, we don't know if it is limited to this platform. Fi