On Thu, Nov 11, 2010 at 4:39 PM, Steven Dake <sd...@redhat.com> wrote:
> On 11/11/2010 02:35 AM, Andrew Beekhof wrote:
>> On Wed, Oct 27, 2010 at 5:15 PM, Steven Dake <sd...@redhat.com> wrote:
>>> On 10/26/2010 11:17 PM, Andrew Beekhof wrote:
>>>>
>>>> On Wed, Oct 27, 2010 at 7:32 AM, nozawat<noza...@gmail.com> wrote:
>>>>>
>>>>> Hi Andrew,
>>>>>
>>>>> I am sending two log files, terminal.log and ha.log.
>>>>> The terminal log contains the output of "ps -ef|grep coro"
>>>>> and "crm_mon -f -1".
>>>>>
>>>>> What I do not understand is that, judging from the log, processing
>>>>> appears to complete normally even though corosync dumps core.
>>>>
>>>> Oct 27 10:53:12 hb0101 corosync[6695]: [pcmk ] plugin.c:1526 ERROR:
>>>> send_cluster_msg_raw: Child 7016 spawned to record non-fatal assertion
>>>> failure line 1526: rc == 0
>>>>
>>>> Oct 27 10:53:12 hb0101 corosync[6695]: [pcmk ] plugin.c:1526 ERROR:
>>>> send_cluster_msg_raw: Message not sent (-1):<copy t="cib"
>>>> cib_op="cib_replace" cib_delegated_from="hb0102"
>>>> cib_clientname="hb0102" cib_isreplyto="hb0102" original_c
>>>>
>>>> For some reason
>>>> rc = pcmk_api->totem_mcast(&iovec, 1, TOTEMPG_SAFE);
>>>> is returning -1
>>>>
>>>>
>>>> Steve: would this happen if membership was in flux?
>>>> I thought only IPC got stopped.
>>>>
>>>
>>> it could
>>>
>>> If api->totem_mcast sends many messages it can fill up the totem queue and
>>> return -1. The best solution to handling sending messages outside of IPC is
>>> to use the schedwrk api. It will request a piece of work be done when the
>>> token is sent (and hopefully there are more spots in the new message queue).
>>> It will continue to schedule work until 0 is returned by the callback
>>> registered with schedwrk.
>>
>> what about a while-loop with a sleep in it?
>>
>>>
>
> That could cause all kinds of problems with the membership system timers
> resulting in weird behavior and bad membership states. That is why
> there is a schedwrk api.
Looks painful. I think I'd prefer people moved to the MCP instead.
>
> Regards
> -steve
>
>>> Regards
>>> -steve
>>>
>>>>>
>>>>> Regards,
>>>>> Tomo
>>>>>
>>>>>
>>>>> 2010/10/27 Andrew Beekhof<and...@beekhof.net>
>>>>>>
>>>>>> On Tue, Oct 26, 2010 at 11:22 AM, nozawat<noza...@gmail.com> wrote:
>>>>>>>
>>>>>>> Hi all,
>>>>>>>
>>>>>>> My environment is as follows.
>>>>>>> * cluster-glue-1.0.6
>>>>>>> * resource-agents-1.0.3
>>>>>>> * corosync-1.2.8 (svn revision '3059')
>>>>>>> * pacemaker-1.1.3-2f0326468a33acb1ada8fa744c7d36d0b315bd35
>>>>>>>
>>>>>>> corosync on the DC node dumped core when I loaded a crm file.
>>>>>>>
>>>>>>> The core file information is as follows.
>>>>>>
>>>>>> log file?
>>>>>> you're tripping over an assertion, it would be good to know which one
>>>>>>
>>>>>>>
>>>>>>> [r...@hb0101 ~]$ file /var/lib/corosync/core.32727
>>>>>>> /var/lib/corosync/core.32727: ELF 64-bit LSB core file AMD x86-64, version 1 (SYSV), SVR4-style, from 'corosync'
>>>>>>>
>>>>>>> [r...@hb0101 ~]$ gdb /usr/sbin/corosync /var/lib/corosync/core.32727
>>>>>>> GNU gdb Fedora (6.8-37.el5)
>>>>>>> Copyright (C) 2008 Free Software Foundation, Inc.
>>>>>>> License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
>>>>>>> This is free software: you are free to change and redistribute it.
>>>>>>> There is NO WARRANTY, to the extent permitted by law. Type "show copying"
>>>>>>> and "show warranty" for details.
>>>>>>> This GDB was configured as "x86_64-redhat-linux-gnu"...
>>>>>>> Reading symbols from /usr/lib64/libtotem_pg.so.4...done.
>>>>>>> Loaded symbols for /usr/lib64/libtotem_pg.so.4
>>>>>>> Reading symbols from /usr/lib64/liblogsys.so.4...done.
>>>>>>> Loaded symbols for /usr/lib64/liblogsys.so.4
>>>>>>> Reading symbols from /usr/lib64/libcoroipcs.so.4...done.
>>>>>>> Loaded symbols for /usr/lib64/libcoroipcs.so.4
>>>>>>> Reading symbols from /lib64/librt.so.1...done.
>>>>>>> Loaded symbols for /lib64/librt.so.1
>>>>>>> Reading symbols from /lib64/libpthread.so.0...done.
>>>>>>> Loaded symbols for /lib64/libpthread.so.0
>>>>>>> Reading symbols from /lib64/libdl.so.2...done.
>>>>>>> Loaded symbols for /lib64/libdl.so.2
>>>>>>> Reading symbols from /lib64/libc.so.6...done.
>>>>>>> Loaded symbols for /lib64/libc.so.6
>>>>>>> Reading symbols from /usr/lib64/libssl3.so...done.
>>>>>>> Loaded symbols for /usr/lib64/libssl3.so
>>>>>>> Reading symbols from /usr/lib64/libsmime3.so...done.
>>>>>>> Loaded symbols for /usr/lib64/libsmime3.so
>>>>>>> Reading symbols from /usr/lib64/libnss3.so...done.
>>>>>>> Loaded symbols for /usr/lib64/libnss3.so
>>>>>>> Reading symbols from /usr/lib64/libnssutil3.so...done.
>>>>>>> Loaded symbols for /usr/lib64/libnssutil3.so
>>>>>>> Reading symbols from /usr/lib64/libplds4.so...done.
>>>>>>> Loaded symbols for /usr/lib64/libplds4.so
>>>>>>> Reading symbols from /usr/lib64/libplc4.so...done.
>>>>>>> Loaded symbols for /usr/lib64/libplc4.so
>>>>>>> Reading symbols from /usr/lib64/libnspr4.so...done.
>>>>>>> Loaded symbols for /usr/lib64/libnspr4.so
>>>>>>> Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
>>>>>>> Loaded symbols for /lib64/ld-linux-x86-64.so.2
>>>>>>> Reading symbols from /usr/libexec/lcrso/objdb.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/objdb.lcrso
>>>>>>> Reading symbols from /usr/libexec/lcrso/coroparse.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/coroparse.lcrso
>>>>>>> Reading symbols from /usr/libexec/lcrso/pacemaker.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/pacemaker.lcrso
>>>>>>> Reading symbols from /usr/lib64/libplumb.so.2...done.
>>>>>>> Loaded symbols for /usr/lib64/libplumb.so.2
>>>>>>> Reading symbols from /usr/lib64/libpils.so.2...done.
>>>>>>> Loaded symbols for /usr/lib64/libpils.so.2
>>>>>>> Reading symbols from /usr/lib64/libbz2.so.1...done.
>>>>>>> Loaded symbols for /usr/lib64/libbz2.so.1
>>>>>>> Reading symbols from /usr/lib64/libxslt.so.1...done.
>>>>>>> Loaded symbols for /usr/lib64/libxslt.so.1
>>>>>>> Reading symbols from /usr/lib/libxml2.so.2...done.
>>>>>>> Loaded symbols for /usr/lib/libxml2.so.2
>>>>>>> Reading symbols from /lib64/libuuid.so.1...done.
>>>>>>> Loaded symbols for /lib64/libuuid.so.1
>>>>>>> Reading symbols from /lib64/libpam.so.0...done.
>>>>>>> Loaded symbols for /lib64/libpam.so.0
>>>>>>> Reading symbols from /lib64/libglib-2.0.so.0...done.
>>>>>>> Loaded symbols for /lib64/libglib-2.0.so.0
>>>>>>> Reading symbols from /usr/lib64/libz.so.1...done.
>>>>>>> Loaded symbols for /usr/lib64/libz.so.1
>>>>>>> Reading symbols from /lib64/libm.so.6...done.
>>>>>>> Loaded symbols for /lib64/libm.so.6
>>>>>>> Reading symbols from /lib64/libaudit.so.0...done.
>>>>>>> Loaded symbols for /lib64/libaudit.so.0
>>>>>>> Reading symbols from /lib64/libnss_files.so.2...done.
>>>>>>> Loaded symbols for /lib64/libnss_files.so.2
>>>>>>> Reading symbols from /usr/libexec/lcrso/service_evs.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_evs.lcrso
>>>>>>> Reading symbols from /usr/libexec/lcrso/service_cfg.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_cfg.lcrso
>>>>>>> Reading symbols from /usr/libexec/lcrso/service_cpg.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_cpg.lcrso
>>>>>>> Reading symbols from /usr/libexec/lcrso/service_confdb.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_confdb.lcrso
>>>>>>> Reading symbols from /usr/libexec/lcrso/service_pload.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_pload.lcrso
>>>>>>> Reading symbols from /usr/libexec/lcrso/vsf_quorum.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/vsf_quorum.lcrso
>>>>>>> Core was generated by `corosync'.
>>>>>>> Program terminated with signal 6, Aborted.
>>>>>>> [New process 32727]
>>>>>>> #0 0x0000003fff430265 in raise () from /lib64/libc.so.6
>>>>>>> (gdb) where
>>>>>>> #0 0x0000003fff430265 in raise () from /lib64/libc.so.6
>>>>>>> #1 0x0000003fff431d10 in abort () from /lib64/libc.so.6
>>>>>>> #2 0x00002aaaaabaea0e in send_cluster_msg_raw () from /usr/libexec/lcrso/pacemaker.lcrso
>>>>>>> #3 0x00002aaaaabae4e2 in route_ais_message () from /usr/libexec/lcrso/pacemaker.lcrso
>>>>>>> #4 0x00002aaaaabac13f in pcmk_ipc () from /usr/libexec/lcrso/pacemaker.lcrso
>>>>>>> #5 0x00000039316026cc in pthread_ipc_consumer (conn=<value optimized out>) at coroipcs.c:727
>>>>>>> #6 0x00000030000064a7 in start_thread () from /lib64/libpthread.so.0
>>>>>>> #7 0x0000003fff4d3c2d in clone () from /lib64/libc.so.6
>>>>>>> (gdb)
>>>>>>>
>>>>>>>
>>>>>>> Regards,
>>>>>>> Tomo
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> _______________________________________________
>>>>>>> Openais mailing list
>>>>>>> Openais@lists.linux-foundation.org
>>>>>>> https://lists.linux-foundation.org/mailman/listinfo/openais
>>>>>>>
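
For readers following the schedwrk discussion earlier in the thread: the idea Steven describes is to hand the engine a callback that sends as much as currently fits, and to be called again on a later token until the callback returns 0. The thread does not show the real schedwrk interface, so the sketch below is only a toy model of that pattern. Every name in it (mock_mcast, mock_token_arrives, drain_backlog, run_scheduled_work, QUEUE_SLOTS) is a hypothetical stand-in, not a corosync API, and the queue size of 4 is an arbitrary assumption.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-ins for illustration only; none of these are corosync APIs. */
#define QUEUE_SLOTS 4           /* assumed capacity of the pretend totem queue */

static int queue_used;          /* messages queued and waiting for the token */
static int delivered;           /* messages flushed out so far */

/* Mimics the failure in the thread: returns -1 when the queue is full. */
static int mock_mcast(int msg)
{
    (void)msg;
    assert(queue_used <= QUEUE_SLOTS);
    if (queue_used == QUEUE_SLOTS) {
        return -1;
    }
    queue_used++;
    return 0;
}

/* Pretend the token came around: everything queued goes out, slots free up. */
static void mock_token_arrives(void)
{
    delivered += queue_used;
    queue_used = 0;
}

/* Messages still to be sent, plus a cursor so work can resume where it stopped. */
struct backlog {
    const int *msgs;
    size_t count;
    size_t next;
};

/* Work callback: send what fits now. Returns 0 when the backlog is empty,
 * nonzero to ask to be scheduled again on the next token. */
static int drain_backlog(void *context)
{
    struct backlog *b = context;

    while (b->next < b->count) {
        if (mock_mcast(b->msgs[b->next]) != 0) {
            return -1;          /* queue full: stop without blocking, retry later */
        }
        b->next++;
    }
    return 0;
}

/* Toy scheduler: run the callback once per token until it returns 0.
 * Returns the number of tokens it took. */
static int run_scheduled_work(int (*fn)(void *), void *context)
{
    int tokens = 0;

    for (;;) {
        mock_token_arrives();
        tokens++;
        if (fn(context) == 0) {
            break;
        }
    }
    mock_token_arrives();       /* flush the final batch */
    return tokens;
}
```

Calling run_scheduled_work(drain_backlog, &b) with a struct backlog holding ten messages drains them in batches of at most four. The point of the pattern is that the callback never sleeps or spins while the queue is full; it simply returns and waits to be called again, which is what distinguishes it from the while-loop-with-sleep idea raised in the thread.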