On Thu, Nov 11, 2010 at 4:39 PM, Steven Dake <sd...@redhat.com> wrote:
> On 11/11/2010 02:35 AM, Andrew Beekhof wrote:
>> On Wed, Oct 27, 2010 at 5:15 PM, Steven Dake <sd...@redhat.com> wrote:
>>> On 10/26/2010 11:17 PM, Andrew Beekhof wrote:
>>>>
>>>> On Wed, Oct 27, 2010 at 7:32 AM, nozawat<noza...@gmail.com>  wrote:
>>>>>
>>>>> Hi Andrew,
>>>>>
>>>>>  I send two log files of terminal.log and ha.log.
>>>>>  The contents of the terminal log are command results of "ps -ef|grep
>>>>> coro"
>>>>> and "crm_mon -f -1".
>>>>>
>>>>>  It is what processing completes normally when what did not understand me
>>>>> well watches log though corosync outputs core.
>>>>
>>>> Oct 27 10:53:12 hb0101 corosync[6695]:   [pcmk  ] plugin.c:1526 ERROR:
>>>> send_cluster_msg_raw: Child 7016 spawned to record non-fatal assertion
>>>> failure line 1526: rc == 0
>>>>
>>>> Oct 27 10:53:12 hb0101 corosync[6695]:   [pcmk  ] plugin.c:1526 ERROR:
>>>> send_cluster_msg_raw: Message not sent (-1):<copy t="cib"
>>>> cib_op="cib_replace" cib_delegated_from="hb0102"
>>>> cib_clientname="hb0102" cib_isreplyto="hb0102" original_c
>>>>
>>>> For some reason
>>>>     rc = pcmk_api->totem_mcast(&iovec, 1, TOTEMPG_SAFE);
>>>> is returning -1
>>>>
>>>>
>>>> Steve: would this happen if membership was in flux?
>>>> I thought only IPC got stopped.
>>>>
>>>
>>> it could
>>>
>>> If api->totem_mcast sends many messages it can fill up the totem queue and
>>> return -1.  The best solution to handling sending messages outside of IPC is
>>> to use the schedwrk api.  It will request a piece of work be done when the
>>> token is sent (and hopefully there are more spots in the new message queue).
>>>  It will continue to schedule work until 0 is retuned by the callback
>>> registered with schedwrk.
>>
>> what about a while-loop with a sleep in it?
>>
>>>
>
> That could cause all kinds of problems with the membership system timers
> resulting in wierd behavior and bad membership states.  That is why
> there is a schedwrk api.

Looks painful.
I think I'd prefer people moved to the MCP instead.

>
> Regards
> -steve
>
>>> Regards
>>> -steve
>>>
>>>>>
>>>>> Regards,
>>>>> Tomo
>>>>>
>>>>>
>>>>> 2010/10/27 Andrew Beekhof<and...@beekhof.net>
>>>>>>
>>>>>> On Tue, Oct 26, 2010 at 11:22 AM, nozawat<noza...@gmail.com>  wrote:
>>>>>>>
>>>>>>> Hi all,
>>>>>>>
>>>>>>> My environment is as follows.
>>>>>>>  * cluster-glue-1.0.6
>>>>>>>  * resource-agents-1.0.3
>>>>>>>  * corosync-1.2.8 (svn revision '3059')
>>>>>>>  * pacemaker-1.1.3-2f0326468a33acb1ada8fa744c7d36d0b315bd35
>>>>>>>
>>>>>>> Core file was output by corosync of the DC node when I load a crm file.
>>>>>>>
>>>>>>> It is the infomation of the core file as follows.
>>>>>>
>>>>>> log file?
>>>>>> you're tripping over an assertion, it would be good to know which one
>>>>>>
>>>>>>>
>>>>>>> [r...@hb0101 ~]$ file /var/lib/corosync/core.32727
>>>>>>> /var/lib/corosync/core.32727: ELF 64-bit LSB core file AMD x86-64,
>>>>>>> version 1
>>>>>>> (SYSV), SVR4-style, from 'corosync'
>>>>>>>
>>>>>>> [r...@hb0101 ~]$ gdb /usr/sbin/corosync /var/lib/corosync/core.32727
>>>>>>> GNU gdb Fedora (6.8-37.el5)
>>>>>>> Copyright (C) 2008 Free Software Foundation, Inc.
>>>>>>> License GPLv3+: GNU GPL version 3 or later
>>>>>>> <http://gnu.org/licenses/gpl.html>
>>>>>>> This is free software: you are free to change and redistribute it.
>>>>>>> There is NO WARRANTY, to the extent permitted by law.  Type "show
>>>>>>> copying"
>>>>>>> and "show warranty" for details.
>>>>>>> This GDB was configured as "x86_64-redhat-linux-gnu"...
>>>>>>> Reading symbols from /usr/lib64/libtotem_pg.so.4...done.
>>>>>>> Loaded symbols for /usr/lib64/libtotem_pg.so.4
>>>>>>> Reading symbols from /usr/lib64/liblogsys.so.4...done.
>>>>>>> Loaded symbols for /usr/lib64/liblogsys.so.4
>>>>>>> Reading symbols from /usr/lib64/libcoroipcs.so.4...done.
>>>>>>> Loaded symbols for /usr/lib64/libcoroipcs.so.4
>>>>>>> Reading symbols from /lib64/librt.so.1...done.
>>>>>>> Loaded symbols for /lib64/librt.so.1
>>>>>>> Reading symbols from /lib64/libpthread.so.0...done.
>>>>>>> Loaded symbols for /lib64/libpthread.so.0
>>>>>>> Reading symbols from /lib64/libdl.so.2...done.
>>>>>>> Loaded symbols for /lib64/libdl.so.2
>>>>>>> Reading symbols from /lib64/libc.so.6...done.
>>>>>>> Loaded symbols for /lib64/libc.so.6
>>>>>>> Reading symbols from /usr/lib64/libssl3.so...done.
>>>>>>> Loaded symbols for /usr/lib64/libssl3.so
>>>>>>> Reading symbols from /usr/lib64/libsmime3.so...done.
>>>>>>> Loaded symbols for /usr/lib64/libsmime3.so
>>>>>>> Reading symbols from /usr/lib64/libnss3.so...done.
>>>>>>> Loaded symbols for /usr/lib64/libnss3.so
>>>>>>> Reading symbols from /usr/lib64/libnssutil3.so...done.
>>>>>>> Loaded symbols for /usr/lib64/libnssutil3.so
>>>>>>> Reading symbols from /usr/lib64/libplds4.so...done.
>>>>>>> Loaded symbols for /usr/lib64/libplds4.so
>>>>>>> Reading symbols from /usr/lib64/libplc4.so...done.
>>>>>>> Loaded symbols for /usr/lib64/libplc4.so
>>>>>>> Reading symbols from /usr/lib64/libnspr4.so...done.
>>>>>>> Loaded symbols for /usr/lib64/libnspr4.so
>>>>>>> Reading symbols from /lib64/ld-linux-x86-64.so.2...done.
>>>>>>> Loaded symbols for /lib64/ld-linux-x86-64.so.2
>>>>>>> Reading symbols from /usr/libexec/lcrso/objdb.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/objdb.lcrso
>>>>>>> Reading symbols from /usr/libexec/lcrso/coroparse.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/coroparse.lcrso
>>>>>>> Reading symbols from /usr/libexec/lcrso/pacemaker.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/pacemaker.lcrso
>>>>>>> Reading symbols from /usr/lib64/libplumb.so.2...done.
>>>>>>> Loaded symbols for /usr/lib64/libplumb.so.2
>>>>>>> Reading symbols from /usr/lib64/libpils.so.2...done.
>>>>>>> Loaded symbols for /usr/lib64/libpils.so.2
>>>>>>> Reading symbols from /usr/lib64/libbz2.so.1...done.
>>>>>>> Loaded symbols for /usr/lib64/libbz2.so.1
>>>>>>> Reading symbols from /usr/lib64/libxslt.so.1...done.
>>>>>>> Loaded symbols for /usr/lib64/libxslt.so.1
>>>>>>> Reading symbols from /usr/lib/libxml2.so.2...done.
>>>>>>> Loaded symbols for /usr/lib/libxml2.so.2
>>>>>>> Reading symbols from /lib64/libuuid.so.1...done.
>>>>>>> Loaded symbols for /lib64/libuuid.so.1
>>>>>>> Reading symbols from /lib64/libpam.so.0...done.
>>>>>>> Loaded symbols for /lib64/libpam.so.0
>>>>>>> Reading symbols from /lib64/libglib-2.0.so.0...done.
>>>>>>> Loaded symbols for /lib64/libglib-2.0.so.0
>>>>>>> Reading symbols from /usr/lib64/libz.so.1...done.
>>>>>>> Loaded symbols for /usr/lib64/libz.so.1
>>>>>>> Reading symbols from /lib64/libm.so.6...done.
>>>>>>> Loaded symbols for /lib64/libm.so.6
>>>>>>> Reading symbols from /lib64/libaudit.so.0...done.
>>>>>>> Loaded symbols for /lib64/libaudit.so.0
>>>>>>> Reading symbols from /lib64/libnss_files.so.2...done.
>>>>>>> Loaded symbols for /lib64/libnss_files.so.2
>>>>>>> Reading symbols from /usr/libexec/lcrso/service_evs.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_evs.lcrso
>>>>>>> Reading symbols from /usr/libexec/lcrso/service_cfg.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_cfg.lcrso
>>>>>>> Reading symbols from /usr/libexec/lcrso/service_cpg.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_cpg.lcrso
>>>>>>> Reading symbols from /usr/libexec/lcrso/service_confdb.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_confdb.lcrso
>>>>>>> Reading symbols from /usr/libexec/lcrso/service_pload.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/service_pload.lcrso
>>>>>>> Reading symbols from /usr/libexec/lcrso/vsf_quorum.lcrso...done.
>>>>>>> Loaded symbols for /usr/libexec/lcrso/vsf_quorum.lcrso
>>>>>>> Core was generated by `corosync'.
>>>>>>> Program terminated with signal 6, Aborted.
>>>>>>> [New process 32727]
>>>>>>> #0  0x0000003fff430265 in raise () from /lib64/libc.so.6
>>>>>>> (gdb) where
>>>>>>> #0  0x0000003fff430265 in raise () from /lib64/libc.so.6
>>>>>>> #1  0x0000003fff431d10 in abort () from /lib64/libc.so.6
>>>>>>> #2  0x00002aaaaabaea0e in send_cluster_msg_raw () from
>>>>>>> /usr/libexec/lcrso/pacemaker.lcrso
>>>>>>> #3  0x00002aaaaabae4e2 in route_ais_message () from
>>>>>>> /usr/libexec/lcrso/pacemaker.lcrso
>>>>>>> #4  0x00002aaaaabac13f in pcmk_ipc () from
>>>>>>> /usr/libexec/lcrso/pacemaker.lcrso
>>>>>>> #5  0x00000039316026cc in pthread_ipc_consumer (conn=<value optimized
>>>>>>> out>)
>>>>>>> at coroipcs.c:727
>>>>>>> #6  0x00000030000064a7 in start_thread () from /lib64/libpthread.so.0
>>>>>>> #7  0x0000003fff4d3c2d in clone () from /lib64/libc.so.6
>>>>>>> (gdb)
>>>>>>>
>>>>>>>
>>>>>>> Regards,
>>>>>>> Tomo
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> _______________________________________________
>>>>>>> Openais mailing list
>>>>>>> Openais@lists.linux-foundation.org
>>>>>>> https://lists.linux-foundation.org/mailman/listinfo/openais
>>>>>>>
>>>>>
>>>>>
>>>> _______________________________________________
>>>> Openais mailing list
>>>> Openais@lists.linux-foundation.org
>>>> https://lists.linux-foundation.org/mailman/listinfo/openais
>>>
>>>
>
>
_______________________________________________
Openais mailing list
Openais@lists.linux-foundation.org
https://lists.linux-foundation.org/mailman/listinfo/openais

Reply via email to