Hi Hideo-san, On Fri, Mar 19, 2010 at 03:52:55PM +0900, [email protected] wrote: > Hi, > > lrmd fails in the log output, and there is a case to reboot. > > (snip) > Mar 19 13:48:07 kvm1 lrmd: [2402]: info: rsc:prmApPostgreSQLDB:29: monitorMar > 19 13:48:08 kvm1 lrmd: > [2402]: info: Managed prmApPostgreSQLDB:monitor process 3153 exited with > return code 0.Mar 19 13:48:08 > kvm1 crmd: [2405]: info: process_lrm_event: LRM operation > prmApPostgreSQLDB_monitor_10000 (call=29, > rc=0, cib-update=70, confirmed=false) okMar 19 13:48:08 kvm1 crmd: [2405]: > info: match_graph_event: > Action prmApPostgreSQLDB_monitor_10000 (23) confirmed on kvm1 (rc=0)Mar 19 > 13:48:08 kvm1 crmd: [2405]: > info: run_graph: ====================================================Mar 19 > 13:48:08 kvm1 crmd: > [2405]: notice: run_graph: Transition 6 (Complete=13, Pending=0, Fired=0, > Skipped=0, Incomplete=0, > Source=/var/lib/pengine/pe-input-6.bz2): CompleteMar 19 13:48:08 kvm1 crmd: > [2405]: info: > te_graph_trigger: Transition 6 is now complete > Mar 19 13:48:08 kvm1 crmd: [2405]: info: notify_crmd: Transition 6 status: > done - <null> > Mar 19 13:48:08 kvm1 crmd: [2405]: info: do_state_transition: State > transition S_TRANSITION_ENGINE -> > S_IDLE [ input=I_TE_SUCCESS cause=C_FSA_INTERNAL origin=notify_crmd ] > Mar 19 13:48:08 kvm1 crmd: [2405]: info: do_state_transition: Starting > PEngine Recheck Timer > Mar 19 13:48:15 kvm1 crmd: [2405]: CRIT: lrm_connection_destroy: LRM > Connection failed > Mar 19 13:48:15 kvm1 crmd: [2405]: info: lrm_connection_destroy: LRM > Connection disconnected > Mar 19 13:48:15 kvm1 crmd: [2405]: ERROR: do_log: FSA: Input I_ERROR from > lrm_connection_destroy() > received in state S_IDLEMar 19 13:48:15 kvm1 crmd: [2405]: info: > do_state_transition: State transition > S_IDLE -> S_RECOVERY [ input=I_ERROR cause=C_FSA_INTERNAL > origin=lrm_connection_destroy ]Mar 19 > 13:48:15 kvm1 crmd: [2405]: ERROR: do_recover: Action A_RECOVER > (0000000001000000) not supported > Mar 19 13:48:15 kvm1 lrmd: [3204]: info: G_main_add_SignalHandler: Added > signal handler for signal 15 > (snip) > > It is a problem to handle NULL somehow or other. > > --------------PATCH START------------------ > diff -r 616c40df1d5d lrm/lrmd/lrmd.h > --- a/lrm/lrmd/lrmd.h Fri Mar 19 15:49:41 2010 +0900 > +++ b/lrm/lrmd/lrmd.h Fri Mar 19 15:50:49 2010 +0900 > @@ -167,7 +167,7 @@ > (longclockto_ms(sub_longclock(time_longclock(), > op->t_lastlogmsg))/1000 >= \ > (unsigned long)LOGMSG_INTERVAL) > #define probe_str(op,op_type) \ > - ((!op->interval && !strcmp(op_type,"monitor")) ? "probe" : op_type) > + ((op != NULL && !op->interval && !strcmp(op_type,"monitor")) ? > "probe" : op_type) > > struct lrmd_rsc > { > --------------PATCH END------------------
Thanks for finding this and for the patch. Cheers, Dejan > Best Regards, > Hideo Yamauchi. > > _______________________________________________________ > Linux-HA-Dev: [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev > Home Page: http://linux-ha.org/ _______________________________________________________ Linux-HA-Dev: [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev Home Page: http://linux-ha.org/
