Hello all,

the value "default-action-timeout" was too small;(

All thinks works FINE:))))))))))


Greetings 

Andre

Am Donnerstag, 14. Juni 2007 schrieb Andre Heine:
> Hello all,
>
> I need some help with LSB nfssserver on SLES9 64bit Linux.
>
> We run a two Node cluster with heartbeat 2.0.8 and crm=yes...
> I configured DRDB/NFS/etc. All think works fine, when I start them
> manually without HA.
>
> The Problem:
>
> Heartbeat starts IMHO correctly. I can see the nodes on the crm_mon
> with status "online".
>
> For a few seconds all resources was started on my preferred master,
> but heartbeat stopped them all;(
>
> On the the secondary the same problem;(
>
> -- cib.xml -- resources --------------
>
>         <primitive class="lsb" id="nfsserver_4" type="nfsserver">
>            <operations>
>              <op id="nfsserver_4_mon" interval="120s"
> name="monitor" timeout="240s"/>
>            </operations>
>          </primitive>
> - - - - - - - - - - - - - - - -
>
> When I remove this entry from the cib.xml alle resources will start
> via HA (drbddisk/FileSystem/IPaddr/Mailto) correctly!
>
> So I look at /var/log/messages and find some strange log entries:
>
> --/var/log/messages -------------------
> crmd: info: do_lrm_rsc_op: Performing op=nfsserver_4_start_0
> lrmd: WARN: For LSB init script, no additional parameters are
> needed.
>
> lrmd: info: RA output: (nfsserver_4:start:stdout) Starting kernel \
>                                                       based NFS
> server
>
>
> lrmd: WARN: on_op_timeout_expired: TIMEOUT: operation \
>             start[13] on lsb::nfsserver::nfsserver_4 for client,
> its \ parameters: CRM_meta_op_target_rc=[7] \
>            CRM_meta_timeout=[5000] crm_feature_set=[1.0.7] .
> ---->                              ^^^^^^^^^^^^^^^^^
> crmd: [12358]: ERROR: process_lrm_event: LRM operation
>  nfsserver_4_start_0 (13) Timed Out (timeout=5000ms)
> ---->                              ^^^^^^^^^^^^^^^^^
> crmd: [12358]: info: append_restart_list: Resource nfsserver_4 does
> not support reloads
>
> tengine: [12364]: WARN: status_from_rc: Action start on sot0000140
> failed (target: (null) vs. rc: -1): Timed Out
> ---->                              ^^^^^^^^^^^^^^^^^
>
> pengine: [12365]: notice: StopRsc:   sot0000140       Stop datadisk_2
> pengine: [12365]: notice: StopRsc:   sot0000140       Stop Filesystem_3
> pengine: [12365]: notice: StopRsc:   sot0000140       Stop nfsserver_4
> --------------------
>
> Why failed the resources?
>
> AFAIK should use HA /etc/init.d/nfssserver as init-script. When I
> call the nfssserver init-script with arg "status" I got an exit
> code "0".
>
> Isn't it correct?
>
> Any hints?
>
> Best regards.
>
>
> Andre
>
>
> PS: You can see the full /var/log/message && cib.xml here:
>
> http://www.linux-experience.de/cib.xml
> http://www.linux-experience.de/messages.1
> http://www.linux-experience.de/messages.2
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to