Hello all,

the value of "default-action-timeout" was too small ;(
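For the archive, a minimal sketch of one other way to give the start action more time: an explicit timeout on a "start" operation of the resource itself, next to the existing monitor operation. This assumes the <op> element accepts name="start" with a timeout attribute in the same way as the monitor op in my original cib.xml. The id "nfsserver_4_start" and the value "60s" are only illustrative and not taken from my real configuration.

```xml
<primitive class="lsb" id="nfsserver_4" type="nfsserver">
  <operations>
    <!-- hypothetical entry: explicit start timeout instead of the 5000ms default seen in the log -->
    <op id="nfsserver_4_start" name="start" timeout="60s"/>
    <!-- unchanged monitor operation from the original cib.xml -->
    <op id="nfsserver_4_mon" interval="120s"
        name="monitor" timeout="240s"/>
  </operations>
</primitive>
```

The timeout should be longer than the init script really needs for "start" on the slowest node.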
Everything works fine now :)

Greetings,
Andre

On Thursday, 14 June 2007, Andre Heine wrote:
> Hello all,
>
> I need some help with the LSB nfsserver on SLES9 64-bit Linux.
>
> We run a two-node cluster with heartbeat 2.0.8 and crm=yes...
> I configured DRBD/NFS/etc. Everything works fine when I start the
> services manually without HA.
>
> The problem:
>
> Heartbeat starts correctly, IMHO. I can see the nodes in crm_mon
> with status "online".
>
> For a few seconds all resources were started on my preferred master,
> but then heartbeat stopped them all ;(
>
> On the secondary, the same problem ;(
>
> -- cib.xml -- resources --------------
>
> <primitive class="lsb" id="nfsserver_4" type="nfsserver">
>   <operations>
>     <op id="nfsserver_4_mon" interval="120s"
>         name="monitor" timeout="240s"/>
>   </operations>
> </primitive>
> - - - - - - - - - - - - - - - -
>
> When I remove this entry from the cib.xml, all other resources
> (drbddisk/Filesystem/IPaddr/Mailto) start correctly via HA!
>
> So I looked at /var/log/messages and found some strange log entries:
>
> --/var/log/messages -------------------
> crmd: info: do_lrm_rsc_op: Performing op=nfsserver_4_start_0
> lrmd: WARN: For LSB init script, no additional parameters are
> needed.
>
> lrmd: info: RA output: (nfsserver_4:start:stdout) Starting kernel \
>   based NFS server
>
> lrmd: WARN: on_op_timeout_expired: TIMEOUT: operation \
>   start[13] on lsb::nfsserver::nfsserver_4 for client, its \
>   parameters: CRM_meta_op_target_rc=[7] \
>   CRM_meta_timeout=[5000] crm_feature_set=[1.0.7] .
> ----> ^^^^^^^^^^^^^^^^^
> crmd: [12358]: ERROR: process_lrm_event: LRM operation
> nfsserver_4_start_0 (13) Timed Out (timeout=5000ms)
> ----> ^^^^^^^^^^^^^^^^^
> crmd: [12358]: info: append_restart_list: Resource nfsserver_4 does
> not support reloads
>
> tengine: [12364]: WARN: status_from_rc: Action start on sot0000140
> failed (target: (null) vs. rc: -1): Timed Out
> ----> ^^^^^^^^^^^^^^^^^
>
> pengine: [12365]: notice: StopRsc: sot0000140 Stop datadisk_2
> pengine: [12365]: notice: StopRsc: sot0000140 Stop Filesystem_3
> pengine: [12365]: notice: StopRsc: sot0000140 Stop nfsserver_4
> --------------------
>
> Why did the resources fail?
>
> AFAIK HA should use /etc/init.d/nfsserver as the init script. When I
> call the nfsserver init script with the argument "status", I get
> exit code "0".
>
> Isn't that correct?
>
> Any hints?
>
> Best regards,
>
> Andre
>
> PS: You can see the full /var/log/messages and cib.xml here:
>
> http://www.linux-experience.de/cib.xml
> http://www.linux-experience.de/messages.1
> http://www.linux-experience.de/messages.2
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
