Hello all,

I need some help with the LSB nfsserver resource on SLES9 64-bit Linux.

We run a two-node cluster with heartbeat 2.0.8 and crm=yes...
I configured DRBD/NFS/etc. Everything works fine when I start the
services manually without HA.

The Problem:

Heartbeat starts correctly, IMHO. I can see the nodes in crm_mon
with status "online".

For a few seconds all resources were started on my preferred master,
but then heartbeat stopped them all ;(

The same problem occurs on the secondary ;(

-- cib.xml -- resources --------------

        <primitive class="lsb" id="nfsserver_4" type="nfsserver">
          <operations>
            <op id="nfsserver_4_mon" interval="120s" name="monitor"
                timeout="240s"/>
          </operations>
        </primitive>
- - - - - - - - - - - - - - - - 

When I remove this entry from the cib.xml, all the other resources
(drbddisk/Filesystem/IPaddr/Mailto) start correctly via HA!

So I looked at /var/log/messages and found some strange log entries:

--/var/log/messages -------------------
crmd: info: do_lrm_rsc_op: Performing op=nfsserver_4_start_0 
lrmd: WARN: For LSB init script, no additional parameters are needed.

lrmd: info: RA output: (nfsserver_4:start:stdout) Starting kernel \
                                                      based NFS server


lrmd: WARN: on_op_timeout_expired: TIMEOUT: operation \
            start[13] on lsb::nfsserver::nfsserver_4 for client, its \
            parameters: CRM_meta_op_target_rc=[7] \
           CRM_meta_timeout=[5000] crm_feature_set=[1.0.7] .
---->                              ^^^^^^^^^^^^^^^^^
crmd: [12358]: ERROR: process_lrm_event: LRM operation
 nfsserver_4_start_0 (13) Timed Out (timeout=5000ms)
---->                              ^^^^^^^^^^^^^^^^^
crmd: [12358]: info: append_restart_list: Resource nfsserver_4 does 
not support reloads

tengine: [12364]: WARN: status_from_rc: Action start on sot0000140 
failed (target: (null) vs. rc: -1): Timed Out
---->                              ^^^^^^^^^^^^^^^^^

pengine: [12365]: notice: StopRsc:   sot0000140 Stop datadisk_2
pengine: [12365]: notice: StopRsc:   sot0000140 Stop Filesystem_3
pengine: [12365]: notice: StopRsc:   sot0000140 Stop nfsserver_4
--------------------
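
The 5000 ms limit in the log above can be compared with how long a manual
start really takes. Below is a minimal sketch of such a measurement. The
function name time_action is made up for illustration, and it assumes a
"date" that supports +%s (GNU and BSD date do), so it only resolves whole
seconds.

```shell
# Sketch: measure how long one init script action takes.
# Prints "<elapsed seconds> <exit code>" on a single line.
time_action() {
    script=$1    # path to the init script
    action=$2    # e.g. start, stop, status

    begin=$(date +%s)
    # Run the action quietly; only duration and exit code matter here.
    "$script" "$action" >/dev/null 2>&1
    rc=$?
    end=$(date +%s)

    echo "$((end - begin)) $rc"
}
```

Run as root with heartbeat stopped, for example:
time_action /etc/init.d/nfsserver start
An elapsed value close to or above 5 would be consistent with the
"Timed Out (timeout=5000ms)" message.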

Why did the resources fail?

AFAIK, HA should use /etc/init.d/nfsserver as the init script. When I call
the nfsserver init script with the argument "status", I get exit code 0.

Isn't that correct?
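
For reference, here is a minimal sketch of how the status check can be
scripted. The two function names are made up for illustration. The meaning
of the codes follows the LSB init script convention for "status" (0 =
running, 1 and 2 = dead with a stale pid or lock file, 3 = not running);
whether this particular init script follows it in every state is exactly
what is being checked.

```shell
# Sketch: inspect the exit code an LSB init script returns for "status".

lsb_status_rc() {
    # Run "<script> status" quietly and print only its exit code.
    script=$1
    "$script" status >/dev/null 2>&1
    echo $?
}

describe_lsb_status() {
    # Map an LSB "status" exit code to a short description.
    case "$1" in
        0)   echo "running" ;;
        1|2) echo "dead (stale pid or lock file)" ;;
        3)   echo "not running" ;;
        *)   echo "unknown" ;;
    esac
}
```

Example: describe_lsb_status "$(lsb_status_rc /etc/init.d/nfsserver)"
The check is most useful when run twice, once with the NFS server stopped
and once with it started, to see whether the code changes between 3 and 0.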

Any hints?

Best regards.


Andre


PS: You can see the full /var/log/messages and cib.xml here:

http://www.linux-experience.de/cib.xml
http://www.linux-experience.de/messages.1
http://www.linux-experience.de/messages.2
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
