|
Hi, We have 2 Dell 1850’s in a cluster, both
machines are running Redhat Enterprise Linux 4 AS, update 2. The boxes are connected to a Dell EMC CX300 using emulex
HBA’s The cluster is running an Oracle 10gR2 std edition
RAC. We are using ocfs2 to store files generated by our
application and not to store anything to do with the database. We’ve been having a few problems were the
servers appear to hang, and have to be shutdown (using the powerbutton) and
then started up again. This seems to be happening every weekend and I don’t
really understand what’s happening, or how to fix it. I’ve included an extract from messages in the
hope someone can shed some light on the matter. Kind regards Andrew Sep 17 22:06:04 argon2 kernel:
(0,0):o2net_idle_timer:1310 connection to node argon1.crewe.ukfuels.co.uk (num
0) at 10.1.1.110:7777 has been idle for 10 seconds, shutting it down. Sep 17 22:06:04 argon2 kernel:
(0,0):o2net_idle_timer:1321 here are some times that might help debug the
situation: (tmr 1158527154.993223 now 1158527164.993090 dr 1158527154.993213
adv 1158527154.993227:1158527154.993228 func (101e0528:505)
1158527153.796194:1158527153.796200) Sep 17 22:06:04 argon2 kernel: (3854,0):o2net_set_nn_state:411
no longer connected to node argon1.crewe.ukfuels.co.uk (num 0) at
10.1.1.110:7777 Sep 17 22:06:04 argon2 kernel:
(73,3):dlm_send_remote_unlock_request:350 ERROR: status = -112 Sep 17 22:06:04 argon2 kernel:
(73,3):dlm_send_remote_unlock_request:350 ERROR: status = -107 Sep 17 22:06:05 argon2 last message repeated 185
times Sep 17 22:06:05 argon2 kernel:
(26144,1):dlm_send_remote_unlock_request:350 ERROR: status = -107 Sep 17 22:06:05 argon2 last message repeated 154
times Sep 17 22:06:05 argon2 kernel:
(25274,2):dlm_send_remote_unlock_request:350 ERROR: status = -107 Sep 17 22:06:05 argon2 last message repeated 123
times Sep 17 22:06:05 argon2 kernel:
(73,3):dlm_send_remote_unlock_request:350 ERROR: status = -107 Sep 17 22:06:05 argon2 last message repeated 472
times Sep 17 22:06:05 argon2 kernel:
(73,1):dlm_send_remote_unlock_request:350 ERROR: status = -107 Sep 17 22:06:08 argon2 last message repeated 3239
times Sep 17 22:06:08 argon2 kernel:
(73,3):dlm_send_remote_unlock_request:350 ERROR: status = -107 Sep 17 22:06:08 argon2 last message repeated 118
times Sep 17 22:06:08 argon2 kernel:
(73,1):dlm_send_remote_unlock_request:350 ERROR: status = -107 Sep 18 08:40:32 argon2 syslogd 1.4.1: restart. Sep 18 08:40:32 argon2 syslog: syslogd startup succeeded Sep 18 08:40:32 argon2 kernel: klogd 1.4.1, log
source = /proc/kmsg started. Sep 18 08:40:32 argon2 kernel: Bootdata ok (command
line is ro root=LABEL=/ apic rhgb quiet) Sep 18 08:40:32 argon2 kernel: Linux version
2.6.9-22.0.1.ELsmp ([EMAIL PROTECTED]) (gcc version 3.4.4
20050721 (Red Hat 3.4.4-2)) #1 SMP Andrew Brunton Senior Application Developer UK Fuels Limited Tel +44 (0)1270 655636 Fax +44 (0)1270 655700 [EMAIL PROTECTED] |
BEGIN:VCARD VERSION:2.1 N:Brunton;Andrew FN:Andrew Brunton ([EMAIL PROTECTED]) ORG:UK Fuels Limited;IT TITLE:Senior Application Developer TEL;WORK;VOICE:01270655636 TEL;WORK;FAX:01270655736 ADR;WORK;ENCODING=QUOTED-PRINTABLE:;;Card Management Centre=0D=0APO Box 262;Crewe;Cheshire;CW2 6GF;United Kingd= om LABEL;WORK;ENCODING=QUOTED-PRINTABLE:Card Management Centre=0D=0APO Box 262=0D=0ACrewe, Cheshire CW2 6GF=0D=0AUni= ted Kingdom ADR;POSTAL;ENCODING=QUOTED-PRINTABLE:;;6th Floor=0D=0ARailhouse=0D=0AGresty Road;Crewe;Cheshire;CW2 6EA;United Ki= ngdom LABEL;POSTAL;ENCODING=QUOTED-PRINTABLE:6th Floor=0D=0ARailhouse=0D=0AGresty Road=0D=0ACrewe, Cheshire CW2 6EA=0D= =0AUnited Kingdom EMAIL;PREF;INTERNET:[EMAIL PROTECTED] REV:20040907T122345Z END:VCARD
_______________________________________________ Ocfs2-users mailing list [email protected] http://oss.oracle.com/mailman/listinfo/ocfs2-users
