Re: [Ocfs2-users] Mysterious server reboot

2011-04-04 Thread Nikola Savic
  Hello,   If anyone is interested, here is update on problem we had with OCFS and DRL drop ref bug.   After installing kernel-2.6.32-100.0.19.el5.x86_64.rpm and OCFS2 1.6 packages from Oracle's Centos5.5 public yum, we didn't experience DLM drop ref bug and kernel didn't oops or panic. Mayb

Re: [Ocfs2-users] Mysterious server reboot

2011-03-26 Thread Nikola Savic
Hi all, Just keep you informed :) After 7 days of normal operations, we again had server failure because of OCFS2/DLM drop reference bug. I have added log is on end of message. We're running Centos5 with latest available RedHat kernel 2.6.18-238.5.1.el5 and OCFS2 1.4.7 installed from packa

Re: [Ocfs2-users] Mysterious server reboot

2011-03-18 Thread Nikola Savic
  Hi again,   I have installed newest kernel available for RHEL5 (2.6.18-238.5.1.el5) and OCFS2 packages to match. After only few hours of running, while rsync backup was done, on one of nodes I got following error (after which it hang and required reset): Mar 19 03:42:52 server3 kernel: (dl

Re: [Ocfs2-users] Mysterious server reboot

2011-03-18 Thread Sunil Mushran
On 03/18/2011 04:56 PM, Nikola Savic wrote: Sunil Mushran wrote: This specific bug (associated with the message) has been fixed here. http://oss.oracle.com/git/?p=ocfs2-1.4.git;a=commit;h=1f667766cb67ed05b4d706aa82e8ad0b12eaae8b This should result in an oops and thus panic. But just on this node

Re: [Ocfs2-users] Mysterious server reboot

2011-03-18 Thread Nikola Savic
Sunil Mushran wrote: This specific bug (associated with the message) has been fixed here. http://oss.oracle.com/git/?p=ocfs2-1.4.git;a=commit;h=1f667766cb67ed05b4d706aa82e8ad0b12eaae8b This should result in an oops and thus panic. But just on this node.   What is solution? Are there RPM p

Re: [Ocfs2-users] Mysterious server reboot

2011-03-18 Thread Sunil Mushran
This specific bug (associated with the message) has been fixed here. http://oss.oracle.com/git/?p=ocfs2-1.4.git;a=commit;h=1f667766cb67ed05b4d706aa82e8ad0b12eaae8b This should result in an oops and thus panic. But just on this node. If other nodes are rebooting then I suspect some sysctl values a

[Ocfs2-users] Mysterious server reboot

2011-03-18 Thread Nikola Savic
  Hi,   I have 3 nodes cluster using OCFS2 1.4 on CentOS5.5 (kernel 2.6.18-194). Two nodes (server1 and server2) are providing shared storage using DRBD. Shared storage exported to nodes using iSCSI (server1 is target and all other nodes are iSCSI initiators).   Today cluster went down. Serv