[Linux-cluster] NFS4 service?

2009-02-24 Thread Corey Kovacs
After a lot of fighting with KDE and Firefox on a cluster exporting home dirs via NFS3, I found out that NFS4 works like a charm. Then I began to wonder why there seems to be no documentation on exporting NFS4 filesystems via RHCS. Is there something I am missing? The only way I could think of to d

Re: [Linux-cluster] two node cluster with IP tiebreaker failed.

2009-02-24 Thread Mockey Chen
ext Mockey Chen wrote: > ext Kein He wrote: > >> Hi Mockey, >> >> Could you please attach the output from " cman_tool status " and " >> cman_tool nodes -f" ? >> >> > Thanks your response. > > I try to run cman_tool status on as-2, but it hang, without output, and > even Ctrl+C also no effec

Re: [Linux-cluster] two node cluster with IP tiebreaker failed.

2009-02-24 Thread Mockey Chen
ext Kein He wrote: > Hi Mockey, > > Could you please attach the output from " cman_tool status " and " > cman_tool nodes -f" ? > Thanks your response. I try to run cman_tool status on as-2, but it hang, without output, and even Ctrl+C also no effect. I open a new window and can using ssh to as-2,

Re: [Linux-cluster] two node cluster with IP tiebreaker failed.

2009-02-24 Thread Kein He
Hi Mockey, Could you please attach the output from " cman_tool status " and " cman_tool nodes -f" ? Mockey Chen wrote: Hi, I have a two-nodes cluster, to avoid split-brain. I use ilo as fence device, IP tiebreaker. here is my /etc/cluster/cluster.conf

[Linux-cluster] two node cluster with IP tiebreaker failed.

2009-02-24 Thread Mockey Chen
Hi, I have a two-nodes cluster, to avoid split-brain. I use ilo as fence device, IP tiebreaker. here is my /etc/cluster/cluster.conf

Re: [Linux-cluster] Re: Fencing test

2009-02-24 Thread Paras pradhan
On Tue, Feb 24, 2009 at 3:15 PM, Paras pradhan wrote: > Hi, > > Was busy on some other stuffs. > > On Fri, Jan 16, 2009 at 5:16 AM, Rajagopal Swaminathan > wrote: >> Greetings, >> >> On Thu, Jan 15, 2009 at 1:18 AM, Paras pradhan >> wrote: On Fri, Jan 9, 2009 at 12:09 AM, Paras pradhan >

Re: [Linux-cluster] Re: Fencing test

2009-02-24 Thread Paras pradhan
Hi, Was busy on some other stuffs. On Fri, Jan 16, 2009 at 5:16 AM, Rajagopal Swaminathan wrote: > Greetings, > > On Thu, Jan 15, 2009 at 1:18 AM, Paras pradhan wrote: >>> On Fri, Jan 9, 2009 at 12:09 AM, Paras pradhan >>> wrote: In an act to solve my fencing issue in my 2 node

Re: [Linux-cluster] Monitoring Failovers

2009-02-24 Thread Burton Simonds
Just as a followup, I took a look at the output of the clustat -x, and one of the values is "last transition". I wrote a check that looks at a given service and then calculates the difference between the current time and the last transition. If that time is lower than a given threshold, it alarms

Re: [Linux-cluster] fencing loop in a 2-node partitioned cluster

2009-02-24 Thread Marc Grimme
On Tuesday 24 February 2009 16:59:26 Gianluca Cecchi wrote: > thanks, but where do I have to put the timeout? > Inside fence seciotn of the nodes: > > > > >

Re: [Linux-cluster] Re: [Ocfs2-users] GFS2/OCFS2 scalability

2009-02-24 Thread Joel Becker
On Tue, Feb 24, 2009 at 01:06:42AM -0800, SUVANKAR MOITRA wrote: > Can we copy directly from OCFS to normat filesystem( like : ext3,riserfs etc) While mounted? Of course. Same as any filesystem. Joel -- Life's Little Instruction Book #15 "Own a great stereo system." Joel Be

Re: [Linux-cluster] fencing loop in a 2-node partitioned cluster

2009-02-24 Thread Gianluca Cecchi
thanks, but where do I have to put the timeout? Inside fence seciotn of the nodes: or inside definition of fence devices:

Re: [Linux-cluster] fencing loop in a 2-node partitioned cluster

2009-02-24 Thread Marc Grimme
We've solved this problem by using fence_timeouts that are dependent on the nodeid. Means node0 gets timeout=0 and node1 gets timeout=10. Then node0 will always survive. That's not the optimum way but works. Or use qdiskd and let it detect the networkpartitioning (whereever it happens) and decid

Re: [Linux-cluster] fencing loop in a 2-node partitioned cluster

2009-02-24 Thread Gianluca Cecchi
And these are the logs I see on the wo nodes: the first node: Feb 23 16:26:38 oracs1 openais[6020]: [TOTEM] The token was lost in the OPERATIONAL state. Feb 23 16:26:38 oracs1 openais[6020]: [TOTEM] Receive multicast socket recv buffer size (288000 bytes). Feb 23 16:26:38 oracs1 openais[6020]: [TOT

Re: [Linux-cluster] Re: [Ocfs2-users] GFS2/OCFS2 scalability

2009-02-24 Thread SUVANKAR MOITRA
Hi Sunil,   Can we copy directly from OCFS to normat filesystem( like : ext3,riserfs etc)   Thanks & regards   Suvankar --- On Tue, 2/24/09, Sunil Mushran wrote: From: Sunil Mushran Subject: [Linux-cluster] Re: [Ocfs2-users] GFS2/OCFS2 scalability To: "Kirill Kuvaldin" Cc: linux-fsde...@vger.

Re: [Linux-cluster] fencing loop in a 2-node partitioned cluster

2009-02-24 Thread Gianluca Cecchi
Actually my situation is pretty different and worse. two nodes cluster with qdisk and hp ilo based fencing, components rh el 5U3 based. if I panic a node, the other correctly fence it with default action of rebooting it. And also the converse is true. But if for example I get down the intracluster

Re: [Linux-cluster] fencing loop in a 2-node partitioned cluster

2009-02-24 Thread Kein He
Hi Rajeev, there are several ways to stop the fencing loop: 1. import the third node to the cluster, as a result the quorum votes will great than half total votes . 2. Using qdisk, you can implement the Tie-Breaker IP: use heuristic option to monitor the Gateway. qdisk will also increase the