Hi,

We've just setup a three node GFS cluster on Debian Etch using qlogic HBA 
against a SAN.

gfs_tool 1.03.00 (built Mar  8 2007 23:38:09)
Copyright (C) Red Hat, Inc.  2004-2005  All rights reserved.

Linux cms2 2.6.18-5-amd64 #1 SMP Tue Oct 2 20:37:02 UTC 2007 x86_64 
GNU/Linux

We start the cluster and it works fine for a while..

/sbin/lock_gulmd -n aicluster -s cms1,cms2,cmsqa
sleep 1
/bin/mount -t gfs -o acl /dev/sda /san

But eventually after hours or a day something freezes/hangs and we can't 
issue any commands like df/ls/du etc..

There is no evidence that anything is wrong though.. This command seems to 
show a working cluster right?

cmsqa:/home/alfresco# gulm_tool nodelist cms1
 Name: cms2
  ip    = ::ffff:192.168.1.139
  state = Logged in
  last state = Logged out
  mode = Slave
  missed beats = 0
  last beat = 1193685839882270
  delay avg = 10003803
  max delay = 755383848
 
 Name: cmsqa
  ip    = ::ffff:128.1.32.134
  state = Logged in
  last state = Logged out
  mode = Slave
  missed beats = 0
  last beat = 1193685841974801
  delay avg = 10003928
  max delay = 138560844
 
 Name: cms1
  ip    = ::ffff:192.168.1.137
  state = Logged in
  last state = Was Logged in
  mode = Master
  missed beats = 0
  last beat = 1193685842490217
  delay avg = 10003231
  max delay = 10007256


Any ideas? We need to reboot the boxes to get the cluster back.

Damon.
Working to protect human rights worldwide

DISCLAIMER
Internet communications are not secure and therefore Amnesty International Ltd 
does not accept legal responsibility for the contents of this message. If you 
are not the intended recipient you must not disclose or rely on the information 
in this e-mail. Any views or opinions presented are solely those of the author 
and do not necessarily represent those of Amnesty International Ltd unless 
specifically stated. Electronic communications including email might be 
monitored by Amnesty International Ltd. for operational or business reasons.

This message has been scanned for viruses by Postini.
www.postini.com
--
Linux-cluster mailing list
[email protected]
https://www.redhat.com/mailman/listinfo/linux-cluster

Reply via email to