Hi,
we have a couple of Dell servers with Red Hat 5.2 and OpenVZ, sharing a GFS filesystem.

We have noticed that there are a directory which processes stalls when try to access it.
For instance look this processes:

[r...@parmenides ~]# ps -fel | grep save
4 D root 8997 1 1 78 0 - 1780 339955 09:40 ? 00:02:31 /usr/sbin/save -s espai.upc.es -g Virtuals -LL -f - -m parmenides -t 1236294005 -l 4 -q -W 78 -N /mnt/gfs /mnt/gfs 0 S root 16736 21208 0 78 0 - 980 pipe_w 12:07 pts/1 00:00:00 grep save 4 D root 18796 1 1 78 0 - 1777 339955 08:46 ? 00:02:16 /usr/sbin/save -s espai.upc.es -g Virtuals -LL -f - -m parmenides -t 1236294005 -l 4 -q -W 78 -N /mnt/gfs /mnt/gfs

Both processes are stalled reading a file:

# lsof -p 8997 | grep gfs
save 8997 root cwd DIR 253,7 2048 7022183 /mnt/gfs/vz/private/109/usr/lib/openoffice/program
save    8997 root    3r   DIR   253,7     3864        26 /mnt/gfs
save    8997 root    6r   DIR   253,7     3864       232 /mnt/gfs/vz
save    8997 root    7r   DIR   253,7     3864       233 /mnt/gfs/vz/private
save 8997 root 8r DIR 253,7 3864 230761349 /mnt/gfs/vz/private/109 save 8997 root 9r DIR 253,7 3864 230773154 /mnt/gfs/vz/private/109/usr save 8997 root 12r DIR 253,7 2048 7003944 /mnt/gfs/vz/private/109/usr/lib save 8997 root 14r DIR 253,7 3864 7022175 /mnt/gfs/vz/private/109/usr/lib/openoffice

# lsof -p 18796 | grep gfs
save 18796 root cwd DIR 253,7 2048 7022183 /mnt/gfs/vz/private/109/usr/lib/openoffice/program
save    18796 root    3r   DIR   253,7     3864        26 /mnt/gfs
save    18796 root    6r   DIR   253,7     3864       232 /mnt/gfs/vz
save 18796 root 7r DIR 253,7 3864 233 /mnt/gfs/vz/private save 18796 root 8r DIR 253,7 3864 230761349 /mnt/gfs/vz/private/109 save 18796 root 9r DIR 253,7 3864 230773154 /mnt/gfs/vz/private/109/usr save 18796 root 12r DIR 253,7 2048 7003944 /mnt/gfs/vz/private/109/usr/lib save 18796 root 14r DIR 253,7 3864 7022175 /mnt/gfs/vz/private/109/usr/lib/openoffice

Also there is a process with the glock_ flag accesing the same:

0 D root 8425 6783 0 78 0 - 669 glock_ 08:24 ? 00:00:00 /usr/lib/openoffice/program/pagein -L/usr/lib/openoffice/program @pagein-common

What can be the problem? A corruption in the filesystem?
should a "gfs_fsck" fix the problem?
Regards.

Frank





--
Aquest missatge ha estat analitzat per MailScanner
a la cerca de virus i d'altres continguts perillosos,
i es considera que està net.
For all your IT requirements visit: http://www.transtec.co.uk

--
Linux-cluster mailing list
Linux-cluster@redhat.com
https://www.redhat.com/mailman/listinfo/linux-cluster

Reply via email to