Re: [Linux-cluster] Ricci doesn't work

2012-08-20 Thread Chip Burke
Thanks for sticking with me on this. Here's the log: https://dl.dropbox.com/u/8137282/strace.ricci.tgz Chip Burke On 8/20/12 10:28 AM, "Jan Pokorný" wrote: >Hello Chip, > >On 17/08/12 15:14 +, Chip Burke wrote: >> Libvirt is not installed on any

Re: [Linux-cluster] Ricci doesn't work

2012-08-20 Thread Jan Pokorný
Hello Chip, On 17/08/12 15:14 +, Chip Burke wrote: > Libvirt is not installed on any of the hosts. could you please provide a strace log (best as gzipped attachment sent off-list, or, you can use e.g. fpaste.org and provide a link if the log is not so huge). Something like (untested): # str

Re: [Linux-cluster] RHEL Cluster rpm fresh installation on rhel5 32bit

2012-08-20 Thread Robert Hayden
I believe the document you are following is for RHEL 4. The packages I typically pull are as follows. They will pull in others as needed. # RHCS Specific Packages #cman.x86_64 #openais.x86_64 #lvm2-cluster.x86_64 #gfs2-utils.x86_64 #rgmanager.x86_64 #system-config

Re: [Linux-cluster] Cluster logging issues + rgmanager doesn't notice failed vms

2012-08-20 Thread Bart Verwilst
Not sure if it will help, but here is some more debugging output for the locking: vm02 is the bad node, vm03 can reach the mounts fine. root@vm02-test:~# cat /sys/kernel/debug/gfs2/kvm\:qemu/glocks G: s:UN n:2/19 f:o t:UN d:EX/0 a:0 v:0 r:1 m:50 G: s:SH n:2/183f3 f:Iqob t:SH d:EX/0 a:0 v:0 r:2

Re: [Linux-cluster] Cluster logging issues + rgmanager doesn't notice failed vms

2012-08-20 Thread Bart Verwilst
At the same time, i notice a hanging /etc/libvirt/qemu gfs2 mount ( while /var/lib/libvirt/sanlock still works fine ) on vm02. vm01 and vm03 have perfectly accessible mounts. Nothing special to see in syslog or dmesg.. /dev/mapper/iscsi_cluster_qemu on /etc/libvirt/qemu type gfs2 (rw,relatime

[Linux-cluster] Cluster logging issues + rgmanager doesn't notice failed vms

2012-08-20 Thread Bart Verwilst
Hello again ;) My cluster seems to be logging only to /var/log/syslog, and even then only from the corosync daemon, the /var/log/cluster logs are empty: root@vm01-test:~# ls -al /var/log/cluster/*.log -rw--- 1 root root 0 Aug 16 06:50 /var/log/cluster/corosync.log -rw--- 1 root root 0

Re: [Linux-cluster] gfs2 blocking tasks

2012-08-20 Thread Bart Verwilst
Nothing out of the ordinary, should have mentioned that! Aug 19 00:08:00 vm02-test corosync[7394]: [CMAN ] daemon: read 20 bytes from fd 17 Aug 19 00:08:00 vm02-test corosync[7394]: [CMAN ] daemon: client command is 7 Aug 19 00:08:00 vm02-test corosync[7394]: [CMAN ] daemon: About to