Having an odd problem on a new test environment we are setting up for a partner. And not sure where to look next to figure out the problem or really understand what the dht_layout_dir_mismatch INFO message is telling me.
I was turning up a 4 node distributed volume, each brick is its own 19TB ext4 partition on a hardware raid5. Each node has the volume mounted back to itself at /glusterfs via the fuse mount. At some point files I copy to the /glusterfs on Node A, are not viewable at /glusterfs on the other 3 Nodes. Once that starts happening I notice the below log messages generated anytime i try manipulating those paths. When I try to remove a directory doing this from any node with 'rm -rf /glusterfs/users/TARGETDIR' it fails to remove any of the files stored on Node A. Far as I can tell none of the glusterd services are down, or unresponsive. The only thing I did different on this setup is that I recreated and destroyed the gluster volume a couple times before hand. Stopping and deleting the gluster volume with the cli, umount'ing the brick's partition and rerunning mkfs.ext4 between each time so the filesystem would be clean. If you delete and recreate the volume it will continue to do the above. Ubuntu 12.04 LTS Linux kg14-compute-64 3.2.0-52-generic #78-Ubuntu SMP Fri Jul 26 16:21:44 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux root@kg14-compute-64:/home/lacadmin# dpkg -l | grep gluster ii glusterfs-client 3.4.0final-ubuntu1~precise1 clustered file-system (client package) ii glusterfs-common 3.4.0final-ubuntu1~precise1 GlusterFS common libraries and translator modules ii glusterfs-server 3.4.0final-ubuntu1~precise1 clustered file-system (server package) /etc/fstab ------ /dev/sdb1 /exports/gluster ext4 defaults,noatime,acl,usrjquota=aquota.user,grpjquota=aquota.group,jqfmt=vfsv0 0 0 localhost:/USER-HOME /glusterfs glusterfs defaults,noauto,nobootwait 0 0 glusterfs.log ------ [2013-09-18 21:48:54.686845] I [dht-common.c:623:dht_revalidate_cbk] 0-USER-HOME-dht: mismatching layouts for /users/rpowell1/benchmark [2013-09-18 21:48:54.687492] I [dht-layout.c:630:dht_layout_normalize] 0-USER-HOME-dht: found anomalies in /users/rpowell1/benchmark. holes=1 overlaps=1 [2013-09-18 22:04:32.671426] W [socket.c:514:__socket_rwv] 0-glusterfs: readv failed (No data available) [2013-09-18 22:04:32.671484] W [socket.c:1962:__socket_proto_state_machine] 0-glusterfs: reading from socket failed. Error (No data available), peer (127.0.0.1:24007) [2013-09-18 22:04:42.977516] I [glusterfsd-mgmt.c:1583:mgmt_getspec_cbk] 0-glusterfs: No change in volfile, continuing [2013-09-18 22:14:25.221279] I [dht-layout.c:745:dht_layout_dir_mismatch] 0-USER-HOME-dht: subvol: USER-HOME-client-2; inode layout - 2147483646 - 3221225468; disk layout - 0 - 1073741822 [2013-09-18 22:14:25.221338] I [dht-common.c:623:dht_revalidate_cbk] 0-USER-HOME-dht: mismatching layouts for / [2013-09-18 22:14:25.221975] I [dht-layout.c:630:dht_layout_normalize] 0-USER-HOME-dht: found anomalies in /. holes=1 overlaps=1 [2013-09-19 00:23:11.341443] I [dht-layout.c:745:dht_layout_dir_mismatch] 0-USER-HOME-dht: subvol: USER-HOME-client-2; inode layout - 2147483646 - 3221225468; disk layout - 0 - 1073741822 -- Raymond Powell Systems Administrator Laboratory for Advanced Computing University of Chicago, KCBD 10160 900 East 57th Street Chicago, IL 60637 _______________________________________________ Gluster-users mailing list Gluster-users@gluster.org http://supercolony.gluster.org/mailman/listinfo/gluster-users