Re: [Gluster-users] Brick crashes

2012-06-08 Thread Pranith Kumar Karampuri
hi Ling Ho, It seems like you are using rdma, could you confirm? I am suspecting a memory leak. Could you help me confirm if that is the case. Please post the output of the following: 1) when you start the brick perform 'kill -USR1 ' This will save a file /tmp/glusterdump. 2) mv /tmp/glusterdu

Re: [Gluster-users] Brick crashes

2012-06-08 Thread Albert Zhang
Looks like some process hung there due to memory issues in kernel, error message from the very beginning would be helpful Sent from my iPhone On 2012-6-9, at 上午8:26, Ling Ho wrote: > Hi Anand, > > ulimit -l running as root is 64. > > > This dmesg out is from the second system. > > I don't

Re: [Gluster-users] Brick crashes

2012-06-08 Thread Ling Ho
Hi Anand, ulimit -l running as root is 64. This dmesg out is from the second system. I don't see any new on the first system other that what were there when system booted. Do you want to see the whole dmesg output? Where should I post it, there are 1600 lines. ... ling INFO: task glusterf

Re: [Gluster-users] Brick crashes

2012-06-08 Thread Anand Avati
Those are 4.x GB. Can you post dmesg output as well? Also, what's 'ulimit -l' on your system? On Fri, Jun 8, 2012 at 4:41 PM, Ling Ho wrote: > > This is the core file from the crash just now > > [root@psanaoss213 /]# ls -al core* > -rw--- 1 root root 4073594880 Jun 8 15:05 core.22682 > > Fr

Re: [Gluster-users] Brick crashes

2012-06-08 Thread Ling Ho
This is the core file from the crash just now [root@psanaoss213 /]# ls -al core* -rw--- 1 root root 4073594880 Jun 8 15:05 core.22682 From yesterday: [root@psanaoss214 /]# ls -al core* -rw--- 1 root root 4362727424 Jun 8 00:58 core.13483 -rw--- 1 root root 4624773120 Jun 8 03:21

Re: [Gluster-users] Brick crashes

2012-06-08 Thread Anand Avati
Is it possible the system was running low on memory? I see you have 48GB, but memory registration failure typically would be because the system limit on the number of pinnable pages in RAM was hit. Can you tell us the size of your core dump files after the crash? Avati On Fri, Jun 8, 2012 at 4:22

[Gluster-users] Brick crashes

2012-06-08 Thread Ling Ho
Hello, I have a brick that crashed twice today, and another different brick that crashed just a while a go. This is what I see in one of the brick logs: patchset: git://git.gluster.com/glusterfs.git patchset: git://git.gluster.com/glusterfs.git signal received: 6 signal received: 6 time of cr

Re: [Gluster-users] too many redirects at the gluster download page

2012-06-08 Thread John Mark Walker
It comes up intermittently, and we're not sure why. I've had our team look at it before, and I'll have them look at it again. -JM - Original Message - > On Fri, Jun 08, 2012 at 09:59:04AM -0700, Harry Mangalam wrote: > >And when you click on the alternative 'Download' links (the > >

Re: [Gluster-users] Performance optimization tips Gluster 3.3? (small files / directory listings)

2012-06-08 Thread Brian Candler
On Fri, Jun 08, 2012 at 05:46:42PM +0100, Brian Candler wrote: > The VM boots with io='native' and bus='virtio', but performance is still > very poor: > > ubuntu@lucidtest:~$ dd if=/dev/zero of=/var/tmp/test.zeros bs=1024k > count=100 > 100+0 records in > 100+0 records out > 10485

Re: [Gluster-users] Performance optimization tips Gluster 3.3? (small files / directory listings)

2012-06-08 Thread Brian Candler
On Fri, Jun 08, 2012 at 02:23:57PM -0400, olav johansen wrote: >This is a single thread trying to process a sequential task where the >latency really becomes a problem with ls -aR I get similar speed: That's interesting. >[@web1 files]# time ls -aR|wc -l >1968316 >real27m2

Re: [Gluster-users] Performance optimization tips Gluster 3.3? (small files / directory listings)

2012-06-08 Thread olav johansen
Hi Brian, This is a single thread trying to process a sequential task where the latency really becomes a problem with ls -aR I get similar speed: [@web1 files]# time ls -aR|wc -l 1968316 real27m23.432s user0m5.523s sys 0m35.369s [@web1 files]# time ls -aR|wc -l 1968316 real26m2.72

Re: [Gluster-users] Performance optimization tips Gluster 3.3? (small files / directory listings)

2012-06-08 Thread Fernando Frediani (Qube)
Thanks for sharing that Brian, I wonder if the cause of the problem when trying to power Up VMware ESXi VMs is for the same reason. Fernando -Original Message- From: Brian Candler [mailto:b.cand...@pobox.com] Sent: 08 June 2012 17:47 To: Pranith Kumar Karampuri Cc: olav johansen; glust

Re: [Gluster-users] too many redirects at the gluster download page

2012-06-08 Thread Brian Candler
On Fri, Jun 08, 2012 at 09:59:04AM -0700, Harry Mangalam wrote: >And when you click on the alternative 'Download' links (the button on >the upper right or the larger "Download GlusterFS" icon with the >package image, you get this in Chrome: >This webpage has a redirect loop >The

Re: [Gluster-users] Performance optimization tips Gluster 3.3? (small files / directory listings)

2012-06-08 Thread Brian Candler
On Fri, Jun 08, 2012 at 05:46:42PM +0100, Brian Candler wrote: > but glusterfs objected to the cache='none' option (possibly this opens the > file with O_DIRECT?) Yes that's definitely the problem, as I can see if I strace the kvm process: stat("/gluster/safe/images/lucidtest/tmpaJqTD9.qcow2", {s

[Gluster-users] too many redirects at the gluster download page

2012-06-08 Thread Harry Mangalam
It may be just me/chrome, but trying to dl the latest gluster results by clicking on the Download button next to the Ant, leads not to a download page but to the info page. It invites you to go back to the gluster.org page from when you just came. And when you click on the alternative 'Downlo

Re: [Gluster-users] Performance optimization tips Gluster 3.3? (small files / directory listings)

2012-06-08 Thread Brian Candler
On Thu, Jun 07, 2012 at 02:36:26PM +0100, Brian Candler wrote: > I'm interested in understanding this, especially the split-brain scenarios > (better to understand them *before* you're stuck in a problem :-) > > BTW I'm in the process of building a 2-node 3.3 test cluster right now. FYI, I have g

[Gluster-users] Can't re-create deleted volume

2012-06-08 Thread Gerald Brandt
Hi, I created a test volume, deleted it, and can not re-create it. # gluster volume create nfstest replica 2 transport tcp nfstest1:/nfstest nfstest2:/nfstest # gluster volume delete nfstest # gluster volume create nfstest replica 2 transport tcp nfstest1:/export/nfs nfstest2:/export/nfs /expor

Re: [Gluster-users] Gluster 3.3.0 and VMware ESXi 5

2012-06-08 Thread Fernando Frediani (Qube)
I don't think there are many people using it with VMware specifically nor the people who develop it have probably tested it much. I also suspect is some problem with NFS settings and wonder if it is possible to change it if you use that environment only for running virtual machines. I know that

Re: [Gluster-users] Issue recreating volumes

2012-06-08 Thread Brian Candler
On Fri, Jun 08, 2012 at 01:58:56AM -0400, Rajesh Amaravathi wrote: > one can use the clear_xattrs.sh script with the bricks as argument to remove > all the xattrs set on bricks. it recursively deleted all > xattrs from the bricks' files. after running this script on bricks, we can > re-use them

Re: [Gluster-users] Issue recreating volumes

2012-06-08 Thread Brian Candler
On Fri, Jun 08, 2012 at 10:34:08AM +0530, Amar Tumballi wrote: > If admin is aware of the case, and knows that there is no 'data' > inside the brick, then easier option is to delete the export dir and > it gets created by 'gluster volume create'. If you want to fix it > without deleting the export

Re: [Gluster-users] Performance optimization tips Gluster 3.3? (small files / directory listings)

2012-06-08 Thread Brian Candler
On Fri, Jun 08, 2012 at 12:19:58AM -0400, olav johansen wrote: ># mount -t glusterfs fs1:/data-storage /storage >I've copied over my data to it again and doing a ls several times, >takes ~0.5 seconds: >[@web1 files]# time ls -all|wc -l Like I said before, please also try without th

Re: [Gluster-users] Gluster 2.6 and infiniband

2012-06-08 Thread bxma...@gmail.com
Hello, after downgrade kernel to 2.6.28 ( on 3.2.12 is glusterd not working - check my previous email ) i'm not able to run rdma at all, mount without rdma ( i'm using tcp,rdma ) is working ok but speed max 150mb/s after try to mount .rdma it fail and log contain this: [2012-06-08 03:50:32.442263