Re: [Gluster-users] How to diagnose volume rebalance failure?

2015-12-27 Thread PuYun
Hi Susant, Have you downloaded those state dumps? http://pan.baidu.com/s/1jHuZCMU Have you found any clue? Thank you. PuYun From: Susant Palai Date: 2015-12-17 20:23 To: PuYun CC: gluster-users Subject: Re: [Gluster-users] How to diagnose volume rebalance failure? Ok from your reply

Re: [Gluster-users] How to diagnose volume rebalance failure?

2015-12-17 Thread PuYun
processes, one is glusterfs for rebalance and 2 glusterfsd for bricks. Only 1 glusterfsd occupied very large mem and it is related to the newly added brick. The other 2 processes seems normal. If that happens again, I will send you the state dump. Thank you. PuYun From: Susant Palai Date: 2015-12

Re: [Gluster-users] How to diagnose volume rebalance failure?

2015-12-17 Thread PuYun
/1jHuZCMU PuYun From: Susant Palai Date: 2015-12-17 20:23 To: PuYun CC: gluster-users Subject: Re: [Gluster-users] How to diagnose volume rebalance failure? Ok from your reply rebalance seems to be fine. So what you can do is check whether the mem-usage of brick process keeps increasing

Re: [Gluster-users] How to diagnose volume rebalance failure?

2015-12-16 Thread PuYun
: [ 4830] 0 4830 137846 6463 0 0 0 glusterfs Dec 16 20:06:41 d001 kernel: [ 4940] 0 4940 341517 116710 1 0 0 glusterfs = == PuYun From: PuYun Date: 2015-12-15 22:10 To: gluster-users

Re: [Gluster-users] How to diagnose volume rebalance failure?

2015-12-15 Thread PuYun
complete after running for 25 hours. And, I can't grep any errors in the rebalance log file using "grep ' E ' FastVol-rebalance.log". Thank you. PuYun From: Susant Palai Date: 2015-12-15 15:21 To: PuYun CC: gluster-users Subject: Re: [Gluster-users] How to diagnose volume

Re: [Gluster-users] How to diagnose volume rebalance failure?

2015-12-15 Thread PuYun
process 1800, UID 0, (glusterfs) total-vm:5822376kB, anon-rss:5037552kB, file-rss:16kB == However, I can find oom in /var/log/messages matching every rebalance failure before. Is it a memory leak bug? Thank you. PuYun From: PuYun Date: 2015-12-15

Re: [Gluster-users] How to diagnose volume rebalance failure?

2015-12-15 Thread PuYun
Hi, I find this bug link https://bugzilla.redhat.com/show_bug.cgi?id=1261234 . My version is 3.7.4 which is older than the fixed version 3.7.5. I'll upgrade my gluster version and try again later. Thank you. PuYun From: PuYun Date: 2015-12-15 21:58 To: gluster-users Subject: Re: [Gluster

Re: [Gluster-users] How to diagnose volume rebalance failure?

2015-12-14 Thread PuYun
:33.179xx] in mnt-b1-brick.log, mnt-c1-brick.log and etc-glusterfs-glusterd.vol.log. PuYun From: PuYun Date: 2015-12-15 07:30 To: gluster-users Subject: Re: [Gluster-users] How to diagnose volume rebalance failure? Hi, Failed again. I can see disconnections in logs, but no more details

Re: [Gluster-users] How to diagnose volume rebalance failure?

2015-12-14 Thread PuYun
] 0-FastVol-dht: /for_ybest_fsdir/user/ji/ay/up/a19640529/linkwrap/129836/icon_loading_white22c04a.gif: attempting to move from FastVol-client-0 to FastVol-client-1 == PuYun From: PuYun Date: 2015-12-14 21:51 To: gluster-users Subject: Re

Re: [Gluster-users] How to diagnose volume rebalance failure?

2015-12-14 Thread PuYun
again. Everything goes well for now. I will report again when the current task completed or failed. PuYun From: Nithya Balachandran Date: 2015-12-14 18:57 To: PuYun CC: gluster-users Subject: Re: [Gluster-users] How to diagnose volume rebalance failure? Hi, Can you send us the rebalance log

Re: [Gluster-users] How to diagnose volume rebalance failure?

2015-12-13 Thread PuYun
Here is the tail of the failed rebalance log, any clue? [2015-12-13 21:30:31.527493] I [dht-rebalance.c:2340:gf_defrag_process_dir] 0-FastVol-dht: Migration operation on dir /for_ybest_fsdir/user/Weixin.oClDcjhe/Ny/5F/1MsH5--BcoGRAJPI took 20.95 secs [2015-12-13 21:30:31.528704] I