Hi Susant,
Have you downloaded those state dumps?
http://pan.baidu.com/s/1jHuZCMU
Have you found any clues?
Thank you.
PuYun
From: Susant Palai
Date: 2015-12-17 20:23
To: PuYun
CC: gluster-users
Subject: Re: [Gluster-users] How to diagnose volume rebalance failure?
Ok from your reply
processes, one is glusterfs for rebalance and 2 glusterfsd
for bricks. Only 1 glusterfsd occupied very large mem and it is related to the
newly added brick. The other 2 processes seem normal. If that happens again, I
will send you the state dump.
Thank you.
PuYun
From: Susant Palai
Date: 2015-12-17 20:23
To: PuYun
CC: gluster-users
Subject: Re: [Gluster-users] How to diagnose volume rebalance failure?
OK, from your reply the rebalance seems to be fine.
So what you can do is check whether the memory usage of the brick process keeps
increasing
: [ 4830] 0 4830 137846 6463 0 0 0 glusterfs
Dec 16 20:06:41 d001 kernel: [ 4940] 0 4940 341517 116710 1 0 0 glusterfs
= ==
PuYun
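One possible way to check whether a brick process's memory keeps growing, as suggested above, is to sample its resident set size at intervals. This is only a minimal sketch, assuming a Linux host where /proc/<pid>/status exposes a VmRSS line; the function names, sample count, and interval are illustrative and not part of Gluster.

```python
import re
import time


def parse_vmrss_kb(status_text):
    """Return the VmRSS value in kB from /proc/<pid>/status text, or None."""
    match = re.search(r"^VmRSS:\s+(\d+)\s+kB", status_text, re.MULTILINE)
    return int(match.group(1)) if match else None


def sample_rss(pid, samples=5, interval_s=60):
    """Read VmRSS for `pid` several times and return the readings in kB.

    Assumption: `pid` is a running user-space process (for example a
    glusterfsd brick process), so a VmRSS line is present.
    """
    readings = []
    for i in range(samples):
        with open(f"/proc/{pid}/status") as fh:
            readings.append(parse_vmrss_kb(fh.read()))
        if i < samples - 1:
            time.sleep(interval_s)
    return readings


def keeps_increasing(readings):
    """True if there are at least two readings and each exceeds the previous."""
    return len(readings) > 1 and all(
        later > earlier for earlier, later in zip(readings, readings[1:])
    )
```

For example, `keeps_increasing(sample_rss(pid))` with the PID of the glusterfsd process for the newly added brick would give a rough yes/no answer; a steady climb across many samples is a hint, not proof, of a leak.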
From: PuYun
Date: 2015-12-15 22:10
To: gluster-users
complete after running for 25 hours.
Also, I can't find any errors in the rebalance log file using "grep ' E '
FastVol-rebalance.log".
Thank you.
PuYun
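The grep above matches ' E ' anywhere in a line, including inside a file path or message text. A slightly stricter check, sketched here, matches only the single-letter severity that follows the bracketed timestamp, as in the " I " lines quoted in this thread. The regular expression and function names are assumptions based on those quoted lines, not an official log specification.

```python
import re
from collections import Counter

# Assumed layout, taken from lines quoted in this thread:
# [2015-12-13 21:30:31.527493] I [dht-rebalance.c:2340:gf_defrag_process_dir] ...
LOG_LINE = re.compile(r"^\[(?P<ts>[^\]]+)\]\s+(?P<level>[A-Z])\s+\[")


def count_levels(lines):
    """Count log lines per severity letter; lines that do not match are skipped."""
    counts = Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        if match:
            counts[match.group("level")] += 1
    return counts


def lines_with_level(lines, level="E"):
    """Return only the lines whose severity letter equals `level`."""
    selected = []
    for line in lines:
        match = LOG_LINE.match(line)
        if match and match.group("level") == level:
            selected.append(line)
    return selected
```

Usage would be along the lines of `count_levels(open("FastVol-rebalance.log"))`; an empty result for "E" agrees with the grep finding nothing.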
From: Susant Palai
Date: 2015-12-15 15:21
To: PuYun
CC: gluster-users
Subject: Re: [Gluster-users] How to diagnose volume
process 1800, UID 0, (glusterfs)
total-vm:5822376kB, anon-rss:5037552kB, file-rss:16kB
==
However, I can find an OOM entry in /var/log/messages matching every previous
rebalance failure. Is it a memory leak bug?
Thank you.
PuYun
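To put the sizes in the OOM-killer line quoted above into perspective, the kB fields can be pulled out and converted. A minimal sketch, assuming the fields always appear as name:<number>kB as they do in that line; the helper names are illustrative.

```python
import re


def parse_oom_sizes(line):
    """Extract 'name:<n>kB' fields from an OOM-killer log line as {name: kB}."""
    return {name: int(kb) for name, kb in re.findall(r"([a-z-]+):(\d+)kB", line)}


def kb_to_gib(kb):
    """Convert kB (1024-byte units) to GiB."""
    return kb / (1024 * 1024)


line = ("process 1800, UID 0, (glusterfs) "
        "total-vm:5822376kB, anon-rss:5037552kB, file-rss:16kB")
sizes = parse_oom_sizes(line)
print(round(kb_to_gib(sizes["anon-rss"]), 1))  # prints 4.8
```

So the killed glusterfs process held roughly 4.8 GiB of anonymous memory when it was killed.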
From: PuYun
Date: 2015-12-15
Hi,
I found this bug report: https://bugzilla.redhat.com/show_bug.cgi?id=1261234 . My
version is 3.7.4, which is older than 3.7.5, the version with the fix.
I'll upgrade my gluster version and try again later.
Thank you.
PuYun
From: PuYun
Date: 2015-12-15 21:58
To: gluster-users
Subject: Re: [Gluster
:33.179xx] in
mnt-b1-brick.log, mnt-c1-brick.log and etc-glusterfs-glusterd.vol.log.
PuYun
From: PuYun
Date: 2015-12-15 07:30
To: gluster-users
Subject: Re: [Gluster-users] How to diagnose volume rebalance failure?
Hi,
Failed again. I can see disconnections in the logs, but no more details
]
0-FastVol-dht:
/for_ybest_fsdir/user/ji/ay/up/a19640529/linkwrap/129836/icon_loading_white22c04a.gif:
attempting to move from FastVol-client-0 to FastVol-client-1
==
PuYun
From: PuYun
Date: 2015-12-14 21:51
To: gluster-users
Subject: Re
again. Everything is going well
for now.
I will report again when the current task completes or fails.
PuYun
From: Nithya Balachandran
Date: 2015-12-14 18:57
To: PuYun
CC: gluster-users
Subject: Re: [Gluster-users] How to diagnose volume rebalance failure?
Hi,
Can you send us the rebalance log
Here is the tail of the failed rebalance log, any clue?
[2015-12-13 21:30:31.527493] I [dht-rebalance.c:2340:gf_defrag_process_dir]
0-FastVol-dht: Migration operation on dir
/for_ybest_fsdir/user/Weixin.oClDcjhe/Ny/5F/1MsH5--BcoGRAJPI took 20.95 secs
[2015-12-13 21:30:31.528704] I