Re: [Gluster-users] question about info and info.tmp

2016-11-24 Thread Atin Mukherjee
On Fri, Nov 25, 2016 at 1:14 PM, songxin wrote: > Hi Atin, > It seems that this workaround should be done by manual. > Is that right? > And even the files in bricks/* may be empty too. > Yes, that's right > > Do you have a workaround, which is implemented in glusterfs

Re: [Gluster-users] question about info and info.tmp

2016-11-24 Thread songxin
Hi Atin, It seems that this workaround should be done by manual. Is that right? And even the files in bricks/* may be empty too. Do you have a workaround, which is implemented in glusterfs code? Thanks, Xin 在 2016-11-25 15:36:29,"Atin Mukherjee" 写道: On Fri, Nov

Re: [Gluster-users] question about info and info.tmp

2016-11-24 Thread Atin Mukherjee
On Fri, Nov 25, 2016 at 12:06 PM, songxin wrote: > Hi Atin, > Do you mean that you have the workaround applicable now? > Or it will take time to design the workaround? > > If you have workaround now, could you share it to me ? > If you end up in having a 0 byte info file

Re: [Gluster-users] question about info and info.tmp

2016-11-24 Thread songxin
Hi Atin, Do you mean that you have the workaround applicable now? Or it will take time to design the workaround? If you have workaround now, could you share it to me ? Thanks, Xin, 在 2016-11-24 19:12:07,"Atin Mukherjee" 写道: Xin - I appreciate your patience. I'd

Re: [Gluster-users] question about info and info.tmp

2016-11-24 Thread Atin Mukherjee
Xin - I appreciate your patience. I'd need some more time to pick this item up from my backlog. I believe we have a workaround applicable here too. On Thu, 24 Nov 2016 at 14:24, songxin wrote: > > > > Hi Atin, > Actually, the glusterfs is used in my project. > And our test

Re: [Gluster-users] question about info and info.tmp

2016-11-24 Thread songxin
Hi Atin, Actually, the glusterfs is used in my project. And our test team find this issue. So I want to make sure that whether you plan to fix it. if you have plan I will wait you because your method shoud be better than mine. Thanks, Xin 在 2016-11-21 10:00:36,"Atin Mukherjee"

Re: [Gluster-users] question about info and info.tmp

2016-11-20 Thread songxin
Hi Atin, Ok.Thank you for your reply. Thanks, Xin 在 2016-11-21 10:00:36,"Atin Mukherjee" 写道: Hi Xin, I've not got a chance to look into it yet. delete stale volume function is in place to take care of wiping off volume configuration data which has been

Re: [Gluster-users] question about info and info.tmp

2016-11-20 Thread Atin Mukherjee
Hi Xin, I've not got a chance to look into it yet. delete stale volume function is in place to take care of wiping off volume configuration data which has been deleted from the cluster. However we need to revisit this code to see if this function is anymore needed given we recently added a

Re: [Gluster-users] question about info and info.tmp

2016-11-20 Thread songxin
Hi Atin, Thank you for your support. And any conclusions about this issue? Thanks, Xin 在 2016-11-16 20:59:05,"Atin Mukherjee" 写道: On Tue, Nov 15, 2016 at 1:53 PM, songxin wrote: ok, thank you. 在 2016-11-15 16:12:34,"Atin Mukherjee"

Re: [Gluster-users] question about info and info.tmp

2016-11-16 Thread songxin
Hi Atin, Thank you for your support. I have a question for you. glusterd_store_volinfo() will hidden remove the info and bricks/* by rename(). Why glusterd must remove the info and bricks/* in function glusterd_delete_stale_volume() before calling glusterd_store_volinfo()? Thanks, Xin

Re: [Gluster-users] question about info and info.tmp

2016-11-16 Thread Atin Mukherjee
On Tue, Nov 15, 2016 at 1:53 PM, songxin wrote: > ok, thank you. > > > > > 在 2016-11-15 16:12:34,"Atin Mukherjee" 写道: > > > > On Tue, Nov 15, 2016 at 12:47 PM, songxin wrote: > >> >> Hi Atin, >> >> I think the root cause is in

Re: [Gluster-users] question about info and info.tmp

2016-11-15 Thread songxin
ok, thank you. 在 2016-11-15 16:12:34,"Atin Mukherjee" 写道: On Tue, Nov 15, 2016 at 12:47 PM, songxin wrote: Hi Atin, I think the root cause is in the function glusterd_import_friend_volume as below. int32_t glusterd_import_friend_volume

Re: [Gluster-users] question about info and info.tmp

2016-11-15 Thread Atin Mukherjee
On Tue, Nov 15, 2016 at 12:47 PM, songxin wrote: > > Hi Atin, > > I think the root cause is in the function glusterd_import_friend_volume as > below. > > int32_t > glusterd_import_friend_volume (dict_t *peer_data, size_t count) > { > ... > ret =

Re: [Gluster-users] question about info and info.tmp

2016-11-14 Thread songxin
Hi Atin, I think the root cause is in the function glusterd_import_friend_volume as below. int32_t glusterd_import_friend_volume (dict_t *peer_data, size_t count) { ... ret = glusterd_volinfo_find (new_volinfo->volname, _volinfo); if (0 == ret) { (void)

Re: [Gluster-users] question about info and info.tmp

2016-11-14 Thread songxin
Hi Atin, Now I have known that the info and bricks/* are removed by the function glusterd_delete_stale_volume(). But I have not known how to solve this issue. Thanks, Xin 在 2016-11-15 12:07:05,"Atin Mukherjee" 写道: On Tue, Nov 15, 2016 at 8:58 AM, songxin

Re: [Gluster-users] question about info and info.tmp

2016-11-14 Thread songxin
Hi Atin, I have two nodes, a node and b node, in which creating a replicate volume and then start the volume. I will run the script as below on b node. #!/bin/bash i=1 while(($i<100)) do

Re: [Gluster-users] question about info and info.tmp

2016-11-14 Thread Atin Mukherjee
On Tue, Nov 15, 2016 at 8:58 AM, songxin wrote: > Hi Atin, > I have some clues about this issue. > I could reproduce this issue use the scrip that mentioned in > https://bugzilla.redhat.com/show_bug.cgi?id=1308487 . > I really appreciate your help in trying to nail down

Re: [Gluster-users] question about info and info.tmp

2016-11-14 Thread songxin
Hi Atin, I have some clues about this issue. I could reproduce this issue use the scrip that mentioned in https://bugzilla.redhat.com/show_bug.cgi?id=1308487 . After I added some debug print,which like below, in glusterd-store.c and I found that the /var/lib/glusterd/vols/xxx/info and

Re: [Gluster-users] question about info and info.tmp

2016-11-11 Thread Atin Mukherjee
On Fri, Nov 11, 2016 at 4:00 PM, songxin wrote: > Hi Atin, > > Thank you for your support. > Sincerely wait for your reply. > > By the way, could you make sure that the issue, file info is empty, cause > by rename is interrupted in kernel? > As per my RCA on that bug, it

Re: [Gluster-users] question about info and info.tmp

2016-11-11 Thread songxin
Hi Atin, Thank you for your support. Sincerely wait for your reply. By the way, could you make sure that the issue, file info is empty, cause by rename is interrupted in kernel? Thanks, Xin 在 2016-11-11 15:49:02,"Atin Mukherjee" 写道: On Fri, Nov 11, 2016 at 1:15

Re: [Gluster-users] question about info and info.tmp

2016-11-10 Thread Atin Mukherjee
On Fri, Nov 11, 2016 at 1:15 PM, songxin wrote: > Hi Atin, > Thank you for your reply. > Actually it is very difficult to reproduce because I don't know when there > was an ongoing commit happening.It is just a coincidence. > But I want to make sure the root cause. > I'll

Re: [Gluster-users] question about info and info.tmp

2016-11-10 Thread songxin
Hi Atin, Thank you for your reply. Actually it is very difficult to reproduce because I don't know when there was an ongoing commit happening.It is just a coincidence. But I want to make sure the root cause. So I would be grateful if you could answer my questions below. You said that "This

Re: [Gluster-users] question about info and info.tmp

2016-11-10 Thread Atin Mukherjee
On Fri, Nov 11, 2016 at 12:38 PM, songxin wrote: > > Hi Atin, > Thank you for your reply. > > As you said that the info file can only be changed in the > glusterd_store_volinfo() > sequentially because of the big lock. > > I have found the similar issue as below that you

Re: [Gluster-users] question about info and info.tmp

2016-11-10 Thread songxin
Hi Atin, Thank you for your reply. As you said that the info file can only be changed in the glusterd_store_volinfo() sequentially because of the big lock. I have found the similar issue as below that you mentioned. https://bugzilla.redhat.com/show_bug.cgi?id=1308487 You said that "This

Re: [Gluster-users] question about info and info.tmp

2016-11-10 Thread Atin Mukherjee
On Fri, Nov 11, 2016 at 8:33 AM, songxin wrote: > Hi Atin, > > Thank you for your reply. > I have two questions for you. > > 1.Are the two files info and info.tmp are only to be created or changed in > function glusterd_store_volinfo()? I did not find other point in which

Re: [Gluster-users] question about info and info.tmp

2016-11-10 Thread songxin
Hi Atin, Thank you for your reply. I have two questions for you. 1.Are the two files info and info.tmp are only to be created or changed in function glusterd_store_volinfo()? I did not find other point in which the two file are changed. 2.I found that glusterd_store_volinfo() will be call in

Re: [Gluster-users] question about info and info.tmp

2016-11-10 Thread Atin Mukherjee
Did you run out of disk space by any chance? AFAIK, the code is like we write new stuffs to .tmp file and rename it back to the original file. In case of a disk space issue I expect both the files to be of non zero size. But having said that I vaguely remember a similar issue (in the form of a bug