Re: [PATCH 21/39] mds: encode dirfrag base in cache rejoin ack

2013-03-21 Thread Yan, Zheng
On 03/21/2013 07:33 AM, Gregory Farnum wrote: This needs to handle versioning the encoding based on peer feature bits too. On Sun, Mar 17, 2013 at 7:51 AM, Yan, Zheng zheng.z@intel.com wrote: From: Yan, Zheng zheng.z@intel.com Cache rejoin ack message already encodes inode base,

Re: corruption of active mmapped files in btrfs snapshots

2013-03-21 Thread Alexandre Oliva
On Mar 19, 2013, Alexandre Oliva ol...@gnu.org wrote: On Mar 19, 2013, Alexandre Oliva ol...@gnu.org wrote: that is being processed inside the snapshot. This doesn't explain why the master database occasionally gets similarly corrupted, does it? Actually, scratch this bit for now. I don't

Re: [PATCH 04/39] mds: make sure table request id unique

2013-03-21 Thread Yan, Zheng
On 03/21/2013 02:31 AM, Greg Farnum wrote: On Tuesday, March 19, 2013 at 11:49 PM, Yan, Zheng wrote: On 03/20/2013 02:15 PM, Sage Weil wrote: On Wed, 20 Mar 2013, Yan, Zheng wrote: On 03/20/2013 07:09 AM, Greg Farnum wrote: Hmm, this is definitely narrowing the race (probably enough to never

Re: [ceph-users] Unable to start ceph monitor in V0.59

2013-03-21 Thread Joao Eduardo Luis
On 03/21/2013 11:23 AM, Chen, Xiaoxi wrote: Hi List, I cannot start my monitor when I update my cluster to v0.59, pls note that I am not trying to upgrade,but by reinstall the ceph software stack and rerunning mkcephfs. I have seen that the monitor change a lot after 0.58, is the

v0.59 released

2013-03-21 Thread Sage Weil
Another sprint and another release! This one is delayed a day or two due to power issues in our data center. The most exciting bit here is a big refactor in the monitor that has finally landed (thanks go to Joao Luis), but there is lots of other good stuff to go around: * mon: rearchitected

Re: [PATCH 22/39] mds: handle linkage mismatch during cache rejoin

2013-03-21 Thread Gregory Farnum
On Sun, Mar 17, 2013 at 7:51 AM, Yan, Zheng zheng.z@intel.com wrote: From: Yan, Zheng zheng.z@intel.com For MDS cluster, not all file system namespace operations that impact multiple MDS use two phase commit. Some operations use dentry link/unlink message to update replica dentry's

Re: [PATCH 08/39] mds: consider MDS as recovered when it reaches clientreply state.

2013-03-21 Thread Gregory Farnum
On Wed, Mar 20, 2013 at 7:22 PM, Yan, Zheng zheng.z@intel.com wrote: On 03/21/2013 02:40 AM, Greg Farnum wrote: The idea of this patch makes sense, but I'm not sure if we guarantee that each daemon sees every map update — if they don't then if an MDS misses the map moving an MDS into

Re: [PATCH 11/39] mds: don't delay processing replica buffer in slave request

2013-03-21 Thread Gregory Farnum
On Wed, Mar 20, 2013 at 9:15 PM, Sage Weil s...@inktank.com wrote: On Thu, 21 Mar 2013, Yan, Zheng wrote: On 03/21/2013 05:19 AM, Greg Farnum wrote: On Sunday, March 17, 2013 at 7:51 AM, Yan, Zheng wrote: From: Yan, Zheng zheng.z@intel.com Replicated objects need to be added into the

Re: [PATCH 13/39] mds: don't send resolve message between active MDS

2013-03-21 Thread Gregory Farnum
On Wed, Mar 20, 2013 at 7:55 PM, Yan, Zheng zheng.z@intel.com wrote: On 03/21/2013 05:56 AM, Gregory Farnum wrote: On Sun, Mar 17, 2013 at 7:51 AM, Yan, Zheng zheng.z@intel.com wrote: From: Yan, Zheng zheng.z@intel.com When MDS cluster is resolving, current behavior is sending

Re: [PATCH 29/39] mds: avoid double auth pin for file recovery

2013-03-21 Thread Gregory Farnum
Went over those mechanisms quickly but a bit more carefully; looks good. Reviewed-by: Greg Farnum g...@inktank.com On Wed, Mar 20, 2013 at 8:20 PM, Gregory Farnum g...@inktank.com wrote: This looks good on its face but I haven't had the chance to dig through the recovery queue stuff yet (it's

Re: [PATCH 21/39] mds: encode dirfrag base in cache rejoin ack

2013-03-21 Thread Gregory Farnum
On Wed, Mar 20, 2013 at 11:41 PM, Yan, Zheng zheng.z@intel.com wrote: On 03/21/2013 07:33 AM, Gregory Farnum wrote: This needs to handle versioning the encoding based on peer feature bits too. On Sun, Mar 17, 2013 at 7:51 AM, Yan, Zheng zheng.z@intel.com wrote: From: Yan, Zheng

Re: [PATCH 04/39] mds: make sure table request id unique

2013-03-21 Thread Gregory Farnum
On Thu, Mar 21, 2013 at 1:07 AM, Yan, Zheng zheng.z@intel.com wrote: On 03/21/2013 02:31 AM, Greg Farnum wrote: On Tuesday, March 19, 2013 at 11:49 PM, Yan, Zheng wrote: On 03/20/2013 02:15 PM, Sage Weil wrote: On Wed, 20 Mar 2013, Yan, Zheng wrote: On 03/20/2013 07:09 AM, Greg Farnum

Re: github pull requests

2013-03-21 Thread Chen, Xiaoxi
can we have a review system like review.openstack.com? 发自我的 iPhone 在 2013-3-20,7:10,Guilhem Lettron guil...@lettron.fr 写道: Glad to see this openness! Everyone isn't like you. And I hope to see less [PATCH] in mailing-list, but maybe it's only a dream. Just my two cents. On Tue, Mar 19,

Re: corruption of active mmapped files in btrfs snapshots

2013-03-21 Thread Chris Mason
Quoting Chris Mason (2013-03-21 14:06:14) Quoting Alexandre Oliva (2013-03-21 03:14:02) On Mar 19, 2013, Alexandre Oliva ol...@gnu.org wrote: On Mar 19, 2013, Alexandre Oliva ol...@gnu.org wrote: that is being processed inside the snapshot. This doesn't explain why the master

Re: [ceph-users] Unable to start ceph monitor in V0.59

2013-03-21 Thread Joao Eduardo Luis
On 03/21/2013 11:39 AM, Joao Eduardo Luis wrote: On 03/21/2013 11:23 AM, Chen, Xiaoxi wrote: Hi List, I cannot start my monitor when I update my cluster to v0.59, pls note that I am not trying to upgrade,but by reinstall the ceph software stack and rerunning mkcephfs. I have seen

Re: [PATCH 18/39] mds: fix MDS recovery involving cross authority rename

2013-03-21 Thread Yan, Zheng
On 03/22/2013 01:59 AM, Gregory Farnum wrote: On Sun, Mar 17, 2013 at 7:51 AM, Yan, Zheng zheng.z@intel.com wrote: From: Yan, Zheng zheng.z@intel.com For mds cluster, rename operation may involve multiple MDS. If the rename source's auth MDS crashes after some witness MDS have

Re: [PATCH 22/39] mds: handle linkage mismatch during cache rejoin

2013-03-21 Thread Yan, Zheng
On 03/22/2013 05:23 AM, Gregory Farnum wrote: On Sun, Mar 17, 2013 at 7:51 AM, Yan, Zheng zheng.z@intel.com wrote: From: Yan, Zheng zheng.z@intel.com For MDS cluster, not all file system namespace operations that impact multiple MDS use two phase commit. Some operations use dentry

Re: corruption of active mmapped files in btrfs snapshots

2013-03-21 Thread Alexandre Oliva
On Mar 21, 2013, Chris Mason chris.ma...@fusionio.com wrote: Quoting Chris Mason (2013-03-21 14:06:14) With mmap the kernel can pick any given time to start writing out dirty pages. The idea is that if the application makes more changes the page becomes dirty again and the kernel writes it