On Wed, Apr 11, 2012 at 03:39:07PM -0600, Jim Schutt wrote: > On 04/11/2012 02:28 PM, Josef Bacik wrote: > >On Wed, Apr 11, 2012 at 02:24:30PM -0600, Jim Schutt wrote: > >>On 04/11/2012 01:09 PM, Josef Bacik wrote: > >>>On Tue, Apr 10, 2012 at 01:39:14PM -0600, Jim Schutt wrote: > >>>>Hi, > >>>> > >>>>I hit this BUG today. > >>>> > >>>>I'm running 3.3.1 merged with the ceph and btrfs bits for 3.4, > >>>>i.e. 3.3.1 + > >>>> commit bc3f116fec194 "Btrfs: update the checks for mixed block groups > >>>> with big metadata blocks" > >>>> commit c666601a935b9 "rbd: move snap_rwsem to the device, rename to > >>>> header_rwsem" > >>>> > >>>>The btrfs filesystem in question is backing a Ceph OSD under > >>>>a heavy write load. > >>>> > >>>>Here's the bug: > >>>> > >>> > >>>Can you give this a whirl and let me know how it goes? If I'm right you > >>>should > >>>see a warning pop up in your messages. Thanks, > >> > >>OK, I've got my test running with your patch applied > >>to my previous kernel. > >> > >>Do you expect your warning to only fire when my > >>previous kernel would have BUGged? I ask because I've > >>only seen the BUG once, so it may be a low-probability > >>occurrence. > >> > >>It seems like I should keep testing until I see either > >>your new warning or the BUG, right? > >> > > > >So hopefully you will see my WARN with no BUG, but yes keep running until you > >see one or the other please ;). Thanks, > > Hmmm, the BUG won: > > [ 6202.249041] ------------[ cut here ]------------ > [ 6202.253654] kernel BUG at fs/btrfs/extent_io.c:3989!
Since this is exactly the same call trace, we can assume ref count on the buffer is correct. I think it means we're racing on removing the buffer from the radix tree. I'm adding some diagnostics here to try and grow the window a bit. -chris -- To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html