On Tue, Jul 1, 2014 at 4:04 PM, Chris Mason <c...@fb.com> wrote:
> On 06/30/2014 07:42 PM, Cody P Schafer wrote:
>> On Mon, Jun 30, 2014 at 1:30 PM, Chris Mason <c...@fb.com> wrote:
>>> On 06/30/2014 02:11 PM, Chris Mason wrote:
>>>> On 06/29/2014 04:02 PM, Cody P Schafer wrote:
>>>>> On Fri, Jun 27, 2014 at 7:22 PM, Chris Samuel <ch...@csamuel.org> wrote:
>>>>>> On Fri, 27 Jun 2014 05:20:41 PM Duncan wrote:
>>>>>>
>>>>>>> If I'm not mistaken the fix for the 3.16 series bug was:
>>>>>>>
>>>>>>> ea4ebde02e08558b020c4b61bb9a4c0fcf63028e
>>>>>>>
>>>>>>> Btrfs: fix deadlocks with trylock on tree nodes.
>>>>>>
>>>>>> That patch applies cleanly to 3.15.2 so if it is indeed the fix it should
>>>>>> probably go to -stable for the next 3.15 release..
>>>>>>
>>>>>> Unfortunately my test system died a while ago (hardware problem) and 
>>>>>> I've not
>>>>>> been able to resurrect it yet.
>>>>>
>>>>> I'm also seeing stuck tasks on btrfs (3.14.4, 3.15.1, 3.15.2).
>>>>> I've also tried 3.15.2 with ea4ebde02e08558b020c4b61bb9a4c applied on
>>>>> top with similar results.
>>>>> I've been triggering the hang with 'rsync -hPaHAXx --del /mnt/home/a/
>>>>> /home/a/' where /mnt/home and /home are 2 separate btrfs filesystems
>>>>> on 2 separate disks.
>>>>>
>>>>> dmesg with w-trigger: 
>>>>> https://urldefense.proofpoint.com/v1/url?u=http://bpaste.net/show/419555&k=ZVNjlDMF0FElm4dQtryO4A%3D%3D%0A&r=6%2FL0lzzDhu0Y1hL9xm%2BQyA%3D%3D%0A&m=SAjzDO8AnhJBEWtUi6s8VGVQd2sORQ%2FJz5tWH4nOYWg%3D%0A&s=2c4ff3f7f39b2e6d3dcd4947905df54d6a534b35adf63c55d8c50e28ef5781b6
>>>>> --
>>>>
>>>> These traces show us waiting for IO, but it doesn't show anyone doing
>>>> the IO.  Either we're failing to kick off our work queues or they are
>>>> stuck on something else.
>>>>
>>>> Could you please send a sysrq-t and sysrq-l while you're stuck?  That
>>>> will show us all the procs and all the CPUs.
>>>
>>> Also, do you have any nodatacow files in here?  Please say yes.
>>>
>>
>> kernel log from 3.15.2 + ea4ebde02 showing the blocked tasks,
>> sysrq-{w,t,l} included
>> https://urldefense.proofpoint.com/v1/url?u=http://bpaste.net/show/423296/&k=ZVNjlDMF0FElm4dQtryO4A%3D%3D%0A&r=6%2FL0lzzDhu0Y1hL9xm%2BQyA%3D%3D%0A&m=SAjzDO8AnhJBEWtUi6s8VGVQd2sORQ%2FJz5tWH4nOYWg%3D%0A&s=5af8bc75059925af242b0eef1f4b94348d233d79968d53ff36b7c2594c9dd6b9
>>
>> I haven't explicitely created any nodatacow files, is there a quick
>> way to tell if there are any? Right now I'm doing
>> `lsattr -R /mnt/home/a/ 2>/dev/null | grep -- '^-*C-* '` to try and check.
>>
>> (2>/dev/null is hiding lots of "Operation not supported While reading
>> flags on" warnings)
>>
>
> If you haven't turned nodatacow on intentionally, you don't have any
> nodatacow files ;)  I have been trying to reproduce this with rsync and
> other code that hammers on the ordered writeback, but no luck yet.
>
> Before we spend too much time triggering it again, I'd like you to
> please try a patch from Filipe that is in current mainline.  I've cherry
> picked on top of 3.15.3 in a branch called v3.15.y:
>
> git://git.kernel.org/pub/scm/linux/kernel/git/mason/linux-btrfs.git v3.15.y
>
> -chris
>

Will do. The rsync I'm running is processing a lot of chromium cache
files when it hangs (just for a reference), and ends up triggering a
bunch of deletes as well.
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to