Re: 2.6.39-rc2 oops and assertion failure...

2011-04-08 Thread Josef Bacik

On 04/07/2011 10:26 PM, Daniel J Blueman wrote:
> Hi Josef, Chris,
>
> On 8 April 2011 00:23, Josef Bacik jo...@redhat.com wrote:
>> On 04/07/2011 03:21 AM, Daniel J Blueman wrote:
>>> When running a practical stress-test on 2.6.39-rc2 trying to reproduce
>>> an older (extent refcounting) issue, I am consistently able to hit an
>>> oops [1] and an assertion failure [2].
>>
>> Sorry about that, please apply the patch I just sent this morning
>>
>> [PATCH] Btrfs: deal with the case that we run out of space in the cache
>
> Superb work - the btrfs_write_out_cache oops is addressed, so now we
> (separately) hit a few other assertions at: volumes.c:2013 [1],
> volumes.c:2063 [2] and volumes.c:2703 [3] with the previous
> reproducer.
>
> Let me know if adding any debugging or other testing may be useful.
>
> Thanks,
>   Daniel

Looks like the first two panics are basically the same thing.  You are
getting -EIO back from btrfs_shrink_device(), which could come either
from the search or from the code in relocation.c.  So will you put
printk's at the two places in relocation.c where we return -EIO and
figure out which one is getting tripped?  Once we know who is returning
-EIO we can go from there.  As for the last one, that's just a normal
ENOSPC, but it's because we're allocating a chunk in the submission
path, so that's going to be a little trickier to deal with.  Let's fix
the first two panics first and then hopefully that last one will just
go away :).  Thanks,


Josef
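
For anyone following along, the instrumentation requested above can be sketched as follows. This is only a userspace illustration, not btrfs code: relocate_step() and its two failure flags are hypothetical stand-ins for the two -EIO return sites in relocation.c, and fprintf() stands in for printk(). The point is that each exit logs its own function and line, so the log shows which one fired.

```c
#include <errno.h>
#include <stdio.h>

/*
 * Userspace stand-in for the kernel's printk(). In kernel code the
 * equivalent line would be:
 *   printk(KERN_ERR "%s:%d: returning -EIO\n", __func__, __LINE__);
 */
#define TRACE_EIO() \
	fprintf(stderr, "%s:%d: returning -EIO\n", __func__, __LINE__)

/* Hypothetical function with two distinct -EIO exits (not btrfs code). */
static int relocate_step(int read_failed, int verify_failed)
{
	if (read_failed) {
		TRACE_EIO();	/* first candidate site */
		return -EIO;
	}
	if (verify_failed) {
		TRACE_EIO();	/* second candidate site */
		return -EIO;
	}
	return 0;
}
```

Calling relocate_step(0, 1) returns -EIO and prints the line number of the second site only, which is the information needed to tell the two paths apart.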
--
To unsubscribe from this list: send the line unsubscribe linux-btrfs in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: 2.6.39-rc2 oops and assertion failure...

2011-04-07 Thread Josef Bacik

On 04/07/2011 03:21 AM, Daniel J Blueman wrote:

> When running a practical stress-test on 2.6.39-rc2 trying to reproduce
> an older (extent refcounting) issue, I am consistently able to hit an
> oops [1] and an assertion failure [2].

Sorry about that, please apply the patch I just sent this morning

[PATCH] Btrfs: deal with the case that we run out of space in the cache

Thanks,

Josef


Re: 2.6.39-rc2 oops and assertion failure...

2011-04-07 Thread Daniel J Blueman

Hi Josef, Chris,

On 8 April 2011 00:23, Josef Bacik jo...@redhat.com wrote:
> On 04/07/2011 03:21 AM, Daniel J Blueman wrote:
>> When running a practical stress-test on 2.6.39-rc2 trying to reproduce
>> an older (extent refcounting) issue, I am consistently able to hit an
>> oops [1] and an assertion failure [2].
>
> Sorry about that, please apply the patch I just sent this morning
>
> [PATCH] Btrfs: deal with the case that we run out of space in the cache

Superb work - the btrfs_write_out_cache oops is addressed, so now we
(separately) hit a few other assertions at: volumes.c:2013 [1],
volumes.c:2063 [2] and volumes.c:2703 [3] with the previous
reproducer.

Let me know if adding any debugging or other testing may be useful.

Thanks,
  Daniel

--- [1]

kernel BUG at fs/btrfs/volumes.c:2013!
invalid opcode:  [#1] PREEMPT SMP DEBUG_PAGEALLOC
last sysfs file: /sys/devices/virtual/block/ram7/removable
CPU 0
Modules linked in: ppp_generic slhc tun brd loop

Pid: 17040, comm: btrfs Tainted: GW   2.6.39-rc2-350cd+ #3
Supermicro X8STi/X8STi
RIP: 0010:[812c214b]  [812c214b] btrfs_balance+0x27b/0x280
RSP: 0018:88015c923e08  EFLAGS: 00010282
RAX: fffb RBX: 880301d6e1b0 RCX: 0040
RDX: fffb RSI:  RDI: 8112e425
RBP: 88015c923e88 R08:  R09: 8802f8ee53f0
R10: 0012 R11: 0098 R12: 8802f909a490
R13: 8802f909bc38 R14: 1000 R15: 7fffd1599ce0
FS:  7f3c4b6f4740() GS:88031fc0() knlGS:
CS:  0010 DS:  ES:  CR0: 8005003b
CR2: 00f00098 CR3: 00015c921000 CR4: 06f0
DR0:  DR1:  DR2: 
DR3:  DR6: 0ff0 DR7: 0400
Process btrfs (pid: 17040, threadinfo 88015c922000, task 88030b898000)
Stack:
 880307cd5498 880301d6c120 88015c923e38 81085b9e
 880308a5d700 0008 88015c923f48 81031d5c
 ea000a9e7b40 88015c923f58 88030b898000 88015c8aa300
Call Trace:
 [81085b9e] ? up_read+0x1e/0x40
 [81031d5c] ? do_page_fault+0x1cc/0x440
 [812c9ec0] btrfs_ioctl+0x450/0x590
 [81152e8d] do_vfs_ioctl+0x8d/0x330
 [81141444] ? fget_light+0x274/0x3c0
 [81106cc0] ? __do_fault+0x150/0x5d0
 [8115317a] sys_ioctl+0x4a/0x80
 [8170a03b] system_call_fastpath+0x16/0x1b
Code: 81 c7 d8 22 00 00 e8 05 4b 44 00 8b 45 80 e9 e7 fd ff ff 31 c0
eb d2 85 c0 74 a7 0f 0b eb fe 0f 0b eb fe 0f 0b eb fe 0f 0b eb fe 0f
0b eb fe 90 55 48 89 e5 48 83 ec 40 8b 05 e2 62 72 00 4c 89
RIP  [812c214b] btrfs_balance+0x27b/0x280
 RSP 88015c923e08
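
A note on reading the dump above: RAX appears as "fffb", which looks like the archive dropped the leading digits. Assuming the low 32 bits were 0xfffffffb, that is -5 as a signed int, i.e. -EIO, consistent with an I/O error being returned to btrfs_balance(). A minimal sketch of that decoding, where reg_to_errno() is a hypothetical helper and 4095 mirrors the kernel's MAX_ERRNO convention:

```c
#include <errno.h>
#include <stdint.h>

/*
 * Treat the low 32 bits of a register value as a signed return code and
 * give back the positive errno it encodes, or 0 if it is not in the
 * -1..-4095 range used for kernel error returns.
 */
static int reg_to_errno(uint64_t reg)
{
	uint32_t low = (uint32_t)reg;
	uint32_t neg = 0u - low;	/* two's-complement negation */

	return (neg >= 1 && neg <= 4095) ? (int)neg : 0;
}
```

Under the same assumption, reg_to_errno(0xfffffffb) yields EIO, and a value of 0xffffffe4 would yield ENOSPC.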

--- [2]

kernel BUG at fs/btrfs/volumes.c:2063!
invalid opcode:  [#1] PREEMPT SMP DEBUG_PAGEALLOC
last sysfs file: /sys/devices/virtual/block/ram7/removable
CPU 0
Modules linked in: brd loop

Pid: 13460, comm: btrfs Tainted: GW   2.6.39-rc2-350cd+ #3
Supermicro X8STi/X8STi
RIP: 0010:[812c213b]  [812c213b] btrfs_balance+0x26b/0x280
RSP: 0018:8800b1827e08  EFLAGS: 00010282
RAX: fffb RBX: 88030934d168 RCX: 0006
RDX: fffb RSI: 880308fc06f0 RDI: 880308fc
RBP: 8800b1827e88 R08:  R09: 
R10:  R11:  R12: 8802ff5455e8
R13: 8800b1827e38 R14: 00010d56 R15: 8800b1827e18
FS:  7fce737e5740() GS:88031fc0() knlGS:
CS:  0010 DS:  ES:  CR0: 8005003b
CR2: 02371688 CR3: b1ff8000 CR4: 06f0
DR0:  DR1:  DR2: 
DR3:  DR6: 0ff0 DR7: 0400
Process btrfs (pid: 13460, threadinfo 8800b1826000, task 880308fc)
Stack:
 0100 88030934e1b0 0100 010d56e4
 880308837a00 0008 0100 0113bbe4
 880308fc0600 8800b1827f58 880308fc 8801f8c56c00
Call Trace:
 [812c9ec0] btrfs_ioctl+0x450/0x590
 [81152e8d] do_vfs_ioctl+0x8d/0x330
 [8114148f] ? fget_light+0x2bf/0x3c0
 [8109629d] ? trace_hardirqs_on_caller+0x14d/0x190
 [8115317a] sys_ioctl+0x4a/0x80
 [8170a03b] system_call_fastpath+0x16/0x1b
Code: 7c 90 fb ff 48 8b 55 88 48 8b ba 58 01 00 00 48 81 c7 d8 22 00
00 e8 05 4b 44 00 8b 45 80 e9 e7 fd ff ff 31 c0 eb d2 85 c0 74 a7 0f
0b eb fe 0f 0b eb fe 0f 0b eb fe 0f 0b eb fe 0f 0b eb fe 90
RIP  [812c213b] btrfs_balance+0x26b/0x280
 RSP 8800b1827e08

--- [3]

kernel BUG at fs/btrfs/volumes.c:2703!
invalid opcode:  [#1] PREEMPT SMP DEBUG_PAGEALLOC
last sysfs file: /sys/devices/virtual/bdi/btrfs-3/uevent
CPU 0
Modules linked in: brd loop

Pid: 14333, comm: btrfs-delalloc- Tainted: GW
2.6.39-rc2-350cd+ #3 Supermicro X8STi/X8STi
RIP: