On Mon, Mar 17, 2014 at 5:57 AM, Florian Bruhin <m...@the-compiler.org> wrote: > Hi, > > on one of my servers running btrfs, I noticed a very high load of > 26/26/26. After investigating further, this happened in my logs about > 5 minutes before the monitoring alerted me because of the load:
Hi, This issue was fixed in kernel 3.14 by: https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/?id=5de865eebb8330eee19c37b31fb6f315a09d4273 It was already backported to 3.12.14 but not yet to any 3.13 release. cheers > > ------------[ cut here ]------------ > WARNING: CPU: 2 PID: 3046 at fs/btrfs/ctree.c:1321 > btrfs_search_old_slot+0x7c4/0x900 [btrfs]() > Modules linked in: w83627ehf hwmon_vid intel_rapl x86_pkg_temp_thermal > intel_powerclamp coretemp kvm_intel kvm btrfs i915 crct10dif_pclmul > crct10dif_common crc32_pclmul video intel_agp raid6_pq li > CPU: 2 PID: 3046 Comm: deluged Not tainted 3.13.6-1-ARCH #1 > Hardware name: /DH67BL, BIOS > BLH6710H.86A.0160.2012.1204.1156 12/04/2012 > 0000000000000009 ffff8800d2ecd9f0 ffffffff81513274 0000000000000000 > ffff8800d2ecda28 ffffffff81061a3d ffff880000000000 0000160000000000 > ffff88010c5b3400 ffff88011d63c460 ffff88010babf360 ffff8800d2ecda38 > Call Trace: > [<ffffffff81513274>] dump_stack+0x4d/0x6f > [<ffffffff81061a3d>] warn_slowpath_common+0x7d/0xa0 > [<ffffffff81061b1a>] warn_slowpath_null+0x1a/0x20 > [<ffffffffa05bfc74>] btrfs_search_old_slot+0x7c4/0x900 [btrfs] > [<ffffffffa06375f7>] __resolve_indirect_refs+0x117/0x650 [btrfs] > [<ffffffffa05fa07b>] ? release_extent_buffer+0x2b/0xd0 [btrfs] > [<ffffffffa06383d6>] ? __add_missing_keys.isra.10+0x76/0x140 [btrfs] > [<ffffffffa0638b1d>] find_parent_nodes+0x34d/0x6e0 [btrfs] > [<ffffffffa0638f69>] btrfs_find_all_roots+0x99/0x100 [btrfs] > [<ffffffffa05f90e0>] ? set_state_bits+0x80/0x80 [btrfs] > [<ffffffffa06398ed>] iterate_extent_inodes+0x14d/0x320 [btrfs] > [<ffffffffa05f90e0>] ? set_state_bits+0x80/0x80 [btrfs] > [<ffffffffa0639b47>] iterate_inodes_from_logical+0x87/0xa0 [btrfs] > [<ffffffffa0600a02>] extent_fiemap+0x282/0x5a0 [btrfs] > [<ffffffffa05e5c10>] ? btrfs_get_extent+0x9a0/0x9a0 [btrfs] > [<ffffffff81142000>] ? dirty_ratio_handler+0x10/0x50 > [<ffffffffa05e4cf5>] btrfs_fiemap+0x45/0x50 [btrfs] > [<ffffffff811b5ef4>] do_vfs_ioctl+0x154/0x4c0 > [<ffffffff811b62e1>] SyS_ioctl+0x81/0xa0 > [<ffffffff815216ad>] system_call_fastpath+0x1a/0x1f > ---[ end trace 62dcb7572bce3542 ]--- > BTRFS critical (device sda3): unable to find logical 20461224206336 len > 4096 > ------------[ cut here ]------------ > kernel BUG at fs/btrfs/inode.c:1593! > invalid opcode: 0000 [#1] PREEMPT SMP > Modules linked in: w83627ehf hwmon_vid intel_rapl x86_pkg_temp_thermal > intel_powerclamp coretemp kvm_intel kvm btrfs i915 crct10dif_pclmul > crct10dif_common crc32_pclmul video intel_agp raid6_pq li > CPU: 2 PID: 3046 Comm: deluged Tainted: G W 3.13.6-1-ARCH #1 > Hardware name: /DH67BL, BIOS > BLH6710H.86A.0160.2012.1204.1156 12/04/2012 > task: ffff88021205f500 ti: ffff8800d2ecc000 task.ti: ffff8800d2ecc000 > RIP: 0010:[<ffffffffa05e4828>] [<ffffffffa05e4828>] > btrfs_merge_bio_hook+0x78/0x80 [btrfs] > RSP: 0018:ffff8800d2ecd6c8 EFLAGS: 00010282 > RAX: 00000000ffffffea RBX: 0000000000001000 RCX: 0000000000000006 > RDX: 0000000000000007 RSI: 0000000000000046 RDI: 0000000000000246 > RBP: ffff8800d2ecd6e0 R08: 0000000000000000 R09: 0000000000000414 > R10: 0000000000000001 R11: 0000000000000000 R12: 0000000000001000 > R13: 000000094e000018 R14: ffff880115aecba8 R15: ffff8802126b0040 > FS: 00007f60528ae700(0000) GS:ffff88021f300000(0000) > knlGS:0000000000000000 > CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > CR2: 00007f5e3258bd18 CR3: 0000000212996000 CR4: 00000000000407e0 > Stack: > 0000000000001000 ffff8800d2ecd8f0 0000000000001000 ffff8800d2ecd730 > ffffffffa05fc5c9 0000002000000000 0000000000000000 ffffea0004499400 > 000000094e000018 0000000000000000 0000000000000000 0000000000001000 > Call Trace: > [<ffffffffa05fc5c9>] submit_extent_page.isra.37+0x149/0x1d0 [btrfs] > [<ffffffffa05fc877>] __do_readpage+0x227/0x760 [btrfs] > [<ffffffffa05fdfa0>] ? repair_eb_io_failure+0xd0/0xd0 [btrfs] > [<ffffffffa05d8370>] ? free_root_pointers+0x60/0x60 [btrfs] > [<ffffffffa05fce75>] __extent_read_full_page+0xc5/0xe0 [btrfs] > [<ffffffffa05d8370>] ? free_root_pointers+0x60/0x60 [btrfs] > [<ffffffffa05d8370>] ? free_root_pointers+0x60/0x60 [btrfs] > [<ffffffffa0600593>] read_extent_buffer_pages+0x223/0x300 [btrfs] > [<ffffffffa05d8370>] ? free_root_pointers+0x60/0x60 [btrfs] > [<ffffffffa05d9fc3>] > btree_read_extent_buffer_pages.constprop.52+0xb3/0x120 [btrfs] > [<ffffffffa05da5c6>] read_tree_block+0x46/0x80 [btrfs] > [<ffffffffa05bcda8>] read_block_for_search.isra.32+0x148/0x380 [btrfs] > [<ffffffff81061a4f>] ? warn_slowpath_common+0x8f/0xa0 > [<ffffffffa05bf777>] btrfs_search_old_slot+0x2c7/0x900 [btrfs] > [<ffffffffa06375f7>] __resolve_indirect_refs+0x117/0x650 [btrfs] > [<ffffffffa05fa07b>] ? release_extent_buffer+0x2b/0xd0 [btrfs] > [<ffffffffa06383d6>] ? __add_missing_keys.isra.10+0x76/0x140 [btrfs] > [<ffffffffa0638b1d>] find_parent_nodes+0x34d/0x6e0 [btrfs] > [<ffffffffa0638f69>] btrfs_find_all_roots+0x99/0x100 [btrfs] > [<ffffffffa05f90e0>] ? set_state_bits+0x80/0x80 [btrfs] > [<ffffffffa06398ed>] iterate_extent_inodes+0x14d/0x320 [btrfs] > [<ffffffffa05f90e0>] ? set_state_bits+0x80/0x80 [btrfs] > [<ffffffffa0639b47>] iterate_inodes_from_logical+0x87/0xa0 [btrfs] > [<ffffffffa0600a02>] extent_fiemap+0x282/0x5a0 [btrfs] > [<ffffffffa05e5c10>] ? btrfs_get_extent+0x9a0/0x9a0 [btrfs] > [<ffffffff81142000>] ? dirty_ratio_handler+0x10/0x50 > [<ffffffffa05e4cf5>] btrfs_fiemap+0x45/0x50 [btrfs] > [<ffffffff811b5ef4>] do_vfs_ioctl+0x154/0x4c0 > [<ffffffff811b62e1>] SyS_ioctl+0x81/0xa0 > [<ffffffff815216ad>] system_call_fastpath+0x1a/0x1f > Code: c9 45 31 c0 89 fe 48 89 c7 4c 89 65 e8 e8 f1 3f 02 00 85 c0 78 15 > 4c 01 e3 31 c0 48 3b 5d e8 0f 97 c0 48 83 c4 08 5b 41 5c 5d c3 <0f> 0b 66 0f > 1f 44 00 00 66 66 66 66 90 55 48 89 e5 41 57 49 > RIP [<ffffffffa05e4828>] btrfs_merge_bio_hook+0x78/0x80 [btrfs] > RSP <ffff8800d2ecd6c8> > ---[ end trace 62dcb7572bce3543 ]--- > > So basically after that happened, my load stayed high, so probably > something went berzerk. > > After a reboot everything seems to work nice again, except I get some > of these errors in my logs: > > BTRFS error (device sda3): block group 2346155245568 has wrong amount of > free space > BTRFS error (device sda3): failed to load free space cache for block > group 2346155245568 > > I'm running on an Archlinux on kernel 3.13.6. > > Is this a bug? What should I do now? I'm currently running a scrub to > see if everything is fine. > > If you need anything else to help, please let me know. Also please Cc > me on replies as I'm not subscribed to the list. Thanks! > > Florian > > -- > () ascii ribbon campaign - stop html mail www.asciiribbon.org > /\ www.the-compiler.org | I love long mails http://email.is-not-s.ms/ > It is now pitch dark. If you proceed, you will likely fall into a pit. -- Filipe David Manana, "Reasonable men adapt themselves to the world. Unreasonable men adapt the world to themselves. That's why all progress depends on unreasonable men." -- To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html