Thanks, I'll run this on Monday.

Josef
Wang Shilong <wangshilong1...@gmail.com> wrote:

Hello Josef,

With skinny metadata enabled and running the for-suse branch of your btrfs-next repo (which has the extent ref patch), I hit the following problem:

[ 250.679705] BTRFS info (device sdb): relocating block group 35597058048 flags 36
[ 250.728815] BTRFS info (device sdb): relocating block group 35462840320 flags 36
[ 253.562133] Dropping a ref for a root that doesn't have a ref on the block
[ 253.562475] Dumping block entry [34793177088 8192], num_refs 3, metadata 0
[ 253.562795] Ref root 0, parent 35532013568, owner 23988, offset 0, num_refs 18446744073709551615
[ 253.563126] Ref root 0, parent 35560964096, owner 23988, offset 0, num_refs 1
[ 253.563505] Ref root 0, parent 35654615040, owner 23988, offset 0, num_refs 1
[ 253.563837] Ref root 0, parent 35678650368, owner 23988, offset 0, num_refs 1
[ 253.564162] Root entry 5, num_refs 1
[ 253.564520] Root entry 18446744073709551608, num_refs 18446744073709551615
[ 253.564860] Ref action 4, root 5, ref_root 5, parent 0, owner 23988, offset 0, num_refs 1
[ 253.565205] [<ffffffffa049d2f1>] process_leaf.isra.6+0x281/0x3e0 [btrfs]
[ 253.565225] [<ffffffffa049de83>] build_ref_tree_for_root+0x433/0x460 [btrfs]
[ 253.565234] [<ffffffffa049e1af>] btrfs_build_ref_tree+0x18f/0x1c0 [btrfs]
[ 253.565241] [<ffffffffa0419ce8>] open_ctree+0x18b8/0x21a0 [btrfs]
[ 253.565247] [<ffffffffa03ecb0e>] btrfs_mount+0x62e/0x8b0 [btrfs]
[ 253.565251] [<ffffffff812324e9>] mount_fs+0x39/0x1b0
[ 253.565255] [<ffffffff8125285b>] vfs_kern_mount+0x6b/0x150
[ 253.565257] [<ffffffff8125565b>] do_mount+0x27b/0xc30
[ 253.565259] [<ffffffff81256356>] SyS_mount+0x96/0xf0
[ 253.565260] [<ffffffff81795429>] system_call_fastpath+0x16/0x1b
[ 253.565263] [<ffffffffffffffff>] 0xffffffffffffffff
[ 253.565272] Ref action 1, root 18446744073709551608, ref_root 0, parent 35654615040, owner 23988, offset 0, num_refs 1
[ 253.565681] [<ffffffffa049d564>] btrfs_ref_tree_mod+0x114/0x570 [btrfs]
[ 253.565692] [<ffffffffa03f946b>] btrfs_inc_extent_ref+0x6b/0x120 [btrfs]
[ 253.565697] [<ffffffffa03fb77c>] __btrfs_mod_ref+0x16c/0x2b0 [btrfs]
[ 253.565702] [<ffffffffa0401504>] btrfs_inc_ref+0x14/0x20 [btrfs]
[ 253.565707] [<ffffffffa03f05ff>] update_ref_for_cow+0x15f/0x380 [btrfs]
[ 253.565711] [<ffffffffa03f0a3d>] __btrfs_cow_block+0x21d/0x540 [btrfs]
[ 253.565716] [<ffffffffa03f0f0c>] btrfs_cow_block+0x12c/0x290 [btrfs]
[ 253.565721] [<ffffffffa046f59c>] do_relocation+0x49c/0x570 [btrfs]
[ 253.565728] [<ffffffffa04723ce>] relocate_tree_blocks+0x60e/0x660 [btrfs]
[ 253.565735] [<ffffffffa0473ce7>] relocate_block_group+0x407/0x690 [btrfs]
[ 253.565741] [<ffffffffa0474148>] btrfs_relocate_block_group+0x1d8/0x2f0 [btrfs]
[ 253.565746] [<ffffffffa04455a7>] btrfs_relocate_chunk.isra.30+0x77/0x800 [btrfs]
[ 253.565753] [<ffffffffa0448a8b>] __btrfs_balance+0x4eb/0x8d0 [btrfs]
[ 253.565760] [<ffffffffa044928a>] btrfs_balance+0x41a/0x720 [btrfs]
[ 253.565766] [<ffffffffa045112a>] btrfs_ioctl_balance+0x16a/0x530 [btrfs]
[ 253.565772] [<ffffffffa0456df8>] btrfs_ioctl+0x588/0x2cb0 [btrfs]
[ 253.565779] Ref action 1, root 18446744073709551608, ref_root 0, parent 35560964096, owner 23988, offset 0, num_refs 1
[ 253.566143] [<ffffffffa049d564>] btrfs_ref_tree_mod+0x114/0x570 [btrfs]
[ 253.566152] [<ffffffffa03f946b>] btrfs_inc_extent_ref+0x6b/0x120 [btrfs]
[ 253.566180] [<ffffffffa03fb77c>] __btrfs_mod_ref+0x16c/0x2b0 [btrfs]
[ 253.566186] [<ffffffffa0401504>] btrfs_inc_ref+0x14/0x20 [btrfs]
[ 253.566191] [<ffffffffa03f071b>] update_ref_for_cow+0x27b/0x380 [btrfs]
[ 253.566195] [<ffffffffa03f0a3d>] __btrfs_cow_block+0x21d/0x540 [btrfs]
[ 253.566199] [<ffffffffa03f0f0c>] btrfs_cow_block+0x12c/0x290 [btrfs]
[ 253.566203] [<ffffffffa046f59c>] do_relocation+0x49c/0x570 [btrfs]
[ 253.566210] [<ffffffffa04723ce>] relocate_tree_blocks+0x60e/0x660 [btrfs]
[ 253.566216] [<ffffffffa0473ce7>] relocate_block_group+0x407/0x690 [btrfs]
[ 253.566222] [<ffffffffa0474148>] btrfs_relocate_block_group+0x1d8/0x2f0 [btrfs]
[ 253.566227] [<ffffffffa04455a7>] btrfs_relocate_chunk.isra.30+0x77/0x800 [btrfs]
[ 253.566233] [<ffffffffa0448a8b>] __btrfs_balance+0x4eb/0x8d0 [btrfs]
[ 253.566240] [<ffffffffa044928a>] btrfs_balance+0x41a/0x720 [btrfs]
[ 253.566245] [<ffffffffa045112a>] btrfs_ioctl_balance+0x16a/0x530 [btrfs]
[ 253.566252] [<ffffffffa0456df8>] btrfs_ioctl+0x588/0x2cb0 [btrfs]
[ 253.566258] Ref action 2, root 18446744073709551608, ref_root 5, parent 0, owner 23988, offset 0, num_refs 18446744073709551615
[ 253.566641] [<ffffffffa049d710>] btrfs_ref_tree_mod+0x2c0/0x570 [btrfs]
[ 253.566651] [<ffffffffa040404a>] btrfs_free_extent+0x7a/0x180 [btrfs]
[ 253.566657] [<ffffffffa03fb77c>] __btrfs_mod_ref+0x16c/0x2b0 [btrfs]
[ 253.566662] [<ffffffffa0401521>] btrfs_dec_ref+0x11/0x20 [btrfs]
[ 253.566668] [<ffffffffa03f07a8>] update_ref_for_cow+0x308/0x380 [btrfs]

Below is my test script:

#!/bin/bash

DEVICE=/dev/sdb
TEST_MNT=/mnt
SLEEP=3

# Take a snapshot every $SLEEP seconds; delete on every 10th iteration.
function run_snapshots()
{
    i=1
    while [ 1 ]
    do
        btrfs sub snapshot $TEST_MNT $TEST_MNT/snap_$i
        a=$(($i%10))
        if [ $a -eq 0 ]; then
            # the glob expands relative to the current directory
            btrfs sub delete *
        fi
        ((i++))
        sleep $SLEEP
    done
}

# Build and clean the kernel tree in a loop to generate write load.
function run_compiling()
{
    while [ 1 ]
    do
        make -j4 -C $TEST_MNT/linux-btrfs
        make -C $TEST_MNT/linux-btrfs clean
    done
}

# Run a full balance repeatedly.
function run_balance()
{
    while [ 1 ]
    do
        btrfs balance start $TEST_MNT
        sleep $SLEEP
    done
}

run_snapshots &
run_compiling &
run_balance &

--cut--

Mount options:

/dev/sdb /mnt btrfs rw,relatime,space_cache 0 0

My /dev/sdb is 10G, and before compiling the kernel I run 'make allmodconfig'.

The tests above may expose more problems: after running for a while, the system seems blocked. Output of 'echo w > /proc/sysrq-trigger':

[ 1970.909512] SysRq : Show Blocked State
[ 1970.910490] task PC stack pid father
[ 1970.910564] kworker/u128:9 D ffff880208a89a30 0 3514 2 0x00000080
[ 1970.910587] Workqueue: writeback bdi_writeback_workfn (flush-btrfs-1)
[ 1970.910590] ffff8800b3dab8e8 0000000000000046 ffff8800b3dabfd8 00000000001d59c0
[ 1970.910594] 00000000001d59c0 ffff880208a89a30 ffff8801f71c3460 ffff8802303d6360
[ 1970.910597] ffff88023ff509a8 ffff8800b3dab978 0000000000000002 ffffffff8178eda0
[ 1970.910600] Call Trace:
[ 1970.910606] [<ffffffff8178eda0>] ? bit_wait+0x50/0x50
[ 1970.910609] [<ffffffff8178e56d>] io_schedule+0x9d/0x130
[ 1970.910612] [<ffffffff8178edcc>] bit_wait_io+0x2c/0x50
[ 1970.910614] [<ffffffff8178eb3b>] __wait_on_bit_lock+0x4b/0xb0
[ 1970.910619] [<ffffffff811aa2ef>] __lock_page+0xbf/0xe0
[ 1970.910623] [<ffffffff810caa90>] ? autoremove_wake_function+0x40/0x40
[ 1970.910642] [<ffffffffa043d9d0>] extent_write_cache_pages.isra.30.constprop.52+0x410/0x440 [btrfs]
[ 1970.910645] [<ffffffff810d6a46>] ? __lock_acquire+0x396/0xbe0
[ 1970.910648] [<ffffffff81024ec5>] ? native_sched_clock+0x35/0xa0
[ 1970.910661] [<ffffffffa043f92c>] extent_writepages+0x5c/0x90 [btrfs]
[ 1970.910672] [<ffffffffa04216a0>] ? btrfs_submit_direct+0x6b0/0x6b0 [btrfs]
[ 1970.910674] [<ffffffff810b7174>] ? local_clock+0x24/0x30
[ 1970.910685] [<ffffffffa041f008>] btrfs_writepages+0x28/0x30 [btrfs]
[ 1970.910688] [<ffffffff811b8a21>] do_writepages+0x21/0x50
[ 1970.910692] [<ffffffff8125f920>] __writeback_single_inode+0x40/0x540
[ 1970.910694] [<ffffffff81260425>] writeback_sb_inodes+0x275/0x520
[ 1970.910697] [<ffffffff8126076f>] __writeback_inodes_wb+0x9f/0xd0
[ 1970.910700] [<ffffffff81260a53>] wb_writeback+0x2b3/0x550
[ 1970.910702] [<ffffffff811b7e90>] ? bdi_dirty_limit+0x40/0xe0
[ 1970.910705] [<ffffffff812610d8>] bdi_writeback_workfn+0x1f8/0x650
[ 1970.910711] [<ffffffff8109c684>] process_one_work+0x1c4/0x640
[ 1970.910713] [<ffffffff8109c624>] ? process_one_work+0x164/0x640
[ 1970.910716] [<ffffffff8109cc1b>] worker_thread+0x11b/0x490
[ 1970.910718] [<ffffffff8109cb00>] ? process_one_work+0x640/0x640
[ 1970.910721] [<ffffffff810a2f1f>] kthread+0xff/0x120
[ 1970.910724] [<ffffffff81024ec5>] ? native_sched_clock+0x35/0xa0
[ 1970.910727] [<ffffffff810a2e20>] ? kthread_create_on_node+0x250/0x250
[ 1970.910730] [<ffffffff8179537c>] ret_from_fork+0x7c/0xb0
[ 1970.910732] [<ffffffff810a2e20>] ? kthread_create_on_node+0x250/0x250
[ 1970.910737] kworker/u128:20 D ffff8801f71c3460 0 8244 2 0x00000080
[ 1970.910752] Workqueue: btrfs-flush_delalloc btrfs_flush_delalloc_helper [btrfs]
[ 1970.910754] ffff88020aa2b640 0000000000000046 ffff88020aa2bfd8 00000000001d59c0
[ 1970.910757] 00000000001d59c0 ffff8801f71c3460 ffff880225e93460 7fffffffffffffff
[ 1970.910760] ffff880035763520 ffff880035763518 ffff880225e93460 ffff880201c44000
[ 1970.910763] Call Trace:
[ 1970.910766] [<ffffffff8178e209>] schedule+0x29/0x70
[ 1970.910769] [<ffffffff81793621>] schedule_timeout+0x281/0x460
[ 1970.910772] [<ffffffff810d47d5>] ? mark_held_locks+0x75/0xa0
[ 1970.910775] [<ffffffff817946ac>] ? _raw_spin_unlock_irq+0x2c/0x40
[ 1970.910777] [<ffffffff8178f89c>] wait_for_completion+0xfc/0x140
[ 1970.910780] [<ffffffff810b31c0>] ? wake_up_state+0x20/0x20
[ 1970.910790] [<ffffffffa0400ff7>] btrfs_async_run_delayed_refs+0x127/0x150 [btrfs]
[ 1970.910802] [<ffffffffa041d0c8>] __btrfs_end_transaction+0x208/0x390 [btrfs]
[ 1970.910811] [<ffffffffa041d260>] btrfs_end_transaction+0x10/0x20 [btrfs]
[ 1970.910821] [<ffffffffa042365b>] cow_file_range_inline+0x49b/0x5e0 [btrfs]
[ 1970.910824] [<ffffffff81024ec5>] ? native_sched_clock+0x35/0xa0
[ 1970.910833] [<ffffffffa0423aa3>] cow_file_range+0x303/0x450 [btrfs]
[ 1970.910836] [<ffffffff817945b7>] ? _raw_spin_unlock+0x27/0x40
[ 1970.910845] [<ffffffffa0424a88>] run_delalloc_range+0x338/0x370 [btrfs]
[ 1970.910857] [<ffffffffa043c5e9>] ? find_lock_delalloc_range+0x1e9/0x210 [btrfs]
[ 1970.910859] [<ffffffff810d6a46>] ? __lock_acquire+0x396/0xbe0
[ 1970.910870] [<ffffffffa043c72c>] writepage_delalloc.isra.34+0x11c/0x180 [btrfs]
[ 1970.910880] [<ffffffffa043d2fa>] __extent_writepage+0xca/0x390 [btrfs]
[ 1970.910883] [<ffffffff811b6f49>] ? clear_page_dirty_for_io+0xc9/0x110
[ 1970.910893] [<ffffffffa043d93a>] extent_write_cache_pages.isra.30.constprop.52+0x37a/0x440 [btrfs]
[ 1970.910895] [<ffffffff81024ec5>] ? native_sched_clock+0x35/0xa0
[ 1970.910898] [<ffffffff81024f39>] ? sched_clock+0x9/0x10
[ 1970.910900] [<ffffffff810b7175>] ? local_clock+0x25/0x30
[ 1970.910909] [<ffffffffa043f92c>] extent_writepages+0x5c/0x90 [btrfs]
[ 1970.910918] [<ffffffffa04216a0>] ? btrfs_submit_direct+0x6b0/0x6b0 [btrfs]
[ 1970.910928] [<ffffffffa041f008>] btrfs_writepages+0x28/0x30 [btrfs]
[ 1970.910930] [<ffffffff811b8a21>] do_writepages+0x21/0x50
[ 1970.910933] [<ffffffff811ac7dd>] __filemap_fdatawrite_range+0x5d/0x80
[ 1970.910936] [<ffffffff811ac8ac>] filemap_flush+0x1c/0x20
[ 1970.910945] [<ffffffffa042271a>] btrfs_run_delalloc_work+0x5a/0xa0 [btrfs]
[ 1970.910956] [<ffffffffa044ec1f>] normal_work_helper+0x13f/0x5c0 [btrfs]
[ 1970.910966] [<ffffffffa044f0f2>] btrfs_flush_delalloc_helper+0x12/0x20 [btrfs]
[ 1970.910969] [<ffffffff8109c684>] process_one_work+0x1c4/0x640
[ 1970.910971] [<ffffffff8109c624>] ? process_one_work+0x164/0x640
[ 1970.910976] [<ffffffff8109cc1b>] worker_thread+0x11b/0x490
[ 1970.910978] [<ffffffff8109cb00>] ? process_one_work+0x640/0x640
[ 1970.910981] [<ffffffff810a2f1f>] kthread+0xff/0x120
[ 1970.910983] [<ffffffff81024ec5>] ? native_sched_clock+0x35/0xa0
[ 1970.910986] [<ffffffff810a2e20>] ? kthread_create_on_node+0x250/0x250
[ 1970.910988] [<ffffffff8179537c>] ret_from_fork+0x7c/0xb0
[ 1970.910991] [<ffffffff810a2e20>] ? kthread_create_on_node+0x250/0x250
[ 1970.910997] btrfs D ffff88022beab460 0 62979 2587 0x00000080
[ 1970.911000] ffff880054083870 0000000000000046 ffff880054083fd8 00000000001d59c0
[ 1970.911004] 00000000001d59c0 ffff88022beab460 ffff88017f740000 7fffffffffffffff
[ 1970.911007] ffff8800546a9520 ffff8800546a9518 ffff88017f740000 ffff880201c44000
[ 1970.911010] Call Trace:
[ 1970.911012] [<ffffffff8178e209>] schedule+0x29/0x70
[ 1970.911015] [<ffffffff81793621>] schedule_timeout+0x281/0x460
[ 1970.911018] [<ffffffff810d47d5>] ? mark_held_locks+0x75/0xa0
[ 1970.911021] [<ffffffff817946ac>] ? _raw_spin_unlock_irq+0x2c/0x40
[ 1970.911023] [<ffffffff8178f89c>] wait_for_completion+0xfc/0x140
[ 1970.911026] [<ffffffff810b31c0>] ? wake_up_state+0x20/0x20
[ 1970.911035] [<ffffffffa0400ff7>] btrfs_async_run_delayed_refs+0x127/0x150 [btrfs]
[ 1970.911047] [<ffffffffa041d0c8>] __btrfs_end_transaction+0x208/0x390 [btrfs]
[ 1970.911059] [<ffffffffa041e0c3>] btrfs_end_transaction_throttle+0x13/0x20 [btrfs]
[ 1970.911073] [<ffffffffa0473cfe>] relocate_block_group+0x41e/0x690 [btrfs]
[ 1970.911086] [<ffffffffa0474148>] btrfs_relocate_block_group+0x1d8/0x2f0 [btrfs]
[ 1970.911100] [<ffffffffa04455a7>] btrfs_relocate_chunk.isra.30+0x77/0x800 [btrfs]
[ 1970.911102] [<ffffffff81024f39>] ? sched_clock+0x9/0x10
[ 1970.911105] [<ffffffff810b7175>] ? local_clock+0x25/0x30
[ 1970.911118] [<ffffffffa0435568>] ? btrfs_get_token_64+0x68/0x100 [btrfs]
[ 1970.911132] [<ffffffffa0448a8b>] __btrfs_balance+0x4eb/0x8d0 [btrfs]
[ 1970.911146] [<ffffffffa044928a>] btrfs_balance+0x41a/0x720 [btrfs]
[ 1970.911159] [<ffffffffa045112a>] ? btrfs_ioctl_balance+0x16a/0x530 [btrfs]
[ 1970.911172] [<ffffffffa045112a>] btrfs_ioctl_balance+0x16a/0x530 [btrfs]
[ 1970.911186] [<ffffffffa0456df8>] btrfs_ioctl+0x588/0x2cb0 [btrfs]
[ 1970.911189] [<ffffffff81024ec5>] ? native_sched_clock+0x35/0xa0
[ 1970.911191] [<ffffffff81024f39>] ? sched_clock+0x9/0x10
[ 1970.911194] [<ffffffff810b7175>] ? local_clock+0x25/0x30
[ 1970.911197] [<ffffffff810cfe7f>] ? up_read+0x1f/0x40
[ 1970.911200] [<ffffffff81067a84>] ? __do_page_fault+0x254/0x5b0
[ 1970.911202] [<ffffffff810d6a46>] ? __lock_acquire+0x396/0xbe0
[ 1970.911206] [<ffffffff81243830>] do_vfs_ioctl+0x300/0x520
[ 1970.911209] [<ffffffff8124fc6d>] ? __fget_light+0x13d/0x160
[ 1970.911212] [<ffffffff81243ad1>] SyS_ioctl+0x81/0xa0
[ 1970.911217] [<ffffffff8114a49c>] ? __audit_syscall_entry+0x9c/0xf0
[ 1970.911220] [<ffffffff81795429>] system_call_fastpath+0x16/0x1b
[ 1970.911228] as D ffff880225e93460 0 6423 6421 0x00000080
[ 1970.911231] ffff880049657ad0 0000000000000046 ffff880049657fd8 00000000001d59c0
[ 1970.911235] 00000000001d59c0 ffff880225e93460 ffff880225631a30 7fffffffffffffff
[ 1970.911238] ffff880035762b20 ffff880035762b18 ffff880225631a30 ffff880201c44000
[ 1970.911241] Call Trace:
[ 1970.911244] [<ffffffff8178e209>] schedule+0x29/0x70
[ 1970.911247] [<ffffffff81793621>] schedule_timeout+0x281/0x460
[ 1970.911250] [<ffffffff810d47d5>] ? mark_held_locks+0x75/0xa0
[ 1970.911252] [<ffffffff817946ac>] ? _raw_spin_unlock_irq+0x2c/0x40
[ 1970.911255] [<ffffffff8178f89c>] wait_for_completion+0xfc/0x140
[ 1970.911258] [<ffffffff810b31c0>] ? wake_up_state+0x20/0x20
[ 1970.911268] [<ffffffffa0400ff7>] btrfs_async_run_delayed_refs+0x127/0x150 [btrfs]
[ 1970.911280] [<ffffffffa041d0c8>] __btrfs_end_transaction+0x208/0x390 [btrfs]
[ 1970.911292] [<ffffffffa041d260>] btrfs_end_transaction+0x10/0x20 [btrfs]
[ 1970.911304] [<ffffffffa0423088>] btrfs_dirty_inode+0x78/0xe0 [btrfs]
[ 1970.911307] [<ffffffff8124cf55>] ? touch_atime+0xf5/0x160
[ 1970.911319] [<ffffffffa0423154>] btrfs_update_time+0x64/0xd0 [btrfs]
[ 1970.911321] [<ffffffff8124cdb5>] update_time+0x25/0xd0
[ 1970.911323] [<ffffffff8124cf79>] touch_atime+0x119/0x160
[ 1970.911327] [<ffffffff811acf34>] generic_file_read_iter+0x5f4/0x660
[ 1970.911330] [<ffffffff810d47d5>] ? mark_held_locks+0x75/0xa0
[ 1970.911332] [<ffffffff81790ed6>] ? mutex_lock_nested+0x2d6/0x520
[ 1970.911335] [<ffffffff81024ec5>] ? native_sched_clock+0x35/0xa0
[ 1970.911338] [<ffffffff8122d82b>] new_sync_read+0x8b/0xd0
[ 1970.911340] [<ffffffff8122dfdb>] vfs_read+0x9b/0x180
[ 1970.911343] [<ffffffff8122ecf8>] SyS_read+0x58/0xd0
[ 1970.911345] [<ffffffff81795429>] system_call_fastpath+0x16/0x1b
[ 1970.911347] as D ffff88022bf01a30 0 6433 6431 0x00000080
[ 1970.911351] ffff88016f3ffad0 0000000000000046 ffff88016f3fffd8 00000000001d59c0
[ 1970.911354] 00000000001d59c0 ffff88022bf01a30 ffff8800ba1a3460 7fffffffffffffff
[ 1970.911419] ffff88017faa6820 ffff88017faa6818 ffff8800ba1a3460 ffff880201c44000
[ 1970.911423] Call Trace:
[ 1970.911426] [<ffffffff8178e209>] schedule+0x29/0x70
[ 1970.911429] [<ffffffff81793621>] schedule_timeout+0x281/0x460
[ 1970.911432] [<ffffffff810d47d5>] ? mark_held_locks+0x75/0xa0
[ 1970.911435] [<ffffffff817946ac>] ? _raw_spin_unlock_irq+0x2c/0x40
[ 1970.911438] [<ffffffff8178f89c>] wait_for_completion+0xfc/0x140
[ 1970.911440] [<ffffffff810b31c0>] ? wake_up_state+0x20/0x20
[ 1970.911452] [<ffffffffa0400ff7>] btrfs_async_run_delayed_refs+0x127/0x150 [btrfs]
[ 1970.911464] [<ffffffffa041d0c8>] __btrfs_end_transaction+0x208/0x390 [btrfs]
[ 1970.911476] [<ffffffffa041d260>] btrfs_end_transaction+0x10/0x20 [btrfs]
[ 1970.911488] [<ffffffffa0423088>] btrfs_dirty_inode+0x78/0xe0 [btrfs]
[ 1970.911490] [<ffffffff8124cf55>] ? touch_atime+0xf5/0x160
[ 1970.911502] [<ffffffffa0423154>] btrfs_update_time+0x64/0xd0 [btrfs]
[ 1970.911505] [<ffffffff8124cdb5>] update_time+0x25/0xd0
[ 1970.911507] [<ffffffff8124cf79>] touch_atime+0x119/0x160
[ 1970.911510] [<ffffffff811acf34>] generic_file_read_iter+0x5f4/0x660
[ 1970.911513] [<ffffffff810d47d5>] ? mark_held_locks+0x75/0xa0
[ 1970.911516] [<ffffffff81790ed6>] ? mutex_lock_nested+0x2d6/0x520
[ 1970.911518] [<ffffffff81024ec5>] ? native_sched_clock+0x35/0xa0
[ 1970.911521] [<ffffffff8122d82b>] new_sync_read+0x8b/0xd0
[ 1970.911523] [<ffffffff8122dfdb>] vfs_read+0x9b/0x180
[ 1970.911526] [<ffffffff8122ecf8>] SyS_read+0x58/0xd0
[ 1970.911528] [<ffffffff81795429>] system_call_fastpath+0x16/0x1b
[ 1970.911530] ld D ffff880225e93460 0 6435 6370 0x00000080
[ 1970.911534] ffff880049623ad0 0000000000000046 ffff880049623fd8 00000000001d59c0
[ 1970.911537] 00000000001d59c0 ffff880225e93460 ffff8800364b4e90 7fffffffffffffff
[ 1970.911541] ffff880035762820 ffff880035762818 ffff8800364b4e90 ffff880201c44000
[ 1970.911544] Call Trace:
[ 1970.911547] [<ffffffff8178e209>] schedule+0x29/0x7

It is easy to reproduce this problem using my script.

> On 10/18/2014 07:21 AM, Petr Janecek wrote:
>> Hello,
>>
>>>> so far I haven't succeeded running btrfs balance on a large
>>>> skinny-metadata fs -- segfault, kernel bug, reproducible. No such
>>>> problems on ^skinny-metadata fs (same disks, same data). Tried both
>>>> several times on 3.17. More info in comments 10,14 in
>>>> https://bugzilla.kernel.org/show_bug.cgi?id=64961
>>>
>>> I can't reproduce this, how big is your home directory, and are you
>>> still seeing corruptions after just rsyncing to a clean fs?
>>> Thanks,
>>
>> as I wrote in comment 10, it has improved since a year ago when I
>> reported it: I see no corruption at all, neither after rsync, nor after
>> balance crash: btrfs check doesn't find anything wrong, files look ok.
>> The only problem is that after adding a disk the balance segfaults on a
>> kernel bug and the fs gets stuck. When I run balance again after
>> reboot, it makes only very small progress and crashes again the same
>> way.
>>
>> There are some 2.5TB of data in 7.5M files on that fs. And a couple
>> dozen ro snapshots -- I'm testing 3.17 + revert of 9c3b306e1c9e right
>> now, but it takes more than a day to copy the data and recreate all the
>> snapshots. But a test with ^skinny-metadata showed no problems, so I
>> don't think I got bitten by that bug.
>>
>> I have a btrfs-image of one of the previous runs after crashed balance.
>> It's 15GB. I can place it somewhere with a fast link, are you interested?
>>
>
> Yup, send me the link and I'll pull it down. Thanks,
>
> Josef

Best Regards,
Wang Shilong