On 2018/9/5 下午3:41, Nikolay Borisov wrote:
>
>
> On 5.09.2018 08:53, Qu Wenruo wrote:
>>
>>
>> On 2018/9/5 下午1:42, Nikolay Borisov wrote:
>>>
>>>
>>> On 5.09.2018 05:10, Qu Wenruo wrote:
>>>>
>>>>
>>>> On 2018/8/16 下午9:10, Nikolay Borisov wrote:
>>>>> This commit enables the delayed refs infrastructures. This entails doing
>>>>> the following:
>>>>>
>>>>> 1. Replacing existing calls of btrfs_extent_post_op (which is the
>>>>> equivalent of delayed refs) with the proper btrfs_run_delayed_refs.
>>>>> As well as eliminating open-coded calls to finish_current_insert and
>>>>> del_pending_extents which execute the delayed ops.
>>>>>
>>>>> 2. Wiring up the addition of delayed refs when freeing extents
>>>>> (btrfs_free_extent) and when adding new extents (alloc_tree_block).
>>>>>
>>>>> 3. Adding calls to btrfs_run_delayed refs in the transaction commit
>>>>> path alongside comments why every call is needed, since it's not always
>>>>> obvious (those call sites were derived empirically by running and
>>>>> debugging existing tests)
>>>>>
>>>>> 4. Correctly flagging the transaction in which we are reinitialising
>>>>> the extent tree.
>>>>>
>>>>> 5 Moving btrfs_write_dirty_block_groups to btrfs_write_dirty_block_groups
>>>>> since blockgroups should be written to disk after the last delayed refs
>>>>> have been run.
>>>>>
>>>>> Signed-off-by: Nikolay Borisov <nbori...@suse.com>
>>>>> Signed-off-by: David Sterba <dste...@suse.com>
>>>>
>>>> Is there something (maybe btrfs_run_delayed_refs()?) missing in
>>>> btrfs-image?
>>>>
>>>> btrfs-image from devel branch can't restore image correctly, the block
>>>> group used bytes is not correct, thus it can't pass misc nor fsck tests.
>>>
>>> This is really strange, all fsck/misc tests passed with those patches.
>>> Can you be more specific which tests exactly you mean ?
>>
>> One case is fsck/020 with lowmem mode. (Original mode lacks block
>> group->used check).
>>
>> More specifically, fsck/020/keyed_data_ref_with_shared_leaf.img
>>
>> Using btrfs-image from my distribution (v4.17.1) and devel branch btrfs
>> check: (cwd is btrfs-progs, devel branch)
>>
>> $ btrfs-image -r
>> tests/fsck-tests/020-extent-ref-cases/keyed_data_ref_with_shared_leaf.img
>> ~/test.img
>> $ btrfs check --mode=wmem ~/test.img
>> Opening filesystem to check...
>> Checking filesystem on /home/adam/test.img
>> UUID: 12dabcf2-d4da-4a70-9701-9f3d48074e73
>> [1/7] checking root items
>> [2/7] checking extents
>> [3/7] checking free space cache
>> [4/7] checking fs roots
>> [5/7] checking only csums items (without verifying data)
>> [6/7] checking root refs done with fs roots in lowmem mode, skipping
>> [7/7] checking quota groups skipped (not enabled on this FS)
>> found 1208320 bytes used, no error found
>> total csum bytes: 512
>> total tree bytes: 684032
>> total fs tree bytes: 638976
>> total extent tree bytes: 16384
>> btree space waste bytes: 305606
>> file data blocks allocated: 93847552
>> referenced 1773568
>>
>> But if using btrfs-image with your delayed ref patch:
>> $ ./btrfs-image -r
>> tests/fsck-tests/020-extent-ref-cases/keyed_data_ref_with_shared_leaf.img
>> ~/test.img
>>
>> # No matter if I'm using btrfs-check from devel or 4.17.1
>> $ btrfs check --mode=wmem ~/test.img
>> Opening filesystem to check...
>> Checking filesystem on /home/adam/test.img
>> UUID: 12dabcf2-d4da-4a70-9701-9f3d48074e73
>> [1/7] checking root items
>> [2/7] checking extents
>> ERROR: block group[4194304 8388608] used 20480 but extent items used 24576
>> ERROR: block group[20971520 16777216] used 659456 but extent items used
>> 655360
>> ERROR: errors found in extent allocation tree or chunk allocation
>> [3/7] checking free space cache
>> [4/7] checking fs roots
>> [5/7] checking only csums items (without verifying data)
>> [6/7] checking root refs done with fs roots in lowmem mode, skipping
>> [7/7] checking quota groups skipped (not enabled on this FS)
>> found 1208320 bytes used, error(s) found
>> total csum bytes: 512
>> total tree bytes: 684032
>> total fs tree bytes: 638976
>> total extent tree bytes: 16384
>> btree space waste bytes: 305606
>> file data blocks allocated: 93847552
>> referenced 1773568
>>
>> I'd say, although lowmem check is still far from perfect, it indeed has
>> extra checks original mode lacks, and in this case it indeed exposes
>> problem.
>
>
> I'm not able to reproduce it:
>
> make TEST_ENABLE_OVERRIDE=ue TEST_ARGS_CHECK="--mode=lowmem" test-fsck
> [TEST] fsck-tests.sh
> [TEST/fsck] 001-bad-file-extent-bytenr
> [TEST/fsck] 002-bad-transid
> [TEST/fsck] 003-shift-offsets
> [TEST/fsck] 004-no-dir-index
> [TEST/fsck] 005-bad-item-offset
> [TEST/fsck] 006-bad-root-items
> [TEST/fsck] 007-bad-offset-snapshots
> [TEST/fsck] 008-bad-dir-index-name
> [TEST/fsck] 009-no-dir-item-or-index
> [TEST/fsck] 010-no-rootdir-inode-item
> [TEST/fsck] 011-no-inode-item
> [TEST/fsck] 012-leaf-corruption
> [TEST/fsck] 013-extent-tree-rebuild
> [TEST/fsck] 014-no-extent-info
> [TEST/fsck] 015-tree-reloc-tree
> [TEST/fsck] 016-wrong-inode-nbytes
> [TEST/fsck] 017-missing-all-file-extent
> [TEST/fsck] 018-leaf-crossing-stripes
> [TEST/fsck] 019-non-skinny-false-alert
> [TEST/fsck] 020-extent-ref-cases
> [TEST/fsck] 021-partially-dropped-snapshot-case
> [TEST/fsck] 022-qgroup-rescan-halfway
> [TEST/fsck] 023-qgroup-stack-overflow
> [TEST/fsck] 024-clear-space-cache
> [TEST/fsck] 025-file-extents
> [TEST/fsck] 026-bad-dir-item-name
> [TEST/fsck] 027-bad-extent-inline-ref-type
> [TEST/fsck] 028-unaligned-super-dev-sizes
> [TEST/fsck] 029-valid-orphan-item
> [TEST/fsck] 030-reflinked-prealloc-extents
> [TEST/fsck] 031-metadatadump-check-data-csum
> [TEST/fsck] 032-corrupted-qgroup
> [TEST/fsck] 032-freespacetree-corrupted-extent-offset
> [TEST/fsck] 033-lowmem-collission-dir-items
> [TEST/fsck] 034-bad-inode-flags
> [TEST/fsck] 035-inline-bad-ram-bytes
> [TEST/fsck] 035-rescan-not-kicked-in
>
>
> git bl -9
> dac6a0a6f5ca btrfs-progs: Merge alloc_reserved_tree_block(2|) (3 weeks ago)
> <Nikolay Borisov>
> da184a3838e6 btrfs-progs: Remove __free_extent2 (3 weeks ago) <Nikolay
> Borisov>
> 76b75e40606e btrfs-progs: Remove old delayed refs infrastructure (3 months
> ago) <Nikolay Borisov>
> 870c58e08673 btrfs-progs: Wire up delayed refs (3 months ago) <Nikolay
> Borisov>
> e045b219e397 btrfs-progs: Make btrfs_write_dirty_block_groups take only trans
> argument (3 weeks ago) <Nikolay Borisov>
> 00f8d76c8a27 btrfs-progs: Add delayed refs infrastructure (3 months ago)
> <Nikolay Borisov>
> d03f79e8deb6 btrfs-progs: Add alloc_reserved_tree_block2 function (3 months
> ago) <Nikolay Borisov>
> d4a4831fffe9 btrfs-progs: Add __free_extent2 function (3 months ago) <Nikolay
> Borisov>
> 7faaca0d9f78 Btrfs progs v4.17.1 (4 weeks ago) <David Sterba>
>
> Are you sure your branch is not dirty with other changes?
Oh, I'm using the (maybe out-of-data) David's devel branch.
For the "Wire up delayed refs" the commit hash is indeed different.
In that branch it indeed has some difference.
Is there any branch I could fetch from?
Thanks,
Qu
>
>
>>
>> Thanks,
>> Qu
>>
>>
>>>
>>>>
>>>> Thanks,
>>>> Qu
>>>>
>>>>> ---
>>>>> check/main.c | 3 +-
>>>>> extent-tree.c | 166
>>>>> ++++++++++++++++++++++++++++++----------------------------
>>>>> transaction.c | 27 +++++++++-
>>>>> 3 files changed, 112 insertions(+), 84 deletions(-)
>>>>>
>>>>> diff --git a/check/main.c b/check/main.c
>>>>> index bc2ee22f7943..b361cd7e26a0 100644
>>>>> --- a/check/main.c
>>>>> +++ b/check/main.c
>>>>> @@ -8710,7 +8710,7 @@ static int reinit_extent_tree(struct
>>>>> btrfs_trans_handle *trans,
>>>>> fprintf(stderr, "Error adding block group\n");
>>>>> return ret;
>>>>> }
>>>>> - btrfs_extent_post_op(trans);
>>>>> + btrfs_run_delayed_refs(trans, -1);
>>>>> }
>>>>>
>>>>> ret =eset_balance(trans, fs_info);
>>>>> @@ -9767,6 +9767,7 @@ int cmd_check(int argc, char **argv)
>>>>> goto close_out;
>>>>> }
>>>>>
>>>>> + trans->reinit_extent_tree =rue;
>>>>> if (init_extent_tree) {
>>>>> printf("Creating a new extent tree\n");
>>>>> ret =einit_extent_tree(trans, info,
>>>>> diff --git a/extent-tree.c b/extent-tree.c
>>>>> index 7d6c37c6b371..2fa51bbc0359 100644
>>>>> --- a/extent-tree.c
>>>>> +++ b/extent-tree.c
>>>>> @@ -1418,8 +1418,6 @@ int btrfs_inc_extent_ref(struct btrfs_trans_handle
>>>>> *trans,
>>>>> err =et;
>>>>> out:
>>>>> btrfs_free_path(path);
>>>>> - finish_current_insert(trans);
>>>>> - del_pending_extents(trans);
>>>>> BUG_ON(err);
>>>>> return err;
>>>>> }
>>>>> @@ -1602,8 +1600,6 @@ int btrfs_set_block_flags(struct btrfs_trans_handle
>>>>> *trans, u64 bytenr,
>>>>> btrfs_set_extent_flags(l, item, flags);
>>>>> out:
>>>>> btrfs_free_path(path);
>>>>> - finish_current_insert(trans);
>>>>> - del_pending_extents(trans);
>>>>> return ret;
>>>>> }
>>>>>
>>>>> @@ -1701,7 +1697,6 @@ static int write_one_cache_group(struct
>>>>> btrfs_trans_handle *trans,
>>>>> struct btrfs_block_group_cache *cache)
>>>>> {
>>>>> int ret;
>>>>> - int pending_ret;
>>>>> struct btrfs_root *extent_root =rans->fs_info->extent_root;
>>>>> unsigned long bi;
>>>>> struct extent_buffer *leaf;
>>>>> @@ -1717,12 +1712,8 @@ static int write_one_cache_group(struct
>>>>> btrfs_trans_handle *trans,
>>>>> btrfs_mark_buffer_dirty(leaf);
>>>>> btrfs_release_path(path);
>>>>> fail:
>>>>> - finish_current_insert(trans);
>>>>> - pending_ret =el_pending_extents(trans);
>>>>> if (ret)
>>>>> return ret;
>>>>> - if (pending_ret)
>>>>> - return pending_ret;
>>>>> return 0;
>>>>>
>>>>> }
>>>>> @@ -2049,6 +2040,7 @@ static int finish_current_insert(struct
>>>>> btrfs_trans_handle *trans)
>>>>> int skinny_metadata >>>>
>>>>> btrfs_fs_incompat(extent_root->fs_info, SKINNY_METADATA);
>>>>>
>>>>> +
>>>>> while(1) {
>>>>> ret =ind_first_extent_bit(&info->extent_ins, 0, &start,
>>>>> &end, EXTENT_LOCKED);
>>>>> @@ -2080,6 +2072,8 @@ static int finish_current_insert(struct
>>>>> btrfs_trans_handle *trans)
>>>>> BUG_ON(1);
>>>>> }
>>>>>
>>>>> +
>>>>> + printf("shouldn't be executed\n");
>>>>> clear_extent_bits(&info->extent_ins, start, end, EXTENT_LOCKED);
>>>>> kfree(extent_op);
>>>>> }
>>>>> @@ -2379,7 +2373,6 @@ static int __free_extent(struct btrfs_trans_handle
>>>>> *trans,
>>>>> }
>>>>> fail:
>>>>> btrfs_free_path(path);
>>>>> - finish_current_insert(trans);
>>>>> return ret;
>>>>> }
>>>>>
>>>>> @@ -2462,33 +2455,30 @@ int btrfs_free_extent(struct btrfs_trans_handle
>>>>> *trans,
>>>>> u64 bytenr, u64 num_bytes, u64 parent,
>>>>> u64 root_objectid, u64 owner, u64 offset)
>>>>> {
>>>>> - struct btrfs_root *extent_root =oot->fs_info->extent_root;
>>>>> - int pending_ret;
>>>>> int ret;
>>>>>
>>>>> WARN_ON(num_bytes < root->fs_info->sectorsize);
>>>>> - if (root =extent_root) {
>>>>> - struct pending_extent_op *extent_op;
>>>>> -
>>>>> - extent_op =malloc(sizeof(*extent_op), GFP_NOFS);
>>>>> - BUG_ON(!extent_op);
>>>>> -
>>>>> - extent_op->type =ENDING_EXTENT_DELETE;
>>>>> - extent_op->bytenr =ytenr;
>>>>> - extent_op->num_bytes =um_bytes;
>>>>> - extent_op->level =int)owner;
>>>>> -
>>>>> - set_extent_bits(&root->fs_info->pending_del,
>>>>> - bytenr, bytenr + num_bytes - 1,
>>>>> - EXTENT_LOCKED);
>>>>> - set_state_private(&root->fs_info->pending_del,
>>>>> - bytenr, (unsigned long)extent_op);
>>>>> - return 0;
>>>>> + /*
>>>>> + * tree log blocks never actually go into the extent allocation
>>>>> + * tree, just update pinning info and exit early.
>>>>> + */
>>>>> + if (root_objectid =BTRFS_TREE_LOG_OBJECTID) {
>>>>> + printf("PINNING EXTENTS IN LOG TREE\n");
>>>>> + WARN_ON(owner >=TRFS_FIRST_FREE_OBJECTID);
>>>>> + btrfs_pin_extent(trans->fs_info, bytenr, num_bytes);
>>>>> + ret =;
>>>>> + } else if (owner < BTRFS_FIRST_FREE_OBJECTID) {
>>>>> + BUG_ON(offset);
>>>>> + ret =trfs_add_delayed_tree_ref(trans->fs_info, trans,
>>>>> + bytenr, num_bytes, parent,
>>>>> + root_objectid, (int)owner,
>>>>> + BTRFS_DROP_DELAYED_REF,
>>>>> + NULL, NULL, NULL);
>>>>> + } else {
>>>>> + ret =_free_extent(trans, bytenr, num_bytes, parent,
>>>>> + root_objectid, owner, offset, 1);
>>>>> }
>>>>> - ret =_free_extent(trans, bytenr, num_bytes, parent,
>>>>> - root_objectid, owner, offset, 1);
>>>>> - pending_ret =el_pending_extents(trans);
>>>>> - return ret ? ret : pending_ret;
>>>>> + return ret;
>>>>> }
>>>>>
>>>>> static u64 stripe_align(struct btrfs_root *root, u64 val)
>>>>> @@ -2694,6 +2684,8 @@ static int alloc_reserved_tree_block2(struct
>>>>> btrfs_trans_handle *trans,
>>>>> struct btrfs_delayed_tree_ref *ref =trfs_delayed_node_to_tree_ref(node);
>>>>> struct btrfs_key ins;
>>>>> bool skinny_metadata =trfs_fs_incompat(trans->fs_info, SKINNY_METADATA);
>>>>> + int ret;
>>>>> + u64 start, end;
>>>>>
>>>>> ins.objectid =ode->bytenr;
>>>>> if (skinny_metadata) {
>>>>> @@ -2704,10 +2696,25 @@ static int alloc_reserved_tree_block2(struct
>>>>> btrfs_trans_handle *trans,
>>>>> ins.type =TRFS_EXTENT_ITEM_KEY;
>>>>> }
>>>>>
>>>>> - return alloc_reserved_tree_block(trans, ref->root, trans->transid,
>>>>> - extent_op->flags_to_set,
>>>>> - &extent_op->key, ref->level, &ins);
>>>>> + if (ref->root =BTRFS_EXTENT_TREE_OBJECTID) {
>>>>> + ret =ind_first_extent_bit(&trans->fs_info->extent_ins,
>>>>> + node->bytenr, &start, &end,
>>>>> + EXTENT_LOCKED);
>>>>> + ASSERT(!ret);
>>>>> + ASSERT(start =node->bytenr);
>>>>> + ASSERT(end =node->bytenr + node->num_bytes - 1);
>>>>> + }
>>>>> +
>>>>> + ret =lloc_reserved_tree_block(trans, ref->root, trans->transid,
>>>>> + extent_op->flags_to_set,
>>>>> + &extent_op->key, ref->level, &ins);
>>>>>
>>>>> + if (ref->root =BTRFS_EXTENT_TREE_OBJECTID) {
>>>>> + clear_extent_bits(&trans->fs_info->extent_ins, start, end,
>>>>> + EXTENT_LOCKED);
>>>>> + }
>>>>> +
>>>>> + return ret;
>>>>> }
>>>>>
>>>>> static int alloc_reserved_tree_block(struct btrfs_trans_handle *trans,
>>>>> @@ -2772,39 +2779,50 @@ static int alloc_tree_block(struct
>>>>> btrfs_trans_handle *trans,
>>>>> u64 search_end, struct btrfs_key *ins)
>>>>> {
>>>>> int ret;
>>>>> + u64 extent_size;
>>>>> + struct btrfs_delayed_extent_op *extent_op;
>>>>> + bool skinny_metadata =trfs_fs_incompat(root->fs_info,
>>>>> + SKINNY_METADATA);
>>>>> +
>>>>> + extent_op =trfs_alloc_delayed_extent_op();
>>>>> + if (!extent_op)
>>>>> + return -ENOMEM;
>>>>> +
>>>>> ret =trfs_reserve_extent(trans, root, num_bytes, empty_size,
>>>>> hint_byte, search_end, ins, 0);
>>>>> BUG_ON(ret);
>>>>>
>>>>> + if (key)
>>>>> + memcpy(&extent_op->key, key, sizeof(extent_op->key));
>>>>> + else
>>>>> + memset(&extent_op->key, 0, sizeof(extent_op->key));
>>>>> + extent_op->flags_to_set =lags;
>>>>> + extent_op->update_key =kinny_metadata ? false : true;
>>>>> + extent_op->update_flags =rue;
>>>>> + extent_op->is_data =alse;
>>>>> + extent_op->level =evel;
>>>>> +
>>>>> + extent_size =ns->offset;
>>>>> +
>>>>> + if (btrfs_fs_incompat(root->fs_info, SKINNY_METADATA)) {
>>>>> + ins->offset =evel;
>>>>> + ins->type =TRFS_METADATA_ITEM_KEY;
>>>>> + }
>>>>> +
>>>>> + /* Ensure this reserved extent is not found by the allocator */
>>>>> if (root_objectid =BTRFS_EXTENT_TREE_OBJECTID) {
>>>>> - struct pending_extent_op *extent_op;
>>>>> -
>>>>> - extent_op =malloc(sizeof(*extent_op), GFP_NOFS);
>>>>> - BUG_ON(!extent_op);
>>>>> -
>>>>> - extent_op->type =ENDING_EXTENT_INSERT;
>>>>> - extent_op->bytenr =ns->objectid;
>>>>> - extent_op->num_bytes =ns->offset;
>>>>> - extent_op->level =evel;
>>>>> - extent_op->flags =lags;
>>>>> - memcpy(&extent_op->key, key, sizeof(*key));
>>>>> -
>>>>> - set_extent_bits(&root->fs_info->extent_ins, ins->objectid,
>>>>> - ins->objectid + ins->offset - 1,
>>>>> - EXTENT_LOCKED);
>>>>> - set_state_private(&root->fs_info->extent_ins,
>>>>> - ins->objectid, (unsigned long)extent_op);
>>>>> - } else {
>>>>> - if (btrfs_fs_incompat(root->fs_info, SKINNY_METADATA)) {
>>>>> - ins->offset =evel;
>>>>> - ins->type =TRFS_METADATA_ITEM_KEY;
>>>>> - }
>>>>> - ret =lloc_reserved_tree_block(trans, root_objectid,
>>>>> - generation, flags,
>>>>> - key, level, ins);
>>>>> - finish_current_insert(trans);
>>>>> - del_pending_extents(trans);
>>>>> + ret =et_extent_bits(&trans->fs_info->extent_ins,
>>>>> + ins->objectid,
>>>>> + ins->objectid + extent_size - 1,
>>>>> + EXTENT_LOCKED);
>>>>> +
>>>>> + BUG_ON(ret);
>>>>> }
>>>>> +
>>>>> + ret =trfs_add_delayed_tree_ref(root->fs_info, trans, ins->objectid,
>>>>> + extent_size, 0, root_objectid,
>>>>> + level, BTRFS_ADD_DELAYED_EXTENT,
>>>>> + extent_op, NULL, NULL);
>>>>> return ret;
>>>>> }
>>>>>
>>>>> @@ -3329,11 +3347,6 @@ int btrfs_make_block_group(struct
>>>>> btrfs_trans_handle *trans,
>>>>> sizeof(cache->item));
>>>>> BUG_ON(ret);
>>>>>
>>>>> - ret =inish_current_insert(trans);
>>>>> - BUG_ON(ret);
>>>>> - ret =el_pending_extents(trans);
>>>>> - BUG_ON(ret);
>>>>> -
>>>>> return 0;
>>>>> }
>>>>>
>>>>> @@ -3429,10 +3442,6 @@ int btrfs_make_block_groups(struct
>>>>> btrfs_trans_handle *trans,
>>>>> sizeof(cache->item));
>>>>> BUG_ON(ret);
>>>>>
>>>>> - finish_current_insert(trans);
>>>>> - ret =el_pending_extents(trans);
>>>>> - BUG_ON(ret);
>>>>> -
>>>>> cur_start =ache->key.objectid + cache->key.offset;
>>>>> }
>>>>> return 0;
>>>>> @@ -3814,14 +3823,9 @@ int btrfs_fix_block_accounting(struct
>>>>> btrfs_trans_handle *trans)
>>>>> struct btrfs_fs_info *fs_info =rans->fs_info;
>>>>> struct btrfs_root *root =s_info->extent_root;
>>>>>
>>>>> - while(extent_root_pending_ops(fs_info)) {
>>>>> - ret =inish_current_insert(trans);
>>>>> - if (ret)
>>>>> - return ret;
>>>>> - ret =el_pending_extents(trans);
>>>>> - if (ret)
>>>>> - return ret;
>>>>> - }
>>>>> + ret =trfs_run_delayed_refs(trans, -1);
>>>>> + if (ret)
>>>>> + return ret;
>>>>>
>>>>> while(1) {
>>>>> cache =trfs_lookup_first_block_group(fs_info, start);
>>>>> @@ -4026,7 +4030,7 @@ static int __btrfs_record_file_extent(struct
>>>>> btrfs_trans_handle *trans,
>>>>> } else if (ret !=EEXIST) {
>>>>> goto fail;
>>>>> }
>>>>> - btrfs_extent_post_op(trans);
>>>>> + btrfs_run_delayed_refs(trans, -1);
>>>>> extent_bytenr =isk_bytenr;
>>>>> extent_num_bytes =um_bytes;
>>>>> extent_offset =;
>>>>> diff --git a/transaction.c b/transaction.c
>>>>> index 96d9891b0d1c..bfda769210ee 100644
>>>>> --- a/transaction.c
>>>>> +++ b/transaction.c
>>>>> @@ -61,7 +61,6 @@ static int update_cowonly_root(struct
>>>>> btrfs_trans_handle *trans,
>>>>> u64 old_root_bytenr;
>>>>> struct btrfs_root *tree_root =oot->fs_info->tree_root;
>>>>>
>>>>> - btrfs_write_dirty_block_groups(trans);
>>>>> while(1) {
>>>>> old_root_bytenr =trfs_root_bytenr(&root->root_item);
>>>>> if (old_root_bytenr =root->node->start)
>>>>> @@ -98,6 +97,17 @@ int commit_tree_roots(struct btrfs_trans_handle *trans,
>>>>> if (ret)
>>>>> return ret;
>>>>>
>>>>> + /*
>>>>> + * If the above CoW is the first one to dirty the current tree_root,
>>>>> + * delayed refs for it won't be run until after this function has
>>>>> + * finished executing, meaning we won't process the extent tree root,
>>>>> + * which will have been added to ->dirty_cowonly_roots. So run
>>>>> + * delayed refs here as well.
>>>>> + */
>>>>> + ret =trfs_run_delayed_refs(trans, -1);
>>>>> + if (ret)
>>>>> + return ret;
>>>>> +
>>>>> while(!list_empty(&fs_info->dirty_cowonly_roots)) {
>>>>> next =s_info->dirty_cowonly_roots.next;
>>>>> list_del_init(next);
>>>>> @@ -147,6 +157,12 @@ int btrfs_commit_transaction(struct
>>>>> btrfs_trans_handle *trans,
>>>>>
>>>>> if (trans->fs_info->transaction_aborted)
>>>>> return -EROFS;
>>>>> + /*
>>>>> + * Flush all accumulated delayed refs so that root-tree updates are
>>>>> + * consistent
>>>>> + */
>>>>> + ret =trfs_run_delayed_refs(trans, -1);
>>>>> + BUG_ON(ret);
>>>>>
>>>>> if (root->commit_root =root->node)
>>>>> goto commit_tree;
>>>>> @@ -164,11 +180,18 @@ int btrfs_commit_transaction(struct
>>>>> btrfs_trans_handle *trans,
>>>>> ret =trfs_update_root(trans, root->fs_info->tree_root,
>>>>> &root->root_key, &root->root_item);
>>>>> BUG_ON(ret);
>>>>> +
>>>>> commit_tree:
>>>>> ret =ommit_tree_roots(trans, fs_info);
>>>>> BUG_ON(ret);
>>>>> - ret =_commit_transaction(trans, root);
>>>>> + /*
>>>>> + * Ensure that all comitted roots are properly accounted in the
>>>>> + * extent tree
>>>>> + */
>>>>> + ret =trfs_run_delayed_refs(trans, -1);
>>>>> BUG_ON(ret);
>>>>> + btrfs_write_dirty_block_groups(trans);
>>>>> + __commit_transaction(trans, root);
>>>>> write_ctree_super(trans);
>>>>> btrfs_finish_extent_commit(trans, fs_info->extent_root,
>>>>> &fs_info->pinned_extents);
>>>>>
>>>>
>>